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Novel Proteins and Nucleic Acids Encoding Same 



FIELD OF THE INVENTION 

5 The present invention relates to novel polypeptides that are targets of small 

molecule drugs and that have properties related to stimulation of biochemical or 
physiological responses in a ceil, a tissue, an organ or an organism. More particularly, the 
novel polypeptides are gene products of novel genes, or are specified biologically active 
fragments or derivatives thereof. Methods of use encompass diagnostic and prognostic 
10 assay procedures as well as methods of treating diverse pathological conditions. 
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BACKGROUND 

Eukaryotic cells are characterized by biochemical and physiological processes 
which under normal conditions are exquisitely balanced to achieve the preservation and 
propagation of the cells. When such cells are components of multicellular organisms such 
5 as vertebrates, or more particularly organisms such as mammals, the regulation of the 
biochemical and 

physiological processes involves intricate signaling pathways. Frequently, such signaling 
pathways are constituted of extracellular signaling proteins, cellular receptors that bind the 
signaling proteins and signal transducing components located within the cells. 

1 0 Signaling proteins may be classified as endocrine effectors, paracrine effectors or 

autocrine effectors. Endocrine effectors are signaling molecules secreted by a given organ 
into the circulatory system, which are then transported to a distant target organ or tissue. 
The target cells include the receptors for the endocrine effector, and when the endocrine 
effector binds, a signaling cascade is induced. Paracrine effectors involve secreting cells 

15 and receptor cells in close proximity to each other, for example two different classes of 
cells in the same tissue or organ. One class of cells secretes the paracrine effector, which 
then reaches the second class of cells, for example by diffusion through the extracellular 
fluid. The second class of cells contains the receptors for the paracrine effector; binding 
of the effector results in induction of the signaling cascade that elicits the corresponding 

20 biochemical or physiological effect. Autocrine effectors are highly analogous to paracrine 
effectors, except that the same cell type that secretes the autocrine effector also contains 
the receptor. Thus the autocrine effector binds to receptors on the same cell, or on 
identical neighboring cells. The binding process then elicits the characteristic biochemical 
or physiological effect. 

25 Signaling processes may elicit a variety of effects on cells and tissues including by 

way of nonlimiting example induction of cell or tissue proliferation, suppression of growth 
or proliferation, induction of differentiation or maturation of a cell or tissue, and 
suppression of differentiation or maturation of a cell or tissue. 

Many pathological conditions involve dysregulation of expression of important 

30 effector proteins. In certain classes of pathologies the dysregulation is manifested as 
diminished or suppressed level of synthesis and secretion protein effectors. In a clinical 
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setting a subject may he suspected of suffering from a condition brought on by diminished 
or suppressed levels of a protein effector of interest. Therefore there is a need to be able to 
assay for the level of the protein effector of interest in a biological sample from such a 
subject, and to compare the level with that characteristic of a nonpathological condition. 
5 There further is a need to provide the protein effector as a product of manufacture. 

Administration of the effector to a subject in need thereof is useful in treatment of the 
^ pathological condition, or the protein effector deficiency or suppression may be favorably 
acted upon by the administration of another small molecule drug product. Accordingly, 
there is a need for a method of treatment of a pathological condition brought on by a 

10 diminished or suppressed levels of the protein effector of interest. 

Small molecule targets have been implicated in various disease states or 
pathologies. These targets may be proteins, and particularly enzymatic proteins, which are 
acted upon by small molecule drugs for the purpose of altering target function and 
achieving a desired result. Cellular, animal and clinical studies can be performed to 

15 elucidate the genetic contribution to the etiology and pathogenesis of conditions in which 
small molecule targets are implicated in a variety of physiologic, pharmacologic or native 
states. These studies utilize the core technologies at CuraGen Corporation to look at 
differential gene expression, protein-protein interactions, large-scale sequencing of 
expressed genes and the association of genetic variations such as, but not limited to, single 

20 nucleotide polymorphisms (SNPs) or splice variants in and between biological samples 
from experimental and control groups. The goal of such studies is to identify potential 
avenues for therapeutic intervention in order to prevent, treat the consequences or cure the 
conditions. 

In order to treat diseases, pathologies and other abnormal states or conditions in 
25 which a mammalian organism has been diagnosed as being, or as being at risk for 
becoming, other than in a normal state or condition, it is important to identify new 
therapeutic agents. Such a procedure includes at least the steps of identifying a target 
component within an affected tissue or organ, and identifying a candidate therapeutic 
agent that modulates the functional attributes of the target. The target component may be 
30 any biological macromolecule implicated in the disease or pathology. Commonly the 
target is a polypeptide or protein with specific functional attributes. Other classes of 
macromolecule may be a nucleic acid, a polysaccharide, a lipid such as a complex lipid or 
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a glycolipid; in addition a target may be a sub-cellular structure or extra-cellular structure 
that is comprised of more than one of these classes of macromolecule. Once such a target 
has been identified, it may be employed in a screening assay in order to identify favorable 
candidate therapeutic agents from among a large population of substances or compounds. 
5 In many cases the objective of such screening assays is to identify small molecule 

candidates; this is commonly approached by the use of combinatorial methodologies to 
develop the population of substances to be tested. The implementation of high throughput 
screening methodologies is advantageous when working with large, combinatorial 
libraries of compounds. 

10 SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of nucleic acid sequences 
encoding novel polypeptides. These nucleic acids and polypeptides, as well as derivatives, 
homologs, analogs and fragments thereof, will hereinafter be collectively designated as 
"NOVX" nucleic acid, which represents the nucleotide sequence selected from the group 

15 consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101, or 

polypeptide sequences, which represents the group consisting of SEQ ID NO: 2n, wherein 
n is an integer between 1 and 101. 

In one aspect, the invention provides an isolated polypeptide comprising a mature 
form of a NOVX amino acid. One example is a variant of a mature form of a NOVX 

20 amino acid sequence, wherein any amino acid in the mature form is changed to a different 
amino acid, provided that no more than 15% of the amino acid residues in the sequence of 
the mature form are so changed. The amino acid can be, for example, a NOVX amino 
acid sequence or a variant of a NOVX amino acid sequence, wherein any amino acid 
specified in the chosen sequence is changed to a different amino acid, provided that no 

25 more than 15% of the amino acid residues in the sequence are so changed. The invention 
also includes fragments of any of these. In another aspect, the invention also ^includes an 
isolated nucleic acid that encodes a NOVX polypeptide, or a fragment, homolog, analog or 
derivative thereof. 

Also included in the invention is a NOVX polypeptide that is a naturally occurring 
30 allelic variant of a NOVX sequence. In one embodiment, the allelic variant includes an 
amino acid sequence that is the translation of a nucleic acid sequence differing by a single 
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nucleotide from a NOVX nucleic acid sequence. In another embodiment, the NOVX 
polypeptide is a variant polypeptide described therein, wherein any amino acid specified in 
the chosen sequence is changed to provide a conservative substitution. In one 
embodiment, the invention discloses a method for determining the presence or amount of 

5 the NOVX polypeptide in a sample. The method involves the steps of: providing a 
sample; introducing the sample to an antibody that binds immunospecifically to the 
polypeptide; and determining the presence or amount of antibody bound to the NOVX 
polypeptide, thereby determining the presence or amount of the NOVX polypeptide in the 
sample. In another embodiment, the invention provides a method for determining the 

1 0 presence of or predisposition to a disease associated with altered levels of a NOVX 

polypeptide in a mammalian subject. This method involves the steps of: measuring the 
level of expression of the polypeptide in a sample from the first mammalian subject; and 
comparing the amount of the polypeptide in the sample of the first step to the amount of 
the polypeptide present in a control sample from a second mammalian subject known not 

15 to have, or not to be predisposed to, the disease, wherein an alteration in the expression 
level of the polypeptide in the first subject as compared to the control sample indicates the 
presence of or predisposition to the disease. 

In a further embodiment, the invention includes a method of identifying an agent 
that binds to a NOVX polypeptide. This method involves the steps of: introducing the 

20 polypeptide to the agent; and determining whether the agent binds to the polypeptide. In 
various embodiments, the agent is a cellular receptor or a downstream effector. 

In another aspect, the invention provides a method for identifying a potential 
therapeutic agent for use in treatment of a pathology, wherein the pathology is related to 
aberrant expression or aberrant physiological interactions of a NOVX polypeptide. The 

25 method involves the steps of: providing a cell expressing the NOVX polypeptide and 
having a property or function ascribable to the polypeptide; contacting the cell with a 
composition comprising a candidate substance; and determining whether the substance 
alters the property or function ascribable to the polypeptide; whereby, if an alteration 
observed in the presence of the substance is not observed when the cell is contacted with a 

30 composition devoid of the substance, the substance is identified as a potential therapeutic 
agent. In another aspect, the invention describes a method for screening for a modulator 
of activity or of latency or predisposition to a pathology associated with the NOVX 
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polypeptide. This method involves the following steps: administering a test compound to 
a test animal at increased risk for a pathology associated with the NOVX polypeptide* 
wherein the test animal recombinantiy expresses the NOVX polypeptide. This method 
involves the steps of measuring the activity of the NOVX polypeptide in the test animal 
5 after administering the compound of step; and comparing the activity of the protein in the 
test animal with the activity of the NOVX polypeptide in a control animal not 
administered the polypeptide, wherein a change in the activity of the NOVX polypeptide 
in the test animal relative to the control animal indicates the test compound is a modulator 
of latency of, or predisposition to, a pathology associated with the NOVX polypeptide. In 

10 one embodiment, the test animal is a recombinant test animal that expresses a test protein 
transgene or expresses the transgene under the control of a promoter at an increased level 
relative to a wild-type test animal, and wherein the promoter is not the native gene 
promoter of the transgene. In another aspect, the invention includes a method for 
modulating the activity of the NOVX polypeptide, the method comprising introducing a 

1 5 cell sample expressing the NOVX polypeptide with a compound that binds to the 
polypeptide in an amount sufficient to modulate the activity of the polypeptide. 

The invention also includes an isolated nucleic acid that encodes a NOVX 
polypeptide, or a fragment, homolog, analog or derivative thereof. In a preferred 
embodiment, the nucleic acid molecule comprises the nucleotide sequence of a naturally 

20 occurring allelic nucleic acid variant. In another embodiment, the nucleic acid encodes a 
variant polypeptide, wherein the variant polypeptide has the polypeptide sequence of a . 
naturally occurring polypeptide variant. In another embodiment, the nucleic acid molecule 
differs by a single nucleotide from a NOVX nucleic acid sequence. In one embodiment, 
the NOVX nucleic acid molecule hybridizes under stringent conditions to the nucleotide 

25 sequence selected from the group consisting of SEQ ID NO: 2n-l, wherein n is an integer 
between 1 and 1 01, or a complement of the nucleotide sequence. In another aspect, the 
invention provides a vector or a cell expressing a NOVX nucleotide sequence. 

In one embodiment, the invention discloses a method for modulating the activity of 
a NOVX polypeptide. The method includes the steps of: introducing a cell sample 

30 expressing the NOVX polypeptide with a compound that binds to the polypeptide in an 
amount sufficient to modulate the activity of the polypeptide. In another embodiment, the 
invention includes an isolated NOVX nucleic acid molecule comprising a nucleic acid 
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sequence encoding a polypeptide comprising a NOVX amino acid sequence or a variant of 
a mature form of the NOVX amino acid sequence, wherein any amino acid in the mature 
form of the chosen sequence is changed to a different amino acid, provided that no more 
than 15% of the amino acid residues in the sequence of the mature form are so changed. 

5 In another embodiment, the invention includes an amino acid sequence that is a variant of 
the NOVX amino acid sequence, in which any amino acid specified in the chosen 
sequence is changed to a different amino acid, provided that no more than 15% of the 
amino acid residues in the sequence are so changed. 

In one embodiment, the invention discloses a NOVX nucleic acid fragment 

1 0 encoding at least a portion of a NOVX polypeptide or any variant of the polypeptide, 
wherein any amino acid of the chosen sequence is changed to a different amino acid, 
provided that no more than 10% of the amino acid residues in the sequence are so 
changed. In another embodiment, the invention includes the complement of any of the 
NOVX nucleic acid molecules or a naturally occurring allelic nucleic acid variant. In 

1 5 another embodiment, the invention discloses a NOVX nucleic acid molecule that encodes 
a variant polypeptide, wherein the variant polypeptide has the polypeptide sequence of a 
naturally occurring polypeptide variant. In another embodiment, the invention discloses a 
NOVX nucleic acid, wherein the nucleic acid molecule differs by a single nucleotide from 
a NOVX nucleic acid sequence. 

20 In another aspect, the invention includes a NOVX nucleic acid, wherein one or 

more nucleotides in the NOVX nucleotide sequence is changed to a different nucleotide 
provided that no more than 15% of the nucleotides are so changed. In one embodiment, 
the invention discloses a nucleic acid fragment of the NOVX nucleotide sequence and a 
nucleic acid fragment wherein one or more nucleotides in the NOVX nucleotide sequence 

25 is changed from that selected from the group consisting of the chosen sequence to a 

different nucleotide provided that no more than 15% of the nucleotides are so changed. In 
another embodiment, the invention includes a nucleic acid molecule wherein the nucleic 
acid molecule hybridizes under stringent conditions to a NOVX nucleotide sequence or a 
complement of the NOVX nucleotide sequence. In one embodiment, the invention 

30 includes a nucleic acid molecule, wherein the sequence is changed such that no more than 
15% of the nucleotides in the coding sequence differ from the NOVX nucleotide sequence 
or a fragment thereof 
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In a further aspect, the invention includes a method for determining the presence 
or amount of the NOVX nucleic acid in a sample. The method involves the steps of: 
providing the sample; introducing the sample to a probe that binds to the nucleic acid 
molecule; and determining the presence or amount of the probe bound to the NOVX 
5 nucleic acid molecule, thereby determining the presence or amount of the NOVX nucleic 
acid molecule in the sample. In one embodiment, the presence or amount of the nucleic 
acid molecule is used as a marker for cell or tissue type. 

In another aspect, the invention discloses a method for determining the presence of 
or predisposition to a disease associated with altered levels of the NOVX nucleic acid 

10 molecule of in a first mammalian subject. The method involves the steps of: measuring 
the amount of NOVX nucleic acid in a sample from the first mammalian subject; and 
comparing the amount of the nucleic acid in the sample of step (a) to the amount of 
NOVX nucleic acid present in a control sample from a second mammalian subject known 
not to have or not be predisposed to, the disease; wherein an alteration in the level of the 

15 nucleic acid in the first subject as compared to the control sample indicates the presence of 
or predisposition to the disease. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Although methods and materials similar or equivalent to those 

20 described herein can be used in the practice or testing of the present invention, suitable 
methods and materials are described below. All publications, patent applications, patents, 
and other references mentioned herein are incorporated by reference in their entirety. In 
the case of conflict, the present specification, including definitions, will control. In 
addition, the materials, methods, and examples are illustrative only and not intended to be 

25 limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides novel nucleotides and polypeptides encoded 
30 thereby. Included in the invention are the novel nucleic acid sequences, their encoded 
polypeptides, antibodies, and other related compounds. The sequences are collectively 
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referred to herein as "NOVX nucleic acids" or "NOVX polynucleotides" and the 
corresponding encoded polypeptides are referred to as "NOVX polypeptides" or "NOVX 
proteins." Unless indicated otherwise, "NOVX" is meant to refer to any of the novel 
sequences disclosed herein. Table 1 provides a summary of the NOVX nucleic acids and 
5 their encoded polypeptides. 



TABLE 1. Sequences and Corresponding SEQ ID Numbers 



NOVX 
Assignment 


Internal 
Identification 


SEQ ID 

NO 
(nucleic 

acid) 


SEQ ID NO 
(polypeptide) 


Homology 


la 


CGI 00 126-01 


1 


2 


KERATIN ASSOCIATED 
PROTEIN 4.9 


2a 


CG100146-01 


3 


4 


UDP-Glucuronosyltransferase 
2B15 Precursor like homo 
sapiens 


3a 


CGI 00 179-01 


5 


6 


cyclophilin A like homo sapiens 


4a 


CGI 002 12-01 


7 


8 


Zinc-containing alcohol 
dehydrogenase like homo 
sapiens 


4b 


CG100212-02 


9 


10 


Zinc-containing alcohol 
dehydrogenase like homo 
sapiens 


5a 


CGI 00222-01 


11 


12 


NADP-Dependent Leukotriene 
B4 12-Hydroxydehydrogenase 
like homo sapiens 


6a 


CGI 00266-0 1 


13 


14 


cyclophilin A like homo sapiens 


6b 


CGI 00266-02 


15 


16 


cyclophilin A like homo sapiens 


7a 


CG100427-01 


17 


18 


cyclophilin A like homo sapiens 


8a 


CG 100456-01 


19 


20 


CoA Transferase like homo 
sapiens 


9a 


CG 100466-01 


21 


22 


Novel Adenine Nucleotide 
Translocator 2 like homo sapiens 


10a 


CGI 00609-01 


23 


24 


GLUTATHIONE S- 
TRANSFERASE THETA Hike 
homo sapiens 


11a 


CG10063I-01 


25 


26 


Clathrin Light Chain A like 
homo sapiens 


lib 


CGI 0063 1-02 


27 


28 


Clathrin Light Chain A like 
homo sapiens 


11c 


CG 10063 1-04 


29 


30 


Clathrin Light Chain A like 
homo sapiens 


12a 


CG100710-01 


31 


32 


AAA ( ATPase Associated with 
various Activities) like homo 
sapiens 


13a 


CG10Q730-O1 


33 


34 


exoribonuclease like homo 
sapiens 


14a 


CG 100819-01 


35 


36 


POLYNUCLEOTIDE 
PHOSPHORYLASE like homo 
sapiens 


15a 


CG100872-01 


37 


38 


PROTEIN-ARGININE 
DEIMINASE TYPE V like 
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homo sapiens 


16a 


CG100980-01 


39 


40 


PROTEIN-ARGININE 
DEIMINASE TYPE III like 
homo sapiens 


17a 


CGI 72805-01 


41 


42 


Peroxisomal Enoyl-CoA 
Hydratase like homo sapiens 


18a 


CG56763-01 


43 


44 


GPCR like homo sapiens 


19a 


CG56777-01 


. 45 


46 


Prostaglandin-F synthase 1 like 
homo sapiens 


19b 


CG56777-02 


47 


48 


Prostaglandin-F synthase I like 
homo sapiens 


20a 


CG56941-01 


49 


50 


Ribonuclease H type II like 
homo sapiens 


21a 


CG57 109-01 


51 


52 


Doublecortin/CAM kinase like 
homo sapiens 


21b 


CG57 109-02 


53 


54 


Doublecortin/CAM kinase like 
homo sapiens (also been filed as 
Cura691A) 


21c 


CG57 109-03 


55 


56 


Doublecortin/CAM kinase like 
homo sapiens 


21d 


CG5 7 109-04 


57 


58 


Doublecortin/CAM kinase like 
homo sapiens 


21e 


CG57 109-05 


59 


60 


Doublecortin/CAM kinase like 
homo sapiens 


2\f 


CG57 109-06 


61 


62 


Double Cortin 


22a 


CG57366-01 


63 


64 


KIAA1223 like homo sapiens 


23a 


CG57368-01 


65 


66 


Adenylate cyclase type IV like 
homo sapiens 


23b 


CG57368-02 


67 


68 




24a 


CG59955-01 


69 


70 


GPCR like homo sapiens 


25a 


CG892 11-01 


71 


72 


GPCR like homo sapiens 


26a 


CG90530-02 


73 


74 


UBIQUITIN-CONJUGATING 
ENZYME ISOLOG like homo 
sapiens 


27a 


CG93076-01 


75 


76 


GPCR like homo sapiens 


28a 


CG94235-01 


77 


78 


Thymidylate Kinase like homo 
sapiens 


28b 


CG94235-02 


79 


80 


Thymidylate Kinase like homo 
sapiens 


29a 


CG94692-01 


81 


82 


CARNITINE/ACYLCARNITIN 
E TRANSLOCASE like homo 
sapiens 


29b 


CG94692-02 


83 


84 


CARNITINE/ACYLCARNITIN 
E TRANSLOCASE like homo 
sapiens 


30a 


CG94724-01 


85 


86 


CARNITINE/ACYLCARNITIN 
E TRANSLOCASE like homo 
sapiens 


31a 


CG94871-01 


87 


88 


Josephin MJD1 like homo 
sapiens 


31b 


CG94871-02 


89 


90 


Josephin MJD1 like homo 
sapiens 


31c 


CG94871-04 


91 


92 


Josephin MJD1 like homo 
sapiens 


31d 


CG94871-05 


93 


94 


Josephin MJD1 like homo 
sapiens 


32a 


CG94946-01 


95 


96 


Agrin Precursor 
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OIL 

32b 


CG94946-02 


97 


98 


Agrin Precursor 


32c 


CG94946-03 


99 


100 


Agrin Pre6ursor 


32d 


CG94946-04 


101 


102 


Agrin Precursor 


32e 


CG94946-05 


103 


104 


Agrin Precursor 


32f 


CG94946-06 


105 


106 


Agrin Precursor 




CG94946-07 


107 


108 


Agrin Precursor 


33a 


CG95 165-01 


109 


1 10 


ADENYLATE CYCLASE, 
TYPE II like homo sapiens 


34a 


CG95 175-01 


1 1 1 


1 12 


Ephrine type A Receptor like 
homo sapiens 


35a 


CG95693-01 


113 


1 14 


' DPY-19 PROTEIN Hike homo 

sapiens 1 


36a 


CG958 14-01 


115 


116 


RHO-GTPASE-ACTIVATING 
PROTEIN 4 like homo sapiens 


37a 


CG95824-01 


117 


118 


Rho GAP like homo sapiens 


38a 


CG96198-01 


119 


120 ' 


Gonadotropin-releasing hormone 
receptor 


38b 


CG96 198-02 


121 


122 


Gonadotropin-releasing hormone 
receptor 


39a 


CG9623I-01 


123 


124 


OTU-iike cysteine protease like 
homo sapiens 


39b 


CG96231-02 


125 


126 


OTU-like cysteine protease like 
homo sapiens 


40a 


CG96260-01 


127 


128 


phospholipase A2 like homo 
sapiens 


41a 


CG96364-01 


129 


130 


ADP/ATP TRANSLOCASE 2 
like homo sapiens 


41b 


CG963 64-03 


131 


1 Ji 


ADP/ATP TRANSLOCASE 2 
like homo sapiens 


42a 


CG96422-01 


133 


l j" 


ADP/ATP translocase 3 like 
homo sapiens 


43a 


CG96442-01 


135 


136 


Acyl CoA-domain protein like, 
homo sapiens 


44a 


CG96501-01 


137 


138 


BRAIN MITOCHONDRIAL 
CARRIER PROTEIN- 1 like 
homo sapiens 


45a 


CG96557-01 


139 


140 


L-ALLO-THREONINE 
ALDOLASE 


46a 


CG96581-0I 


141 


142 


RP42 


46b 


CG96581-02 


143 


144 


RP42 


47a 


CG96624-01 


145 


146 


Putative seven pass 
transmembrane protein 


47b 


CG96624-02 


147 


148 


Putative seven pass 
transmembrane protein 


48a 


CG96747-01 


149 


150 [ 


VOLTAGE-DEPENDENT 
CALCIUM CHANNEL 
GAMMA-3 SUBUNIT 


49a 


CG96789-01 


151 


152 


GLYCINE CLEAVAGE 
SYSTEM PROTEIN H 


jua 


CG97253-01 


153 


154 


Galectin like homo sapiens 


51a 


CG97400-OI 


155 


156 


Myotubularin-related protein 6 
like homo sapiens 


52a 


CG97462-01 


157 


158 


Prohibitin like homo sapiens 


53a 


CG97472-01 


159 


160 


Glucose Transporter like homo 
sapiens 


54a 


CG97528-01 


161 


162 


Guanylate Binding Protein like 
homo sapiens 
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55a 


CG97629-01 


163 


164 


CELL DIVISION PROTEIN 
KINASE 7 like homo sapiens 




CG97648-0 1 


165 


166 


G PROTEIN-COUPLED 
RECEPTOR KINASE GRK7 
like homo sapiens 


57a 


CG97658-01 


167 


168 


PROTEIN-TYROSINE 
PHOSPHATASE, NON- 
RECEPTOR TYPE 5 like homo 
sapiens 


58a 


CG97842-0I 


169 


170 


protein kinase-formin like homo 
sapiens 


59a 


CG98021-0I 


171 


172 


Synaptotagmin III like homo 
sapiens 


60a 


CG98030-01 


173 


174 


Protein-tyrosine phosphatase like 
homo sapiens 


60b 


CG98030-02 


175 


176 




61a 


CG9806I-0! 


177 


178 


Histidine acid phosphatase 
domain like homo sapiens 


61b 


CG98061-02 


179 


180 


Novel Protein containing 
Histidine acid phosphatase 
domain-like Proteins and Nucleic 
Acids Encoding Same 


62a 


CG98071-01 


181 


182 


Histidine acid phosphatase 
domain like homo sapiens 


63a 


CG98131-01 


183 


184 


MDJ6 like homo sapiens 


64a 


CG98164-0I 


185 


186 


LRR and Kinase domain like 
homo sapiens 


64b 


CG98 164-02 


187 


188 


LRR and Kinase domain like 
homo sapiens 


65a 


CG99588-01 


189 


190 


transmembrane protein like 
homo sapiens 


66a 


CG996 18-01 


191 


192 


PROTEIN-TYROSINE 

PWO^PMATACP or* 
r nv/orri fx i Aot jx^ 


67a 


CG99832-01 


193 


194 


gene containing NUDIX 
hydrolase domain like homo 
sapiens 


68a 


CG99842-01 


195 


396 


Tensin 


69a 


CG99944-01 


197 


198 


SUGAR ABC TRANSPORTER 


70a 


CG99963-01 


199 


200 


cyclophilin 18 like homo sapiens 


70b 


CG99963-02 


201 


202 


cyclophilin 18 like homo sapiens 



Table 1 indicates homology of NOVX nucleic acids to known protein families. 
Thus, the nucleic acids and polypeptides, antibodies and related compounds according to 
the invention corresponding to a NOVX as identified in column 1 of Table 1 will be useful 
in therapeutic and diagnostic applications implicated in, for example, pathologies and 
disorders associated with the known protein families identified in column 5 of Table 1. 

NOVX nucleic acids and their encoded polypeptides are useful in a variety of 
applications and contexts. The various NOVX nucleic acids and polypeptides according 
to the invention are useful as novel members of the protein families according to the 
presence of domains and sequence relatedness to previously described proteins. 
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Additionally, NOVX nucleic acids and polypeptides can also be used to identify proteins 
that are members of the family to which the NOVX polypeptides belong. 

Consistent with other known members of the family of proteins, identified in 
column 5 of Table 1, the NOVX polypeptides of the present invention show homology to, 
5 and contain domains that are characteristic of, other members of such protein 

families. Details of the sequence relatedness and domain analysis for each NOVX are 
presented in Example A. 

The NOVX nucleic acids and polypeptides can also be used to screen for 
molecules, which inhibit or enhance NOVX activity or function. Specifically, the nucleic 

10 acids and polypeptides according to the invention may be used as targets for the 

identification of small molecules that modulate or inhibit diseases associated with the 
protein families listed in Table 1 . 

The NOVX nucleic acids and polypeptides are also useful for detecting specific 
cell types. Details of the expression analysis for each NOVX are presented in Example C. 

1 5 Accordingly, the NOVX nucleic acids, polypeptides, antibodies and related compounds 
according to the invention will have diagnostic and therapeutic applications in the 
detection of a variety of diseases with differential expression in normal vs. diseased 
tissues, e.g. a variety of cancers. 

Additional utilities for NOVX nucleic acids and polypeptides according to the 

20 invention are disclosed herein. 

The present invention is based on the identification of biological macromolecules 
differentially modulated in a pathologic state, disease, or an abnormal condition or state. 
Among the pathologies or diseases of present interest include metabolic diseases including 
those related to endocrinologic disorders, cancers, various tumors and neoplasias, r 

25 inflammatory disorders, central nervous system disorders, and similar abnormal conditions 
or states. In very significant embodiments of the present invention, the biological 
macromolecules implicated in the pathologies and conditions are proteins and 
polypeptides, and in such cases the present invention is related as well to the nucleic acids 
that encode them. Methods that may be employed to identify relevant biological 

30 macromolecules include any procedures that detect differential expression of nucleic acids 
encoding proteins and polypeptides associated with the disorder, as well as procedures that 
detect the respective proteins and polypeptides themselves. Significant methods that have 
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been employed by the present inventors, include GeneCalling ® technology and 
SeqCalling TM technology, disclosed respectively, in U. S. Patent No. 5,871,697, and in 
U. S. Ser. No. 09/417,386, filed Oct. 13, 1999, each of which is incorporated herein by 
reference in its entirety. GeneCalling ® is also described in Shimkets, et al., "Gene 
5 expression analysis by transcript profiling coupled to a gene database query" Nature 
Biotechnology 17:198-803 (1999). 

The invention provides polypeptides and nucleotides encoded thereby that have 
been identified as having novel associations with a disease or pathology, or an abnormal 
state or condition, in a mammal. The present invention further identifies a set of proteins 

10 and polypeptides, including naturally occurring polypeptides, precursor forms or 

proproteins, or mature forms of the polypeptides or proteins, which are implicated as 
targets for therapeutic agents in the treatment of various diseases, pathologies, abnormal 
states and conditions. A target may be employed in any of a variety of screening 
methodologies in order to identify candidate therapeutic agents which interact with the 

1 5 target and in so doing exert a desired or favorable effect. The candidate therapeutic agent 
is identified by screening a large collection of substances or compounds in an important 
embodiment of the invention. Such a collection may comprise a combinatorial library of 
substances or compounds in which, in at least one subset of substances or compounds, the 
individual members are related to each other by simple structural variations based on a 

20 particular canonical or basic chemical structure. The variations may include, by way of 
nonlimiting example, changes in length or identity of a basic framework of bonded atoms; 
changes in number, composition and disposition of ringed structures, bridge structures, 
alicyclic rings, and aromatic rings; and changes in pendent or substituents atoms or groups 
that are bonded at particular positions to the basic framework of bonded atoms or to the 

25 ringed structures, the bridge structures, the alicyclic structures, or the aromatic structures. 
A polypeptide or protein described herein, and that serves as a target in the 
screening procedure, includes the product of a naturally occurring polypeptide or 
precursor form or proprotein. The naturally occurring polypeptide, precursor or proprotein 
includes, e.g., the full-length gene product, encoded by the corresponding gene. The 

30 naturally occurring polypeptide also includes the polypeptide, precursor or proprotein 

encoded by an open reading frame described herein. A "mature" form of a polypeptide or 
protein arises as a result of one or more naturally occurring processing steps as they may 
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occur within the cell, including a host cell. The processing steps occur as the gene product 
arises, e.g., via cleavage of the amino-terminal methionine residue encoded by the 
initiation codon of an open reading frame, or the proteolytic cleavage of a signal peptide 
or leader sequence. Thus, a mature form arising from a precursor polypeptide or protein 
5 that has residues 1 to N, where residue 1 is the N-terminal methionine, would have 
residues 2 through N remaining. Alternatively, a mature form arising from a precursor 
polypeptide or protein having residues 1 to N, in which an amino-terminal signal sequence 
from residue 1 to residue M is cleaved, includes the residues from residue M+l to residue 
N remaining. A "mature" form of a polypeptide or protein may also arise from non- 
proteolytic post-translational modification. Such non-proteolytic processes include, e.g., 
glycosylation, myristylation or phosphorylation. In general, a mature polypeptide or 
protein may result from the operation of only one of these processes, or the combination of 
any of them. 

As used herein, "identical" residues correspond to those residues in a comparison 
between two sequences where the equivalent nucleotide base or amino acid residue in an 
alignment of two sequences is the same residue. Residues are alternatively described as 
"similar" or "positive" when the comparisons between two sequences in an alignment 
show that residues in an equivalent position in a comparison are either the same amino 
acid or a conserved amino acid as defined below. 

As used herein, a "chemical composition" relates to a composition including at 
least one compound that is either synthesized or extracted from a natural source. A 
chemical compound may be the product of a defined synthetic procedure. Such a 
synthesized compound is understood herein to have defined properties in terms of 
molecular formula, molecular structure relating the association of bonded atoms to each 
other, physical properties such as chromatographic or spectroscopic characterizations, and 
the like. A compound extracted from a natural source is advantageously analyzed by 
chemical and physical methods in order to provide a representation of its defined 
properties, including its molecular formula, molecular structure relating the association of 
bonded atoms to each other, physical properties such as chromatographic or spectroscopic 
characterizations, and the like. 

As used herein, a "candidate therapeutic agent" is a chemical compound that 
includes at least one substance shown to bind to a target biopolymer. In important 
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embodiments of the invention, the target biopolymer is a protein or polypeptide, a nucleic 
acid, a polysaccharide or proteoglycan, or a lipid such as a complex lipid. The method of 
identifying compounds that bind to the target effectively eliminates compounds with little 
or no binding affinity, thereby increasing the potential that the identified chemical 

5 compound may have beneficial therapeutic applications. In cases where the "candidate 
therapeutic agent" is a mixture of more than one chemical compound, subsequent 
screening procedures may be carried out to identify the particular substance in the mixture 
that is the binding compound, and that is to be identified as a candidate therapeutic agent. 
As used herein, a "pharmaceutical agent" is provided by screening a candidate 

1 0 therapeutic agent using models for a disease state or pathology in order to identify a 

candidate exerting a desired or beneficial therapeutic effect with relation to the disease or 
pathology. Such a candidate that successfully provides such an effect is termed a 
pharmaceutical agent herein. Nonlimiting examples of model systems that may be used in 
such screens include particular cell lines, cultured cells, tissue preparations, whole tissues, 

15 organ preparations, intact organs, and nonhuman mammals. Screens employing at least 
one system, and preferably more than one system, may be employed in order to identify a 
pharmaceutical agent. Any pharmaceutical agent so identified may be pursued in further 
investigation using human subjects. 

NOVX Nucleic Acids and Polypeptides 

20 NOVX clones 

NOVX nucleic acids and their encoded polypeptides are useful in a variety of 
applications and contexts. The various NOVX nucleic acids and polypeptides according 
to the invention are useful as novel members of the protein families according to the 
presence of domains and sequence reiatedness to previously described proteins. 

25 Additionally, NOVX nucleic acids and polypeptides can also be used to identify proteins 
that are members of the family to which the NOVX polypeptides belong. 

The NOVX genes and their corresponding encoded proteins are useful for 
preventing, treating or ameliorating medical conditions, e.g., by protein or gene therapy. 
Pathological conditions can be diagnosed by determining the amount of the new protein in 

30 a sample or by determining the presence of mutations in the new genes. Specific uses are 
described for each of the NOVX genes, based on the tissues in which they are most highly 
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expressed. Uses include developing products for the diagnosis or treatment of a variety of 
diseases and disorders. 

The NOVX nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications and as a research tool. These include serving as a 
5 specific or selective nucleic acid or protein diagnostic and/or prognostic marker, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a 
small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), (iv) a nucleic acid useful in gene therapy (gene 

10 delivery/gene ablation), and (v) a composition promoting tissue regeneration in vitro and 
in vivo (vi) biological defense weapon. 

In one specific embodiment, the invention includes an isolated polypeptide 
comprising an amino acid sequence selected from the group consisting of: (a) a mature 
form of the amino acid sequence selected from the group consisting of SEQ ID NO: 2n, 

1 5 wherein n is an integer between 1 and 10 1 ; (b) a variant of a mature form of the amino 
acid sequence selected from the group consisting of SEQ ID NO: 2n, wherein n is an 
integer between 1 and 101, wherein any amino acid in the mature form is changed to a 
different amino acid, provided that no more than 15% of the amino acid residues in the 
sequence of the mature form are so changed; (c) an amino acid sequence selected from the 

20 group consisting of SEQ ID NO: 2n, wherein n is an integer between 1 and 101; (d) a 

variant of the amino acid sequence selected from the group consisting of SEQ ID NO:2n, 
wherein n is an integer between 1 and 101 wherein any amino acid specified in the chosen 
sequence is changed to a different amino acid, provided that no more than 15% of the 
amino acid residues in the sequence are so changed; and (e) a fragment of any of (a) 

25 through (d). 

In another specific embodiment, the invention includes an isolated nucleic acid 
molecule comprising a nucleic acid sequence encoding a polypeptide comprising an amino 
acid sequence selected from the group consisting of: (a) a mature form of the amino acid 
sequence given SEQ ID NO: 2n, wherein n is an integer between 1 and 101; (b) a variant 
30 of a mature form of the amino acid sequence selected from the group consisting of SEQ 
ID NO: 2n, wherein n is an integer between 1 and 101 wherein any amino acid in the 
mature form of the chosen sequence is changed to a different amino acid, provided that no 
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more than 15% of the amino acid residues in the sequence of the mature form are so 
changed; (c) the amino acid sequence selected from the group consisting of SEQ ID NO: 
2n, wherein n is an integer between 1 and 101; (d) a variant of the amino acid sequence 
selected from the group consisting of SEQ ID NO: 2n, wherein n is an integer between 1 
5 and 101, in which any amino acid specified in the chosen sequence is changed to a 
different amino acid, provided that no more than 15% of the amino acid residues in the 
sequence are so changed; (e) a nucleic acid fragment encoding at least a portion of a 
polypeptide comprising the amino acid sequence selected from the group consisting of 
SEQ ID NO: 2n, wherein n is an integer between 1 and 101 or any variant of said 

1 0 polypeptide wherein any amino acid of the chosen sequence is changed to a different 
amino acid, provided that no more than 10% of the amino acid residues in the sequence 
are so changed; and (f) the complement of any of said nucleic acid molecules. 

In yet another specific embodiment, the invention includes an isolated nucleic acid 
molecule, wherein said nucleic acid molecule comprises a nucleotide sequence selected 

1 5 from the group consisting of: (a) the nucleotide sequence selected from the group 
consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101; (b) a 
nucleotide sequence wherein one or more nucleotides in the nucleotide sequence selected 
from the group consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 
101 is changed from that selected from the group consisting of the chosen sequence to a 

20 different nucleotide provided that no more than 15% of the nucleotides are so changed; (c) 
a nucleic acid fragment of the sequence selected from the group consisting of SEQ ID 
NO: 2n-l , wherein n is an integer between 1 and 101 ; and (d) a nucleic acid fragment 
wherein one or more nucleotides in the nucleotide sequence selected from the group 
consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101 is changed 

25 from that selected from the group consisting of the chosen sequence to a different 
nucleotide provided that no more than 15% of the nucleotides are so changed. 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
NOVX polypeptides or biologically active portions thereof Also included in the 
invention are nucleic acid fragments sufficient for use as hybridization probes to identify 

30 NOVX-encoding nucleic acids (e.g., NOVX mRNAs) and fragments for use as PCR 

primers for the amplification and/or mutation of NOVX nucleic acid molecules. As used 
herein, the term "nucleic acid molecule" is intended to include DNA molecules (e.g. , 
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cDNA or genomic DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA 
generated using nucleotide analogs, and derivatives, fragments and homologs thereof. The 
nucleic acid molecule may be single-stranded or double-stranded, but preferably is 
comprised double-stranded DNA. 
5 An NOVX nucleic acid can encode a mature NOVX polypeptide. As used herein, 

a "mature" form of a polypeptide or protein disclosed in the present invention is the 
product of a naturally occurring polypeptide or precursor form or proprotein. The 
naturally occurring polypeptide, precursor or proprotein includes, by way of nonlimiting 
example, the full-length gene product, encoded by the corresponding gene. Alternatively, 

10 it may be defined as the polypeptide, precursor or proprotein encoded by an ORF 
described herein. The product "mature" form arises, again by way of nonlimiting 
example, as a result of one or more naturally occurring processing steps as they may take 
place within the cell, or host cell, in. which the gene product arises. Examples of such 
processing steps leading to a "mature" form of a polypeptide or protein include the 

1 5 cleavage of the N-terminal methionine residue encoded by the initiation codon of an ORF, 
or the proteolytic cleavage of a signal peptide or leader sequence. Thus a mature form 
arising from a precursor polypeptide or protein that has residues 1 to N, where residue 1 is 
the N-terminal methionine, would have residues 2 through N remaining after removal of 
the N-terminal methionine. Alternatively, a mature form arising from a precursor 

20 polypeptide or protein having residues 1 to N, in which an N-terminal signal sequence 
from residue 1 to residue M is cleaved, would have the residues from residue M+l to 
residue N remaining. Further as used herein, a "mature" form of a polypeptide or protein 
may arise from a step of post-translational modification other than a proteolytic cleavage 
event. Such additional processes include, by way of non-limiting example, glycosylation, 

25 myristoylation or phosphorylation. In general, a mature polypeptide or protein may result 
from the operation of only one of these processes, or a combination of any of them. 

The term "probes", as utilized herein, refers to nucleic acid sequences of variable 
length, preferably between at least about 10 nucleotides (nt), 100 nt, or as many as 
approximately, e.g., 6,000 nt, depending upon the specific use. Probes are used in the 

30 detection of identical, similar, or complementary nucleic acid sequences. Longer length 
probes are generally obtained from a natural oi; recombinant source, are highly specific, 
and much slower to hybridize than shorter-length oligomer probes. Probes may be single- 
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or double-stranded and designed to have specificity in PCR, membrane-based 
hybridization technologies, or ELISA-like technologies. 

The term "isolated" nucleic acid molecule, as utilized herein, is one, which is 
separated from other nucleic acid molecules which are present in the natural source of the 
5 nucleic acid. Preferably, an "isolated" nucleic acid is free of sequences which naturally 
flank the nucleic acid (/.e., sequences located at the 5 - and 3 -termini of the nucleic acid) 
in the genomic DNA of the organism from which the nucleic acid is derived. For 
example, in various embodiments, the isolated NOVX nucleic acid molecules can contain 
less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which 

10 naturally flank the nucleic acid molecule in genomic DNA of the cell/tissue from which 
the nucleic acid is derived (e.g., brain, heart, liver, spleen, etc.). Moreover, an "isolated" 
nucleic acid molecule, such as a cDNA molecule, can be substantially free of other 
cellular material or culture medium when produced by recombinant techniques, or of 
chemical precursors or other chemicals when chemically synthesized. 

15 A nucleic acid molecule of the invention, e.g. , a nucleic acid molecule having the 

nucleotide sequence SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101, or a 
complement of this aforementioned nucleotide sequence, can be isolated using standard 
molecular biology techniques and the sequence information provided herein. Using all or 
a portion of the nucleic acid sequence of SEQ ID NO: 2n-l, wherein n is an integer 

20 between 1 and 101 as a hybridization probe, NOVX molecules can be isolated using 
standard hybridization and cloning techniques (e.g., as described in Sambrook, et al, 
(eds.), Molecular Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989; and Ausubel, et al, (eds.), Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993.) 

25 A nucleic acid of the invention can be amplified using cDNA, mRNA or 

alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can be 
cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to NOVX nucleotide sequences can be 

30 prepared by standard synthetic techniques, e.g. , using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 
residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a 

20 
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PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a 
genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 
Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, 
or 100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of the 
invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in length 
would further comprise at least 6 contiguous nucleotides SEQ ID NO: 2n-i, wherein n is 
an integer between 1 and 101, or a complement thereof Oligonucleotides may be 
chemically synthesized and may also be used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule that is a complement of the nucleotide from the group 
consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101, or a portion of 
this nucleotide sequence (e.g., a fragment that can be used as a probe or primer or a 
fragment encoding a biologically-active portion of an NOVX polypeptide). A nucleic acid 
molecule that is complementary to. the nucleotide sequence from the group consisting of 
SEQ ID NO: 2n- 1 , wherein n is an integer between 1 and 10 1 is one that is sufficiently 
complementary to the nucleotide sequence from the group consisting of SEQ ID NO: 2n- 
1, wherein n is an integer between 1 and 101 that it can hydrogen bond with little or no 
mismatches to the nucleotide sequence from the group consisting of SEQ ID NO: 2n-l, 
wherein n is an integer between 1 and 101, thereby forming a stable duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen 
base pairing between nucleotides units of a nucleic acid molecule, and the term "binding" 
means the physical or chemical interaction between two polypeptides or compounds or 
associated polypeptides or compounds or combinations thereof. Binding includes ionic, 
non-ionic, van der Waals, hydrophobic interactions, and the like. A physical interaction 
can be either direct or indirect. Indirect interactions may be through or due to the effects 
of another polypeptide or compound. Direct binding refers to interactions that do not take 
place through, or due to, the effect of another polypeptide or compound, but instead are 
without other substantial chemical intermediates. 

Fragments provided herein are defined as sequences of at least 6 (contiguous) 
nucleic acids or at least 4 (contiguous) amino acids, a length sufficient to allow for 
specific hybridization in the case of nucleic acids or for specific recognition of an epitope 
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in the case of amino acids, respectively, and are at most some portion less than a full 
length sequence. Fragments may be derived from any contiguous portion of a nucleic acid 
or amino acid sequence of choice. Derivatives are nucleic acid sequences or amino acid 
sequences formed from the native compounds either directly or by modification or partial 
5 substitution. Analogs are nucleic acid sequences or amino acid sequences that have a 
structure similar to, but not identical to, the native compound but differs from it in respect 
to certain components or side chains. Analogs may be synthetic or from a different 
evolutionary origin and may have a similar or opposite metabolic activity compared to 
wild type. Homologs are nucleic acid sequences or amino acid sequences of a particular 

1 0 gene that are derived from different species. 

A full-length NOVX clone is identified as containing an ATG translation start 
codon and an in-frame stop codon. Any disclosed NOVX nucleotide sequence lacking an 
ATG start codon therefore encodes a truncated C-terminal fragment of the respective 
NOVX polypeptide, and requires that the corresponding full-length cDNA extend in the 5' 

1 5 direction of the disclosed sequence. Any disclosed NOVX nucleotide sequence lacking an 
in-frame stop codon similarly encodes a truncated N- terminal fragment of the respective 
NOVX polypeptide, and requires that the corresponding full-length cDNA extend in the 3* 
direction of the disclosed sequence. 

Derivatives and analogs may be full length or other than full length, if the 

20 derivative or analog contains a modified nucleic acid or amino acid, as described below. 
Derivatives or analogs of the nucleic acids or proteins of the invention include, but are not 
limited to, molecules comprising regions that are substantially homologous to the nucleic 
acids or proteins of the invention, in various embodiments, by at least about 70%, 80%, or 
95% identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid 

25 sequence of identical size or when compared to an aligned sequence in which the 

alignment is done by a computer homology program known in the art, or whose encoding 
nucleic acid is capable of hybridizing to the complement of a sequence encoding the 
aforementioned proteins under stringent, moderately stringent, or low stringent conditions. 
See e.g. Ausubel, et aL, Current Protocols in Molecular Biology, John Wiley & 

30 Sons, New York, NY, 1 993, and below. 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level 
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or amino acid level as discussed above. Homologous nucleotide sequences encode those 
sequences coding for isoforms of NOVX polypeptides. Isoforms can be expressed in 
different tissues of the same organism as a result of, for example, alternative splicing of 
RNA. Alternatively, isoforms can be encoded by different genes. In the invention, 
5 homologous nucleotide sequences include nucleotide sequences encoding for an NOVX 
polypeptide of species other than humans, including, but not limited to: vertebrates, and 
thus can include, e.g., frog, mouse, rat, rabbit, dog, cat cow, horse, and other organisms. 
Homologous nucleotide sequences also include, but are not limited to, naturally occurring 
allelic variations and mutations of the nucleotide sequences set forth herein. A 

1 0 homologous nucleotide sequence does not, however, include the exact nucleotide 

sequence encoding human NOVX protein. Homologous nucleic acid sequences include 
those nucleic acid sequences that encode conservative amino acid substitutions (see 
below) in SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101, as well as a 
polypeptide possessing NOVX biological activity. Various biological activities of the 

1 5 NOVX proteins are described below. 

An NOVX polypeptide is encoded by the open reading frame ("ORF") of an 
NOVX nucleic acid. An ORJF corresponds to a nucleotide sequence that could potentially 
be translated into a polypeptide. A stretch of nucleic acids comprising an ORF is 
uninterrupted by a stop codon. An ORF that represents the coding sequence for a Ml 

20 protein begins with an ATG "start" codon and terminates with one of the three "stop" 

codons, namely, TAA, TAG, or TGA. For the purposes of this invention, an ORF may be 
any part of a coding sequence, with or without a start codon, a stop codon, or both. For an 
ORF to be considered as a good candidate for coding for a bona fide cellular protein, a 
minimum size requirement is often set, e.g., a stretch of DNA that would encode a protein 

25 of 50 amino acids or more. 

The nucleotide sequences determined from the cloning of the human NOVX genes 
allows for the generation of probes and primers designed for use in identifying and/or 
cloning NOVX homologues in other cell types, e.g. from other tissues, as well as NOVX 
homologues from other vertebrates. The probe/primer typically comprises substantially 

30 purified oligonucleotide. The oligonucleotide typically comprises a region of nucleotide 
sequence that hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 
200, 250, 300, 350 or 400 consecutive sense strand nucleotide sequence SEQ ID NO: 2n- 
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1, wherein n is an integer between 1 and 101 ; or an anti-sense strand nucleotide sequence 
of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101. 

Probes based on the human NOVX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 
5 embodiments, the probe further comprises a label group attached thereto, e.g. the label 
group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. 
Such probes can be used as a part of a diagnostic test kit for identifying cells or tissues 
which mis-express an NOVX protein, such as by measuring a level of an NOVX-encoding 
nucleic acid in a sample of cells from a subject e.g., detecting NOVX mRNA levels or 

1 0 determining whether a genomic NOVX gene has been mutated or deleted. 

"A polypeptide having a biologically-active portion of an NOVX polypeptide" 
refers to polypeptides exhibiting activity similar, but not necessarily identical to, an 
activity of a polypeptide of the invention, including mature forms, as measured in a 
particular biological assay, with or without dose dependency. A nucleic acid fragment 

15 encoding a "biologically-active portion of NOVX" can be prepared by isolating a portion 
SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101, that encodes a polypeptide 
having an NOVX biological activity (the biological activities of the NOVX proteins are 
described below), expressing the encoded portion of NOVX protein (e.g., by recombinant 
expression in vitro) and assessing the activity of the encoded portion of NOVX. 

20 NOVX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences shown in SEQ ID NO: 2n-l, wherein n is an integer between 1 and 
101 due to degeneracy of the genetic code and thus encode the same NOVX proteins as 
that encoded by the nucleotide sequences shown in SEQ ID NO: 2n-l, wherein n is an 

25 integer between 1 and 101 . In another embodiment, an isolated nucleic acid molecule of 
the invention has a nucleotide sequence encoding a protein having an amino acid sequence 
shown in SEQ ID NO: 2n, wherein n is an integer between 1 and 101 . 

In addition to the human NOVX nucleotide sequences shown in SEQ ID NO: 2n-l, 
wherein n is an integer between 1 and 101, it will be appreciated by those skilled in the art 

30 that DNA sequence polymorphisms that lead to changes in the amino acid sequences of 
the NOVX polypeptides may exist within a population (e.g., the human population). Such 
genetic polymorphism in the NOVX genes may exist among individuals within a 
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population due to natural allelic variation. As used herein, the terms "gene" and 
"recombinant gene" refer to nucleic acid molecules comprising an open reading frame 
(ORF) encoding an NOVX protein, preferably a vertebrate NO VX protein. Such natural 
allelic variations can typically result in 1-5% variance in the nucleotide sequence of the 
5 NOVX genes. Any and all such nucleotide variations and resulting amino acid 

polymorphisms in the NOVX polypeptides, which are the result of natural allelic variation 
and that do not alter the functional activity of the NOVX polypeptides, are intended to be 
within the scope of the invention. 

Moreover, nucleic acid molecules encoding NOVX proteins from other species, 

10 and thus that have a nucleotide sequence that differs from the human SEQ ID NO: 2n-l, 
wherein n is an integer between 1 and 101 are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to natural allelic variants and 
homologues of the NOVX cDNAs of the invention can be isolated based on their 
homology to the human NOVX nucleic acids disclosed herein using the human cDNAs, or 

15 a portion thereof, as a hybridization probe according to standard hybridization techniques 
under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 2n-l, wherein n 

20 is an integer between 1 and 101. In another embodiment, the nucleic acid is at least 10, 
25, 50, 100, 250, 500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet 
another embodiment, an isolated nucleic acid molecule of the invention hybridizes to the 
coding region. As used herein, the term "hybridizes under stringent conditions" is 
intended to describe conditions for hybridization and washing under which nucleotide 

25 sequences at least 60% homologous to each other typically remain hybridized to each 
other. 

Homologs (i.e., nucleic acids encoding NOVX proteins derived from species other 
than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate 
or high stringency hybridization with all or a portion of the particular human sequence as a 
30 probe using methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions 
under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to 
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no other sequences. Stringent conditions are sequence-dependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperatures 
than shorter sequences. Generally, stringent conditions are selected to be about 5 °C lower 
than the thermal melting point (Tm) for the specific sequence at a defined ionic strength 
5 and pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid 
concentration) at which 50% of the probes complementary to the target sequence 
hybridize to the target sequence at equilibrium. Since the target sequences are generally 
present at excess, at Tm, 50% of the probes are occupied at equilibrium. Typically, 
stringent conditions will be those in which the salt concentration is less than about 1 .0 M 

1 0 sodium ion, typically about 0.01 to 1 .0 M sodium ion (or other salts) at 

pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or 
oligonucleotides {e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers 
and oligonucleotides. Stringent conditions may also be achieved with the addition of 
destabilizing agents, such as formamide. 

1 5 Stringent conditions are known to those skilled in the art and can be found in 

Ausubel, et aL, (eds.), Current Protocols in Molecular Biology, John Wiley & 
Sons, N.Y. (1989). 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least 
about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically 
remain hybridized to each other. A non-limiting example of stringent hybridization 

20 conditions are hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl 
(pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured 
salmon sperm DNA at 65°C, followed by one or more washes in 0.2X SSC, 0.01% BSA at 
50°C. An isolated nucleic acid molecule of the invention that hybridizes under stringent 
conditions to the sequences SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101, 

25 corresponds to a naturally-occurring nucleic acid molecule. As used herein, a 

"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a 
nucleotide sequence that occurs in nature {e.g., encodes a natural protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic 
acid molecule comprising the nucleotide sequence of SEQ ID NO: 2n-l, wherein n is an 

30 integer between 1 and 1 0 1 , or fragments, analogs or derivatives thereof, under conditions 
of moderate stringency is provided. A non-limiting example of moderate stringency 
hybridization conditions are hybridization in 6X SSC, 5X Denhardt's solution, 0.5% SDS 



WO 03/010327 



PCT/US02/14199 



and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more washes in 
IX SSC, 0A% SDS at 37°C. Other conditions of moderate stringency that may be used 
are well-known within the art. See, e.g., Ausubel, et al. (eds.), 1993, CURRENT PROTOCOLS 
in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990; Gene Transfer 
5 and Expression, A Laboratory Manual, Stockton Press, NY. 

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid 
molecule comprising the nucleotide sequences SEQ ID NO: 2n-l, wherein n is an integer 
between 1 and 101, or fragments, analogs or derivatives thereof, under conditions of low 
stringency, is provided, A non-limiting example of low stringency hybridization 

10 conditions are hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm 
DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 
25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low 
stringency that may be used are well known in the art (e.g., as employed for cross-species 

15 hybridizations). See, e.g., Ausubel, et al (eds.), 1993, Current Protocols in 

Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY; Shilo and Weinberg, 1981. 
Proc Natl Acad Sci USA 78: 6789-6792. 

Conservative Mutations 

20 In addition to naturally-occurring allelic variants of NOVX sequences that may 

exist in the population, the skilled artisan will further appreciate that changes can be 
introduced by mutation into the nucleotide sequences SEQ ID NO: 2n-l, wherein n is an 
integer between 1 and 101, thereby leading to changes in the amino acid sequences of the 
encoded NOVX proteins, without altering the functional ability of said NOVX proteins. 

25 For example, nucleotide substitutions leading to amino acid substitutions at 

"non-essential" amino acid residues can be made in the sequence SEQ ID NO: 2n, wherein 
n is an integer between 1 and 101 . A "non-essential" amino acid residue is a residue that 
can be altered from the wild-type sequences of the NOVX proteins without altering their 
biological activity, whereas an "essential" amino acid residue is required for such 

30 biological activity. For example, amino acid residues that are conserved among the 
NOVX proteins of the invention are predicted to be particularly non-amenable to 
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alteration. Amino acids for which conservative substitutions can be made are well-known 
within the art. 

Another aspect of the invention pertains to nucleic acid molecules encoding 
NOVX proteins that contain changes in amino acid residues that are not essential for 
5 activity. Such NOVX proteins differ in amino acid sequence from SEQ ID NO: 2n-l , 
wherein n is an integer between 1 and 101 yet retain biological activity. In one 
embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding 
a protein, wherein the protein comprises an amino acid sequence at least about 45% 
homologous to the amino acid sequences SEQ ID NO: 2n, wherein n is an integer between 

10 1 and 101. Preferably, the protein encoded by the nucleic acid molecule is at least about 
60% homologous to SEQ ID NO: 2n, wherein n is an integer between 1 and 101; more 
preferably at least about 70% homologous SEQ ID NO: 2n, wherein n is an integer 
between 1 and 101; still more preferably at least about 80% homologous to SEQ ID NO: 
2n, wherein n is an integer between 1 and 101; even more preferably at least about 90% 

1 5 homologous to SEQ ID NO: 2n, wherein n is an integer between 1 and 101; and most 
preferably at least about 95% homologous to SEQ ID NO: 2n, wherein n is an integer 
between 1 and 101. 

An isolated nucleic acid molecule encoding an NOVX protein homologous to the 
protein of SEQ ID NO: 2n, wherein n is an integer between 1 and 101 can be created by 
20 introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 
sequence of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101, such that one 
or more amino acid substitutions, additions or deletions are introduced into the encoded 
protein. 

Mutations can be introduced into SEQ ID NO: 2n-l, wherein n is an integer 
25 between 1 and 101 standard techniques, such as site-directed mutagenesis and 

PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at 
one or more predicted, non-essential amino acid residues. A "conservative amino acid 
substitution" is one in which the amino acid residue is replaced with an amino acid residue 
having a similar side chain. Families of amino acid residues having similar side chains 
30 have been defined within the art. These families include amino acids with basic side 
chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic 
acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, 
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tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, 
isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). 
Thus, a predicted non-essential amino acid residue in the NOVX protein is replaced with 
5 another amino acid residue from the same side chain family. Alternatively, in another 
embodiment, mutations can be introduced randomly along all or part of an NOVX coding 
sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for 
NOVX biological activity to identify mutants that retain activity. Following mutagenesis 
" SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101, the encoded protein can be 
10 expressed by any recombinant technology known in the art and the activity of the protein 
can be determined. 

The relatedness of amino acid families may also be determined based on side chain 
interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
conserved "weak" residues. The "strong" group of conserved amino acid residues may be 

15 any one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, 
HY, FYW, wherein the single letter amino acid codes are grouped by those amino acids 
that may be substituted for each other. Likewise, the "weak" group of conserved residues 
may be any one of the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, 
NDEQHK, NEQHRK, HF Y, wherein the letters within each group represent the single 

20 letter amino acid code. 

In one embodiment, a mutant NOVX protein can be assayed for (/) the ability to 
form proteimprotein interactions with other NOVX proteins, other cell-surface proteins, or 
biologically-active portions thereof, (//) complex formation between a mutant NOVX 
protein and an NOVX ligand; or (///) the ability of a mutant NOVX protein to bind to an 

25 intracellular target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant NOVX protein can be assayed for the ability 
to regulate a specific biological function (e.g., regulation of insulin release). 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid 
30 molecules that are hybridizable to or complementary to the nucleic acid molecule 

comprising the nucleotide sequence of SEQ ID NO: 2n-l, wherein n is an integer between 
1 and 101, or fragments, analogs or derivatives thereof. An "antisense" nucleic acid 
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comprises a nucleotide sequence that is complementary to a "sense" nucleic acid encoding 
a protein (e.g., complementary to the coding strand of a double-stranded cDNA molecule 
or complementary to an mRNA sequence). In specific aspects, antisense nucleic acid 
molecules are provided that comprise a sequence complementary to at least about 10, 25, 
50, 100, 250 or 500 nucleotides or an entire NOVX coding strand, or to only a portion 
thereof. Nucleic acid molecules encoding fragments, homologs, derivatives and analogs 
of an NOVX protein of SEQ ID NO: 2n, wherein n is an integer between 1 and 101, or 
antisense nucleic acids complementary to an NOVX nucleic acid sequence of SEQ ID NO: 
2n-l, wherein n is an integer between 1 and 1 01, are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an NOVX protein. The 
term "coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues. In another embodiment, the antisense 
nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a 
nucleotide sequence encoding the NOVX protein. The term "noncoding region" refers to 
5' and 3' sequences which flank the coding region that are not translated into amino acids 
(i.e. , also referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding the NOVX protein disclosed herein, 
antisense nucleic acids of the invention can be designed according to the rules of Watson 
and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 
complementary to the entire coding region of NOVX mRNA, but more preferably is an 
oligonucleotide that is antisense to only a portion of the coding or noncoding region of 
NOVX mRNA. For example, the antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of NOVX mRNA. An antisense 
oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 
nucleotides in length. An antisense nucleic acid of the invention can be constructed using 
chemical synthesis or enzymatic ligation reactions using procedures known in the art. For 
example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically 
synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and sense nucleic acids (e.g., 
phosphorothioate derivatives and acridine substituted nucleotides can be used). 
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Examples of modified nucleotides that can be used to generate the antisense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracii, 
5 dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1 - methyl guanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 

2- methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 
5-methylaminornethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 

10 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 

pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 

15 using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered to 
20 a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding an NOVX protein to thereby inhibit expression of the 
protein (e.g., by inhibiting transcription and/or translation). The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific 
25 interactions in the major groove of the double helix. An example of a route of 

administration of antisense nucleic acid molecules of the invention includes direct 
injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified 
to target selected cells and then administered systemically. For example, for systemic 
administration, antisense molecules can be modified such that they specifically bind to 
30 receptors or antigens expressed on a selected cell surface (e.g., by linking the antisense 
nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or 
antigens). The antisense nucleic acid molecules can also be delivered to cells using the 
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vectors described herein. To achieve sufficient nucleic acid molecules, vector constructs 
in which the antisense nucleic acid molecule is placed under the control of a strong pol II 
or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is 
5 an a-anomeric nucleic acid molecule. An tf-anomeric nucleic acid molecule forms 

specific double-stranded hybrids with complementary RNA in which, contrary to the usual 
p-units, the strands run parallel to each other. See, e.g., Gaultier, et al, 1987. Nucl Acids 
Res. 15: 6625-6641 . The antisense nucleic acid molecule can also comprise a 
2-o-methylribonucleotide (See, e.g., Inoue, et al 1987. Nucl Acids Res. 15: 6131-6148) 
10 or a chimeric RNA-DNA analogue (See, e.g., Inoue, et al, 1987. FEES Lett. 215: 
327-330. 

Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified 
bases, and nucleic acids whose sugar phosphate backbones are modified or derivatized. 

15 These modifications are carried out at least in part to enhance the chemical stability of the 
modified nucleic acid, such that they may be used, for example, as antisense binding 
nucleic acids in therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 

20 • cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 

complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in 
Haselhoff and Gerlach 1988. Nature 334: 585-591) can be used to catalytically cleave 
NOVX mRNA transcripts to thereby inhibit translation of NOVX mRNA. A ribozyme 
having specificity for an NOVX-encoding nucleic acid can be designed based upon the 

25 nucleotide sequence of an NOVX cDNA disclosed herein (/.<?., SEQ ID NO: 2n-l, wherein 
n is an integer between 1 and 101). For example, a derivative of a Tetrahymena L-19 IVS 
RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved in an NOVX-encoding mRNA. 
See, e.g., U.S. Patent 4,987,071 to Cech, et al. and U.S. Patent 5,1 16,742 to Cech, et al. 

30 NOVX mRNA can also be used to select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) Science 
261:1411-1418. 
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Alternatively, NOVX gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NOVX nucleic acid (e.g., the 
NOVX promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the NOVX gene in target cells. See, e.g., Helene, 1991. Anticancer Drug 
5 Des. 6: 569-84; Helene, et ah 1992. Ann. N.Y. Acad ScL 660: 27-36; Maher, 1992. 
Bioassays 14: 807-15. 

In various embodiments, the NOVX nucleic acids can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, 
or solubility of the molecule. For example, the deoxyribose phosphate backbone of the 
10 nucleic acids can be modified to generate peptide nucleic acids. See, e.g., Hyrup, '£/ al, 
1996. Bioorg Med Chem 4: 5-23. As used herein, the terms "peptide nucleic acids" or 
"PNAs" refer to nucleic acid mimics {e.g., DNA mimics) in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleobases are retained. The neutral backbone of PNAs has been shown to allow for 
1 5 specific hybridization to DNA and RNA under conditions of low ionic strength. The 

synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis 
protocols as described in Hyrup, et al, 1996. supra; Perry-O'Keefe, et a/., 1996. Proc. 
Natl Acad. Sci. USA 93: 14670-14675. 

PNAs of NOVX can be used in therapeutic and diagnostic applications. For 
20 example, PNAs can be used as antisense or antigene agents for sequence-specific 

modulation of gene expression by, e.g., inducing transcription or translation arrest or 
inhibiting replication. PNAs of NOVX can also be used, for example, in the analysis of 
single base pair mutations in a gene (e.g., PNA directed PCR clamping; as artificial 
restriction enzymes when used in combination with other enzymes, e.g., S\ nucleases (See, 
25 Hyrup, et aL 9 1 996.supra); or as probes or primers for DNA sequence and hybridization 
(See, Hyrup, et aL, 1996, supra; Perry-O'Keefe, et al, 1996. supra). 

In another embodiment, PNAs of NOVX can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
30 delivery known in the art. For example, PN A-DN A-chimeras of NOVX can be generated 
that may combine the advantageous properties of PNA and DNA. Such chimeras allow 
DNA recognition enzymes (e.g., RNase H and DNA polymerases) to interact with the 
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DNA portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of 
base stacking, number of bonds between the nucleobases, and orientation {see, Hyrup, et 
al., 1996. supra). The synthesis of PNA-DNA chimeras can be performed as described in 
5 Hyrup, etal, 1996. supra and Finn, etal, 1996. Nucl Acids Res 24: 3357-3363. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 
5 , -(4-methoxytrityl)amino-5 , -deoxy-thymidine phosphoramidite, can be used between the 
PNA and the 5' end of DNA. See, e.g., Mag, et all, 1989. Nucl Acid Res 17: 5973-5988. 

1 0 PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule 
with a 5' PNA segment and a 3' DNA segment. See, e.g., Finn, et al, 1996. supra. 
Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3' 
PNA segment. See. e.g., Petersen, et al, 1975. Bioorg. Med. Chem. Lett. 5: 1119-11124. 
In other embodiments, the oligonucleotide may include other appended groups 

1 5 such as peptides (e.g. , for targeting host cell receptors in vivo), or agents facilitating 

transport across the cell membrane (see, e.g., Letsinger, et al, 1989. Proc. Natl Acad. Sci. 
U.S.A. 86: 6553-6556; Lemaitre, et al, 1987. Proc. Natl Acad. Sci. 84: 648-652; PCT 
Publication No. WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. 
WO 89/10134). In addition, oligonucleotides can be modified with hybridization triggered 

20 cleavage agents (see, e.g., Krol, et a!., 1988. BioTechniques 6:958-976) or intercalating 
agents (see, e.g., Zon, 1988. Pharm. Res. 5: 539-549). To this end, the oligonucleotide 
may be conjugated to another molecule, e.g., a peptide, a hybridization triggered 
cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, and the 
like. 

25 NOVX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the 
amino acid sequence of NOVX polypeptides whose sequences are provided in SEQ ID 
NO: 2n, wherein n is an integer between 1 and 1 01 . The invention also includes a mutant 
or variant protein any of whose residues may be changed from the corresponding residues 
30 shown in SEQ ID NO: 2n, wherein n is an integer between 1 and 101 while still encoding 
a protein that maintains its NOVX activities and physiological functions, or a functional 
fragment thereof. 
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In general, an NOVX variant that preserves NOVX-like function includes any 
variant in which residues at a particular position in the sequence have been substituted by 
other amino acids, and further include the possibility of inserting an additional residue or 
residues between two residues of the parent protein as well as the possibility of deleting 
5 one or more residues from the parent sequence. Any amino acid substitution, insertion, or 
deletion is encompassed by the invention. In favorable circumstances, the substitution is a 
conservative substitution as defined above. 

One aspect of the invention pertains to isolated NOVX proteins, and biologically- 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also 

10 provided are polypeptide fragments suitable for use as immunogens to raise anti-NOVX 
antibodies. In one embodiment, native NOVX proteins can be isolated from cells or tissue 
sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, NOVX proteins are produced by recombinant DNA 
techniques. Alternative to recombinant expression, an NOVX protein or polypeptide can 

15 be synthesized chemically using standard peptide synthesis techniques. 

An "isolated 11 or "purified" polypeptide or protein or biologically-active portion 
thereof is substantially free of cellular material or other contaminating proteins from the 
cell or tissue source from which the NOVX protein is derived, or substantially free from 
chemical precursors or other chemicals when chemically synthesized. The language 

20 "substantially free of cellular material" includes preparations of NOVX proteins in which 
the protein is separated from cellular components of the cells from which it is isolated or 
recombinantly-produced. In one embodiment, the language "substantially free of cellular 
material" includes preparations of NOVX proteins having less than about 30% (by dry 
weight) of non-NOVX proteins (also referred to herein as a "contaminating protein"), 

25 more preferably less than about 20% of non-NOVX proteins, still more preferably less 
than about 10% of non-NOVX proteins, and most preferably less than about 5% of 
non-NOVX proteins. When the NOVX protein or biologically-active portion thereof is 
recombinantly-produced, it is also preferably substantially free of culture medium, z.e., 
culture medium represents less than about 20%, more preferably less than about 10%, and 

30 most preferably less than about 5% of the volume of the NOVX protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of NOVX proteins in which the protein is separated from chemical 
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precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other chemicals" 
includes preparations of NOVX proteins having less than about 30% (by dry weight) of 
chemical precursors or non-NOVX chemicals, more preferably less than about 20% 
5 chemical precursors or non-NOVX chemicals, still more preferably less than about 10% 
chemical precursors or non-NOVX chemicals, and most preferably less than about 5% 
chemical precursors or non-NOVX chemicals. 

Biologically-active portions of NOVX proteins include peptides comprising amino 
acid sequences sufficiently homologous to or derived from the amino acid sequences of 

10 the NOVX proteins (e.g., the amino acid sequence shown in SEQ ID NO: 2n, wherein n is 
an integer between 1 and 101) that include fewer amino acids than the full-length NOVX 
proteins, and exhibit at least one activity of an NOVX protein. Typically, biologically- 
active portions comprise a domain or motif with at least one activity of the NOVX protein. 
A biologically-active portion of an NOVX protein can be a polypeptide which is, for 

1 5 example, 1 0, 25, 50, 1 00 or more amino acid residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, can be prepared by recombinant techniques and evaluated for one or more of the 
functional activities of a native NOVX protein. 

In an embodiment, the NOVX protein has an amino acid sequence shown SEQ ID 

20 NO: 2n, wherein n is an integer between 1 and 101. In other embodiments, the NOVX 
protein is substantially homologous to SEQ ID NO: 2n, wherein n is an integer between 1 
and 101, and retains the functional activity of the protein of SEQ ID NO: 2n, wherein n is 
an integer between 1 and 101, yet differs in amino acid sequence due to natural allelic 
variation or mutagenesis, as described in detail, below. Accordingly, in another 

25 embodiment, the NOVX protein is a protein that comprises an amino acid sequence at 

least about 45% homologous to the amino acid sequence SEQ ID NO: 2n, wherein n is an 
integer between 1 and 101, and retains the functional activity of the NOVX proteins of 
SEQ ID NO: 2n, wherein n is an integer between 1 and 101. 

Determining Homology Between Two or More Sequences 

30 To determine the percent homology of two amino acid sequences or of two nucleic 

acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
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alignment with a second amino or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
5 molecules are homologous at that position (/.e. , as used herein amino acid or nucleic acid 
"homology" is equivalent to amino acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity between 
two sequences. The homology may be determined using computer programs known in the 
art, such as GAP software provided in the GCG program package. See, Needleman and 

10 Wunsch, 1970. JMol Biol 48: 443-453. Using GCG GAP software with the following 
settings for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP 
extension penalty of 0.3, the coding region of the analogous nucleic acid sequences 
referred to above exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 
90%, 95%, 98%, or 99%, with the CDS (encoding) part of the DNA from the group 

1 5 consisting of SEQ ID NO: 2n- 1 , wherein n is an integer between 1 and 1 0 1 . 

The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region 
of comparison. The term "percentage of sequence identity" is calculated by comparing 
two optimally aligned sequences over that region of comparison, determining the number 

20 of positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of 
nucleic acids) occurs in both sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of positions in the region of 
comparison (i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. The term "substantial identity" as used herein denotes a 

25 characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 
sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. 

Chimeric and Fusion Proteins 

30 The invention also provides NOVX chimeric or fusion proteins. As used herein, 

an NOVX "chimeric protein" or "fusion protein" comprises an NOVX polypeptide 
operatively-linked to a non-NOVX polypeptide. An "NOVX polypeptide" refers to a 
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polypeptide having an amino acid sequence corresponding to an NOVX protein SEQ ID 
NO: 2n, wherein n is an integer between 1 and 101, whereas a "non-NOVX polypeptide" 
refers to a polypeptide having an amino acid sequence corresponding to a protein that is 
not substantially homologous to the NOVX protein, e.g., a protein that is different from 
the NOVX protein and that is derived from the same or a different organism. Within an 
NOVX fusion protein the NOVX polypeptide can correspond to all or a portion of an 
NOVX protein. In one embodiment, an NOVX fusion protein comprises at least one 
biologically-active portion of an NOVX protein. In another embodiment, an NOVX 
fusion protein comprises at least two biologically-active portions of an NOVX protein. In 
yet another embodiment, an NOVX fusion protein comprises at least three biologically- 
active portions of an NOVX protein. Within the fusion protein, the term "operatively- 
linked" is intended to indicate that the NOVX polypeptide and the non-NOVX polypeptide 
are fused in-frame with one another. The non-NOVX polypeptide can be fused to the 
N-terminus or C-terminus of the NOVX polypeptide. 

In one embodiment, the fusion protein is a GST-NOVX fusion protein in which the 
NOVX sequences are fused to the C-terminus of the GST (glutathione S-transferase) 
sequences. Such fusion proteins can facilitate the purification of recombinant NOVX 
polypeptides. 

In another embodiment, the fusion protein is an NOVX protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells {e.g., mammalian host 
cells), expression and/or secretion of NOVX can be increased through use of a 
heterologous signal sequence. 

In yet another embodiment, the fusion protein is an NOVX-immunoglobulin fusion 
protein in which the NOVX sequences are fused to sequences derived from a member of 
the immunoglobulin protein family. The NOVX-immunoglobulin fusion proteins of the 
invention can be incorporated into pharmaceutical compositions and administered to a 
subject to inhibit an interaction between an NOVX ligand and an NOVX protein on the 
surface of a cell, to thereby suppress NOVX-mediated signal transduction in vivo. The 
NOVX-immunoglobulin fusion proteins can be used to affect the bioavailability of an 
NOVX cognate ligand. Inhibition of the NOVX ligand/NOVX interaction may be useful 
therapeutically for both the treatment of proliferative and differentiate disorders, as well 
as modulating (e.g. promoting or inhibiting) cell survival. Moreover, the 
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NOVX-immunoglobulin fusion proteins of the invention can be used as immunogens to 
produce anti-NOVX antibodies in a subject, to purify NOVX ligands, and in screening 
assays to identify molecules that inhibit the interaction of NOVX with an NOVX ligand. 

An NOVX chimeric or fusion protein of the invention can be produced by standard 
5 recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, 
restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends 
as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic 

10 ligation. In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification of 
gene fragments can be carried out using anchor primers that give rise to complementary 
overhangs between two consecutive gene fragments that can subsequently be annealed and 
reamplified to generate a chimeric gene sequence (see, e.g., Ausubel, et al. (eds.) 

1 5 Current Protocols in Molecular Biology, John Wiley & Sons, 1 992). Moreover, 
many expression vectors are commercially available that already encode a fusion moiety 
(e.g., a GST polypeptide). An NOVX-encoding nucleic acid can be cloned into such an 
expression vector such that the fusion moiety is linked in-frame to the NOVX protein. 

20 NOVX Agonists and Antagonists 

The invention also pertains to variants of the NOVX proteins that function as either 
NOVX agonists (z.e., mimetics) or as NOVX antagonists. Variants of the NOVX protein 
can be generated by mutagenesis (e.g., discrete point mutation or truncation of the NOVX 

25 protein). An agonist of the NOVX protein can retain substantially the same, or a subset of, 
the biological activities of the naturally occurring form of the NOVX protein. An 
antagonist of the NOVX protein can inhibit one or more of the activities of the naturally 
occurring form of the NOVX protein by, for example, competitively binding to a 
downstream or upstream member of a cellular signaling cascade which includes the 

30 NOVX protein. Thus, specific biological effects can be elicited by treatment with a 
variant of limited function. In one embodiment, treatment of a subject with a variant 
having a subset of the biological activities of the naturally occurring form of the protein 
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has fewer side effects in a subject relative to treatment with the naturally occurring form of 
the NOVX proteins. 

Variants of the NOVX proteins that function as either NOVX agonists (i.e., 
mimetics) or as NOVX antagonists can be identified by screening combinatorial libraries 
5 of mutants (e.g., truncation mutants) of the NOVX proteins for NOVX protein agonist or 
antagonist activity. In one embodiment, a variegated library of NOVX variants is 
generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a 
variegated gene library. A variegated library of NOVX variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic oligonucleotides into gene 

10 sequences such that a degenerate set of potential NOVX sequences is expressible as 

individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage 
display) containing the set of NOVX sequences therein. There are a variety of methods 
which can be used to produce libraries of potential NOVX variants from a degenerate 
oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 

15 performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
appropriate expression vector. Use of a degenerate set of genes allows for the provision, 
in one mixture, of all of the sequences encoding the desired set of potential NOVX 
sequences. Methods for synthesizing degenerate oligonucleotides are well-known within 
the art. See, e.g., Narang, 1983. Tetrahedron 39: 3; Itakura, et ah, 1984. Annu. Rev. 

20 Biochem. 53: 323; Itakura, et aU 1984. Science 198: 1056; Ike, et aU 1983. Nucl Acids 
Res. 11:477. 

Polypeptide Libraries 

25 In addition, libraries of fragments of the NOVX protein coding sequences can be 

used to generate a variegated population of NOVX fragments for screening and 
subsequent selection of variants of an NOVX protein. In one embodiment, a library of 
coding sequence fragments can be generated by treating a double stranded PCR fragment 
of an NOVX coding sequence with a nuclease under conditions wherein nicking occurs 

30 only about once per molecule, denaturing the double stranded DNA, renaturing the DNA 
to form double-stranded DNA that can include sense/antisense pairs from different nicked 
products, removing single stranded portions from reformed duplexes by treatment with Si 
nuclease, and ligating the resulting fragment library into an expression vector. By this 
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method, expression libraries can be derived which encodes N-terminal and internal 
fragments of various sizes of the NOVX proteins. 

Various techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
5 libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of 
NOVX proteins. The most widely used techniques, which are amenable to high 
throughput analysis, for screening large gene libraries typically include cloning the gene 
library into replicable expression vectors, transforming appropriate cells with the resulting 

1 0 library of vectors, and expressing the combinatorial genes under conditions in which 

detection of a desired activity facilitates isolation of the vector encoding the gene whose 
product was detected. Recursive ensemble mutagenesis (REM), a new techniique that 
enhances the frequency of functional mutants in the libraries, can be used in combination 
with the screening assays to identify NOVX variants. See, e.g., Arkin and Yourvan, 1992. 

15 Proc. Natl Acad. ScL USA 89: 781 1-7815; Delgrave, et al, 1993. Protein Engineering 
6:327-331. 

NOVX Antibodies 

The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 

20 contain an antigen binding site that specifically binds (immunoreacts with) an antigen. 
Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single 
chain, F a b, F a b* and F( a b-)2 fragments, and an F a b expression library. In general, antibody 
molecules obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, 
which differ from one another by the nature of the heavy chain present in the molecule. 

25 -Certain classes have subclasses as well, such as IgGj, IgG2, and others. Furthermore, in * 
humans, the light chain may be a kappa chain or a lambda chain. Reference herein to 
antibodies includes a reference to all such classes, subclasses and types of human antibody 
species. 

An isolated protein of the invention intended to serve as an antigen, or a portion or 
30 fragment thereof, can be used as an immunogen to generate antibodies that 

immunospecifically bind the antigen, using standard techniques for polyclonal and 
monoclonal antibody preparation. The full-length protein can be used or, alternatively; the 
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invention provides antigenic peptide fragments of the antigen for use as immunogens. An 
antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid 
sequence of the full length protein, such as an amino acid sequence shown in SEQ ID NO: 
2n, wherein n is an integer between 1 and 101 , and encompasses an epitope thereof such 
5 that an antibody raised against the peptide forms a specific immune complex with the full 
length protein or with any fragment that contains the epitope. Preferably, the antigenic 
peptide comprises at least 10 amino acid residues, or at least 15 amino acid residues, or at 
least 20 amino acid residues, or at least 30 amino acid residues. Preferred epitopes 
encompassed by the antigenic peptide are regions of the protein that are located on its 
10 surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of NOVX that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human NOVX protein sequence will 
indicate which regions of a NOVX polypeptide are particularly hydrophilic and, therefore, 

15 are likely to encode surface residues useful for targeting antibody production. As a means 
for targeting antibody production, hydropathy plots showing regions of hydrophilicity and 
hydrophobicity may be generated by any method well known in the art, including, for 
example, the Kyte Doolittle or the Hopp Woods methods, either with or without Fourier 
transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat. Acad Set USA 78: 3824- 

20 3828; Kyte and Doolittle 1982, J. Mol Biol 157: 105-142, each incorporated herein by 
reference in their entirety. Antibodies that are specific for one or more domains within an 
antigenic protein, or derivatives, fragments, analogs or homologs thereof, are also 
provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog , 
25 thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of 
polyclonal or monoclonal antibodies directed against a protein of the invention, or against 
derivatives, fragments, analogs homologs or orthologs thereof (see, for example, 
30 Antibodies: A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, incorporated herein by reference). Some of 
these antibodies are discussed below. 
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Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by one or more injections with 
5 the native protein, a synthetic variant thereof, or a derivative of the foregoing. An 

appropriate immunogenic preparation can contain, for example, the naturally occurring 
immunogenic protein, a chemically synthesized polypeptide representing the 
immunogenic protein, or a recombinantly expressed immunogenic protein. Furthermore, 
the protein may be conjugated to a second protein known to be immunogenic in the 

10 mammal being immunized. Examples of such immunogenic proteins include but are not 
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 
trypsin inhibitor. The preparation can further include an adjuvant. Various adjuvants used 
to increase the immunological response include, but are not limited to, Freund's (complete 
and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances (e.g., 

15 lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), 
adjuvants usable in humans such as Bacille Calmette-Guerin and Corynebacterium 
parvum, or similar immunostimulatory agents. Additional examples of adjuvants which 
can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). 

20 The polyclonal antibody molecules directed against the immunogenic protein can 

be isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the. specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 

25 immobilized on a column to purify the immune specific antibody by immunoaffinity 
chromatography. Purification of immunoglobulins is discussed, for example, by D. 
Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 
(April 17, 2000), pp. 25-28). 

Monoclonal Antibodies 

30 The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", 

as used herein, refers to a population of antibody molecules that contain only one 
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molecular species of antibody molecule consisting of a unique light chain gene product 
and a unique heavy chain gene product In particular, the complementarity determining 
regions (CDRs) of the monoclonal antibody are identical in all the molecules of the 
population. MAbs thus contain an antigen binding site capable of immunoreacting with a 
5 particular epitope of the antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature. 256:495 (1975). In a hybridoma method, a 
mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing 

10 antibodies that will specifically bind to the immunizing agent. Alternatively, the 
lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment 
thereof or a fusion protein thereof. Generally, either peripheral blood lymphocytes are 
used if cells of human origin are desired, or spleen cells or lymph node cells are used if 

1 5 non-human mammalian sources are desired. The lymphocytes are then fused with an 

immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, Academic 
Press, (1986) pp. 59-103]. Immortalized cell lines are usually transformed mammalian 
cells, particularly myeloma cells of rodent, bovine and human origin. Usually, rat or 

20 mouse myeloma cell lines are employed. The hybridoma cells can be cultured in a 

suitable culture medium that preferably contains one or more substances that inhibit the 
growth or survival of the unfused, immortalized cells. For example, if the parental cells 
lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin,- and 

25 thymidine ("HAT medium 11 ), which substances prevent the growth of HGPRT-deficient 
cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high 
level expression of antibody by the selected antibody-producing cells, and are sensitive to 
a medium such as HAT medium. More preferred immortalized cell lines are murine 
30 myeloma lines, which can be obtained, for instance, from the Salk Institute Cell 

Distribution Center, San Diego, California and the American Type Culture Collection, 
Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also 
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have been described for the production of human monoclonal antibodies [Kozbor, J. 
Immunol., 133:3001 (1984); Brodeur et aL, Monoclonal Antibody Production Techniques 
and Applications, Marcel Dekker, Inc., New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed 
5 for the presence of monoclonal antibodies directed against the antigen. Preferably, the 
binding specificity of monoclonal antibodies produced by the hybridoma cells is 
determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such 
techniques and assays are known in the art. The binding affinity of the monoclonal 

10 antibody can, for example, be determined by the Scatchard analysis of Munson and 
Pollard, Anal. Biochem. , 107:220 (1980). It is an objective, especially important in 
therapeutic applications of monoclonal antibodies, to identify antibodies having a high 
degree of specificity and a high binding affinity for the target antigen. 

After the desired hybridoma cells are identified, the clones can be subcloned by 

1 5 limiting dilution procedures and grown by standard methods (Goding, 1 986). Suitable 

culture media for this purpose include, for example, Dulbecco's Modified Eagle's Medium 
and RPMI-1640 medium. Alternatively, the hybridoma cells can be grown in vivo as 
ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 
20 culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

. The monoclonal antibodies can also be made by recombinant DNA methods, such 
as those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal 
25 antibodies of the invention can be readily isolated and sequenced using conventional 

procedures (e.g., by using oligonucleotide probes that are capable of binding specifically 
to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells 
of the invention serve as a preferred source of such DNA. Once isolated, the DNA can be 
placed into expression vectors, which are then transfected into host cells such as simian 
30 COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise 
produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. The DNA also can be modified, for example, by substituting the 
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coding sequence for human heavy and light chain constant domains in place of the 
homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 812-13 
(1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
5 polypeptide can be substituted for the constant domains of an antibody of the invention, or 
can be substituted for the variable domains of one antigen-combining site of an antibody 
of the invention to create a chimeric bivalent antibody. 

Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further 

10 comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against 
the administered immunoglobulin. Humanized forms of antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', 
F(ab')2 or other antigen-binding subsequences of antibodies) that are principally comprised 

15 of the sequence of a human immunoglobulin, and contain minimal sequence derived from 
a non-human immunoglobulin. Humanization can be performed following the method of 
Winter and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., 
Nature. 332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by 
substituting rodent CDRs or CDR sequences for the corresponding sequences of a human 

20 antibody. (See also U.S. Patent No. 5,225,539.) In some instances, Fv framework 
residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies can also comprise residues which are found neither in the 
recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, 

25 variable domains, in which all or substantially all of the CDR regions correspond to those 
ofa non-human immunoglobulin and all or substantially all of the framework regions are 
those ofa human immunoglobulin consensus sequence. The humanized antibody 
optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), 
typically that of a human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and 

30 Presta, Curr. Op. Struct. Biol.. 2:593-596 (1 992)). 

Human Antibodies 
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Fully human antibodies essentially relate to antibody molecules in which the entire 
sequence of both the light chain and the heavy chain, including the CDRs, arise from 
human genes. Such antibodies are termed "human antibodies", or "fully human 
antibodies" herein. Human monoclonal antibodies can be prepared by the trioma 

5 technique; the human B-cell hybridoma technique (see Kozbor, et al., 1 983 Immunol 

Today 4: 72) and the EBV hybridoma technique to produce human monoclonal antibodies 
(see Cole, et al., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the practice of the 
present invention and may be produced by using human hybridomas (see Cote, et al., 

10 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by transforming human B-cells with 
Epstein Barr Virus in vitro (see Cole, et al., 1985 In: MONOCLONAL ANTIBODIES AND 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); 

15 Marks et al, J. Mol Biol. , 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 

20 This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 
779-783 (1992)); Lonberg et al. (Nature 368 856-859 (1994)); Morrison ( Nature 368, 
812-13 (1994)); Fishwild et aL( Nature Biotechnology 14, 845-51 (1996)); Neuberger 
(Nature Biotechnology 14, 826 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 

25 13 65-93(1995)). 

Human antibodies may additionally be produced using transgenic nonhuman 
animals which are modified so as to produce fully human antibodies rather than the 
animal's endogenous antibodies in response to challenge by an antigen. (See PCT 
publication WO94/02602). The endogenous genes encoding the heavy and light 

30 immunoglobulin chains in the nonhuman host have been incapacitated, and active loci 
encoding human heavy and light chain immunoglobulins are inserted into the host's 
genome. The human genes are incorporated, for example, using yeast artificial 
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chromosomes containing the requisite human DNA segments. An animal which provides 
all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the 
5 Xenomouse™ as disclosed in PCT publications WO 96/33735 and WO 96/34096. This 
animal produces B cells which secrete fully human immunoglobulins. The antibodies can 
be obtained directly from the animal after immunization with an immunogen of interest, 
as, for example, a preparation of a polyclonal antibody, or alternatively from immortalized 
B cells derived from the animal, such as hybridomas producing monoclonal antibodies. 

10 Additionally, the genes encoding the immunoglobulins with human variable regions can 
be recovered and expressed to obtain the antibodies directly, or can be further modified to 
obtain analogs of antibodies such as, for example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. 

15 Patent No. 5,939,598. It can be obtained by a method including deleting the J segment 
genes from at least one endogenous heavy chain locus in an embryonic stem cell to 
prevent rearrangement of the locus and to prevent formation of a transcript of a rearranged 
immunoglobulin heavy chain locus, the deletion being effected by a targeting vector 
containing a gene encoding a selectable marker; and producing from the embryonic stem 

20 cell a transgenic mouse whose somatic and germ cells contain the gene encoding the 
selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is 
disclosed in U.S. Patent No. 5,916,771 . It includes introducing an expression vector that 
contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in 

25 culture, introducing an expression vector containing a nucleotide sequence encoding a 
light chain into another mammalian host cell, and fusing the two cells to form a hybrid 
cell. The hybrid cell expresses an antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically 
relevant epitope on an immunogen, and a correlative method for selecting an antibody that 

30 binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 
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F a b Fragments and Single Chain Antibodies 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. 
5 Patent No. 4,946,778). In addition, methods can be adapted for the construction of F a b 
expression libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid 
and effective identification of monoclonal F a b fragments with the desired specificity for a 
protein or derivatives, fragments, analogs or homologs thereof. Antibody fragments that 
contain the idiotypes to a protein antigen may be produced by techniques known in the art 
10 including, but not limited to: (i) an F( a b»)2 fragment produced by pepsin digestion of an 
antibody molecule; (ii) an F a b fragment generated by reducing the disulfide bridges of an 
F( a b-)2 fragment; (iii) an F a b fragment generated by the treatment of the antibody molecule 
with papain and a reducing agent and (iv) F v fragments. 

1 5 Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
that have binding specificities for at least two different antigens. In the present case, one 
of the binding specificities is for an antigenic protein of the invention. The second binding 
target is any other antigen, and advantageously is a cell-surface protein or receptor or 

20 receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305:537-539 (1 983)). Because of the random 

25 assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 

produce a potential mixture of ten different antibody molecules, of which only one has the 
correct bispecific structure. The purification of the correct molecule is usually 
accomplished by affinity chromatography steps. Similar procedures are disclosed in WO 
93/08829, published 13 May 1993, and in Traunecker et al., EMBO J.. 10:3655-3659 

30 (1991). 
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Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least 
part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain 
5 constant region (CHI ) containing the site necessary for light-chain binding present in at 
least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if 
desired, the immunoglobulin light chain, are inserted into separate expression vectors, and 
are co-transfected into a suitable host organism. For further details of generating 
bispecific antibodies see, for example, Suresh et al., Methods in Enzvmology. 121:210 
10 (1986). 

According to another approach described in WO 96/2701 1 , the interface between a 
pair of antibody molecules can be engineered to maximize the percentage of heterodimers 
which are recovered from recombinant cell culture. The preferred interface comprises at 
least a part of the CHS region of an antibody constant domain. In this method, one or 

15 more small amino acid side chains from the interface of the first antibody molecule are 
replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of 
identical or similar size to the large side chain(s) are created on the interface of the second 
antibody molecule by replacing large amino acid side chains with smaller ones (e.g. 
alanine or threonine). This provides a mechanism for increasing the yield of the 

20 heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody 
fragments (e.g. F(ab')2 bispecific antibodies). Techniques for generating bispecific 
antibodies from antibody fragments have been described in the literature. For example, 
bispecific antibodies can be prepared using chemical linkage. Brennan et aL, Science 

25 229:81 (1 985) describe a procedure wherein intact antibodies are proteolytically cleaved to 
generate F(ab') 2 fragments. These fragments are reduced in the presence of the dithiol 
complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular 
disulfide formation. The Fab' fragments generated are then converted to thionitrobenzoate 
(TNB) derivatives. One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol 

30 by reduction with mercaptoethylamine and is mixed with an equimolar amount of the 
other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies 
produced can be used as agents for the selective immobilization of enzymes. 
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Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) 
describe the production of a fully humanized bispecific antibody F(ab')2 molecule. Each 
Fab' fragment was separately secreted from E. coli and subjected to directed chemical 
5 coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well 
as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor 
targets. 

Various techniques for making and isolating bispecific antibody fragments directly 

10 from recombinant cell culture have also been described. For example, bispecific 
antibodies have been produced using leucine zippers. Kostelny et al., J. Immunol. 
148(5):1547-1553 (1992). The leucine zipper peptides from the Fos and Jun proteins were 
linked to the Fab' portions of two different antibodies by gene fusion. The antibody 
homodimers were reduced at the hinge region to form monomers and then re-oxidized to 

1 5 form the antibody heterodimers. This method can also be utilized for the production of 
antibody homodimers. The "diabody" technology described by Hollinger et al., Proc. 
Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for 
making bispecific antibody fragments. The fragments comprise a heavy-chain variable 
domain (V H ) connected to a light-chain variable domain (V L ) by a linker which is too short 

20 to allow pairing between the two domains on the same chain. Accordingly, the V H and V L 
domains of one fragment are forced to pair with the complementary V L and V H domains of 
another fragment, thereby forming two antigen-binding sites. Another strategy for making 
bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

25 Antibodies with more than two valencies are contemplated. For example, trispecific 
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of 
which originates in the protein antigen of the invention. Alternatively, an anti-antigenic 
arm of an immunoglobulin molecule can be combined with an arm which binds to a 

30 triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, 
CD28, or B7), or Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and 
FcyRIII (CD16) so as to focus cellular defense mechanisms to the cell expressing the 
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particular antigen. Bispecific antibodies can also be used to direct cytotoxic agents to cells 
which express a particular antigen. These antibodies possess an antigen-binding arm and 
an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, 
DOTA, or TETA. Another bispecific antibody of interest binds the protein antigen 
5 described herein and further binds tissue factor (TF). 

Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 

10 antibodies have, for example, been proposed to target immune system cells to unwanted 
cells (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro 
using known methods in synthetic protein chemistry, including those involving 
crosslinking agents. For example, immunotoxins can be constructed using a disulfide 

15 exchange reaction or by forming a thioether bond. Examples of suitable reagents for this 
purpose include iminothiolate and meftyl-4-mercaptobutyrimidate and those disclosed, for 
example, in U.S. Patent No. 4,676,980. 

Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector 
20 function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing 
interchain disulfide bond formation in this region. The homodimeric antibody thus 
generated can have improved internalization capability and/or increased complement- 
mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et 
25 aL J. Exp Med .. 176: 1 191-1 195 (1992) and Shopes, J. Immunol .. 148 : 2918-2922 (1992). 
Homodimeric antibodies with enhanced anti-tumor activity can also be prepared using 
heterobifunctional cross-linkers as described in Wolff et al. Cancer Research, 53: 2560- 
2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions and 
can thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et 
30 al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

Immunoconjugates 
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The invention also pertains to immunoconjugates comprising an antibody 
conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an 
enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments 
thereof), or a radioactive isotope (i.e., a radioconjugate). 

5 Chemotherapeutic agents useful in the generation of such immunoconjugates have 

been described above. Enzymatically active toxins and fragments thereof that can be used 
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A 
chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, 
alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins 

10 (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the 
tricothecenes. A variety of radionuclides are available for the production of 
radioconjugated antibodies. Examples include 2l2 Bi, l3, 1, 13l In, 90 Y, and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of 

1 5 Afunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) 

propionate (SPDP), iminothiolane (IT), Afunctional derivatives of imidoesters (such as 
dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes 
(such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)- 

20 ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis-active fluorine 
compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin 
can be prepared as described in Vitetta et al, Science, 238: 1098 (1987). Carbon-14- 
labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) 
is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 

25 WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate 
is administered to the patient, followed by removal of unbound conjugate from the 
circulation using a clearing agent and then administration of a "ligand" (e.g., avidin) that is 

30 in turn conjugated to a cytotoxic agent. 

Immunoliposomes 
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The antibodies disclosed herein can also be formulated as immunoliposomes. 
Liposomes containing the antibody are prepared by methods known in the art, such as 
described in Epstein et al., Proc. Natl. Acad. Sci. USA, 82: 3688 (1985); Hwang et al., 
Proc. Natl Acad. Sci. USA . 77: 4030 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545. 
5 Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation 
method with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG- 
derivatized phosphatidylethanolamine (PEG-PE). Liposomes are extruded through filters 
of defined pore size to yield liposomes with the desired diameter. Fab 1 fragments of the 
10 antibody of the present invention can be conjugated to the liposomes as described in 

Martin et al J. Biol. Chem. , 257 : 286-288 (1982) via a disulfide-interchange reaction. A 
chemotherapeutic agent (such as Doxorubicin) is optionally contained within the 
liposome. See Gabizon et al., J. National Cancer Inst.. 81(19): 1484 (1989). 

Diagnostic Applications of Antibodies Directed Against the Proteins of the Invention 

1 5 Antibodies directed against a protein of the invention may be used in methods 

known within the art relating to the localization and/or quantitation of the protein (e.g., for 
use in measuring levels of the protein within appropriate physiological samples, for use in 
diagnostic methods, for use in imaging the protein, and the like). In a given embodiment, 
antibodies against the proteins, or derivatives, fragments, analogs or homologs thereof, 

20 that contain the antigen binding domain, are utilized as pharmacologically-active 
compounds (see below). 

An antibody specific for a protein of the invention can be used to isolate the 
protein by standard techniques, such as immunoaffinity chromatography or 
immunoprecipitation. Such an antibody can facilitate the purification of the natural 

25 protein antigen from cells and of recombinantly produced antigen expressed in host cells. 
Moreover, such an antibody can be used to detect the antigenic protein (e.g., in a cellular 
lysate or cell supernatant) in order to evaluate the abundance and pattern of expression of 
the antigenic protein. Antibodies directed against the protein can be used diagnostically to 
monitor protein levels in tissue as part of a clinical testing procedure, e.g., to, for example, 

30 • determine the efficacy of a given treatment regimen. Detection can be facilitated by 
coupling (i.e., physically linking) the antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic groups, fluorescent materials, 
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luminescent materials, bioluminescent materials, and radioactive materials. Examples of 
suitable enzymes include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
5 umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material 
includes luminol; examples of bioluminescent materials include luciferase, luciferin, and 
aequorin, and examples of suitable radioactive material include l2 % I3! I, 3:> S or 3 H. 

Antibody Therapeutics 

10 Antibodies of the invention, including polyclonal, monoclonal, humanized and 

fully human antibodies, may used as therapeutic agents. Such agents will generally be 
employed to treat or prevent a disease or pathology in a subject. An antibody preparation, 
preferably one having high specificity and high affinity for its target antigen, is 
administered to the subject and will generally have an effect due to its binding with the 

15 target. Such an effect may be one of two kinds, depending on the specific nature of the 
interaction between the given antibody molecule and the target antigen in question. In the 
first instance, administration of the antibody may abrogate or inhibit the binding of the 
target with an endogenous ligand to which it naturally binds. In this case, the antibody 
binds to the target and masks a binding site of the naturally occurring ligand, wherein the 

20 ligand serves as an effector molecule. Thus the receptor mediates a signal transduction 
pathway for which ligand is responsible. 

Alternatively, the effect may be one in which the antibody elicits a physiological 
result by virtue of binding to an effector binding site on the target molecule. In this case 
the target, a receptor having an endogenous ligand which may be absent or defective in the 

25 disease or pathology, binds the antibody as a surrogate effector ligand, initiating a 
receptor-based signal transduction event by the receptor. 

A therapeutically effective amount of an antibody of the invention relates generally 
to the amount needed to achieve a therapeutic objective. As noted above, this may be a 
binding interaction between the antibody and its target antigen that, in certain cases, 

30 interferes with the functioning of the target, and in other cases, promotes a physiological 
response. The amount required to be administered will furthermore depend on the binding 
affinity of the antibody for its specific antigen, and will also depend on the rate at which 
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an administered antibody is depleted from the free volume other subject to which it is 
administered. Common ranges for therapeutically effective dosing of an antibody or 
antibody fragment of the invention may be, by way of nonlimiting example, from about 
0.1 mg/kg body weight to about 50 mg/kg body weight. Common dosing frequencies may 
5 range, for example, from twice daily to once a week. 

Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a protein of the invention, as well as other, 
molecules identified by the screening assays disclosed herein, can be administered for the 
treatment of various disorders in the form of pharmaceutical compositions. Principles and 

10 considerations involved in preparing such compositions, as well as guidance in the choice 
of components are provided, for example, in Remington : The Science And Practice Of 
Pharmacy 19th ed. (Alfonso R. Gennaro, et aL, editors) Mack Pub. Co., Easton, Pa. : 1995; 
Drug Absorption Enhancement : Concepts, Possibilities, Limitations, And Trends, 
Harwood Academic Publishers, Langhorne, Pa., 1994; and Peptide And Protein Drug 

15 Delivery (Advances In Parenteral Sciences, Vol. 4), 1991, M. Dekker, New York. 

If the antigenic protein is intracellular and whole antibodies are used as inhibitors, 
internalizing antibodies are preferred. However, liposomes can also be used to deliver the 
antibody, or an antibody fragment, into cells. Where antibody fragments are used, the 
smallest inhibitory fragment that specifically binds to the binding domain of the target 

20 protein is preferred. For example, based upon the variable-region sequences of an 
antibody, peptide molecules can be designed that retain the ability to bind the target 
protein sequence. Such peptides can be synthesized chemically and/or produced by 
recombinant DNA technology. See, e.g., Marasco et aL, Proc. Natl. Acad. Sci. USA, 90: 
7889-7893 ( 1 993). The formulation herein can also contain more than one active 

25 compound as necessary for the particular indication being treated, preferably those with 
complementary activities that do not adversely affect each other. Alternatively, or in 
addition, the composition can comprise an agent that enhances its function, such as, for 
example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. 
Such molecules are suitably present in combination in amounts that are effective for the 

30 purpose intended. 

The active ingredients can also be entrapped in microcapsules prepared, for 
example, by coacervation techniques or by interfacial polymerization, for example, 
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hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) 
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

5 The formulations to be used for in vivo administration must be sterile. This is 

readily accomplished by filtration through sterile filtration membranes. 

Sustained-release preparations can be prepared. Suitable examples of sustained- 
release preparations include semipermeable matrices of solid hydrophobic polymers 
containing the antibody, which matrices are in the form of shaped articles, e.g., films, or 

10 microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for 
example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (U.S. 
Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non- 
degradable ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as 
the LUPRON DEPOT ™ (injectable microspheres composed of lactic acid-glycolic acid 

15 copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While polymers 
such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for 
over 100 days, certain hydrogels release proteins for shorter time periods. 

ELISA Assay 

An agent for detecting an analyte protein is an antibody capable of binding to an 
20 analyte protein, preferably an antibody with a detectable label. Antibodies can be 

polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof 
(eg., F ab or F ( ab)2) can be used. The term "labeled", with regard to the probe or antibody, 
is intended to encompass direct labeling of the probe or antibody by coupling (i.e., 
physically linking) a detectable substance to the probe or antibody, as well as indirect 
25 labeling of the probe or antibody by reactivity with another reagent that is directly labeled. 
Examples of indirect labeling include detection of a primary antibody using a 
fluorescently-labeled secondary antibody and end-labeling of a DNA probe with biotin 
such that it can be detected with fluorescently-labeled streptavidin. The term "biological 
sample" is intended to include tissues, cells and biological fluids isolated from a subject, 
30 as well as tissues, cells and fluids present within a subject Included within the usage of 
the teim "biological sample", therefore, is blood and a fraction or component of blood 
including blood serum, blood plasma, or lymph. That is, the detection method of the 
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invention can be used to detect an analyte mRNA, protein, or genomic DNA in a 
biological sample in vitro as well as in vivo. For example, in vitro techniques for detection 
of an analyte mRNA include Northern hybridizations and in situ hybridizations. In vitro 
techniques for detection of an analyte protein include enzyme linked immunosorbent 
5 assays (ELISAs), Western blots, immunoprecipitations, and immunofluorescence. In vitro 
techniques for detection of an analyte genomic DNA include Southern hybridizations. 
Procedures for conducting immunoassays are described, for example in "ELISA: Theory 
and Practice: Methods in Molecular Biology", Vol. 42, J. R. Crowther (Ed.) Human Press, 
Totowa, NJ, 1995; "Immunoassay", E. Diamandis and T. Christopoulus, Academic Press, 

10 Inc., San Diego, CA, 1996; and "Practice and Thory of Enzyme Immunoassays", P. 

Tijssen, Elsevier Science Publishers, Amsterdam, 1985. Furthermore, in vivo techniques 
for detection of an analyte protein include introducing into a subject a labeled anti-an 
analyte protein antibody. For example, the antibody can be labeled with a radioactive 
marker whose presence and location in a subject can be detected by standard imaging 

15 techniques. 

NOVX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an NOVX protein, or derivatives, fragments, analogs or 

20 homologs thereof As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double stranded DNA loop into which 
additional DNA segments can be ligated. Another type of vector is a viral vector, wherein 
additional DNA segments can be ligated into the viral genome. Certain vectors are 

25 capable of autonomous replication in a host cell into which they are introduced (e.g. , 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host cell, and thereby are replicated along 
with the host genome. Moreover, certain vectors are capable of directing the expression of 

30 genes to which they are operatively-linked. Such vectors are referred to herein as 
"expression vectors". In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids. In the present specification, "plasmid" and 
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"vector" can be used interchangeably as the plasmid is the most commonly used form of 
vector. However, the invention is intended to include such other forms of expression 
vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and 
adeno-associated viruses), which serve equivalent functions. 

5 The recombinant expression vectors of the invention comprise a nucleic acid of the 

invention in a form suitable for expression of the nucleic acid in a host cell, which means 
that the recombinant expression vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for expression, that is operatively-linked 
to the nucleic acid sequence to be expressed. Within a recombinant expression vector, 

10 "operably-linked" is intended to mean that the nucleotide sequence of interest is linked to 
the regulatory sequence(s) in a manner that allows for expression of the nucleotide 
sequence (e.g.. in an in vitro transcription/translation system or in a host cell when the 
vector is introduced into the host cell). 

The term "regulatory sequence" is intended to includes promoters, enhancers and 

15 other expression control elements (e.g., polyadenylation signals). Such regulatory 
sequences are described, for example, in Goeddel, Gene EXPRESSION TECHNOLOGY: 
Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory 
sequences include those that direct constitutive expression of a nucleotide sequence in 
many types of host cell and those that direct expression of the nucleotide sequence only in 

20 certain host cells (e.g. , tissue-specific regulatory sequences). It will be appreciated by 

those skilled in the art that the design of the expression vector can depend on such factors 
as the choice of the host cell to be transformed, the level of expression of protein desired, 
etc. The expression vectors of the invention can be introduced into host cells to thereby 
produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic 

25 acids as described herein (e.g. , NOVX proteins, mutant forms of NOVX proteins, fusion 
proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 
expression of NOVX proteins in prokaryotic or eukaryotic cells. For example, NOVX 
proteins can be expressed in bacterial cells such as Escherichia coli, insect cells (using 
30 baculovirus expression vectors) yeast cells or mammalian cells. Suitable host cells are 
discussed further in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, Calif. (1990). Alternatively, the 
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recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in Escherichia coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
5 fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: (0 to increase expression of recombinant protein; 
(«) to increase the solubility of the recombinant protein; and (///) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 

10 , expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and their 
cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical 
fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 

15 1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 
(Pharmacia, Piscataway, N.J.) that fuse glutathione S-transferase (GST), maltose E 
binding protein, or protein A. respectively, to the target recombinant protein. 
Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amrann et al, (1988) Gene 69:301-315) and pET 1 Id (Studier et aL, GENE EXPRESSION 

20 Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 
60-89). 

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein. See, e.g., Gottesman, Gene Expression Technology: Methods in 

25 Enzymology 185, Academic Press, San Diego, Calif (1990) 1 19-128. Another strategy 
is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression 
vector so that the individual codons for each amino acid are those preferentially utilized in 
£. coli (see, e.g., Wada, et ah, 1992. Nucl Acids Res. 20: 21 1 1-21 18). Such alteration of 
nucleic acid sequences of the invention can be carried out by standard DNA synthesis 

30 techniques. 

In another embodiment, the NOVX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSecl 
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(Baldari, et aU 1987. EMBOJ. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 
30: 933-943), pJRY88 (Schultz etal, 1987. Gene 54: 1 13-123), pYES2 (Invitrogen 
Corporation, San Diego, Calif), and picZ (InVitrogen Corp, San Diego, Calif.). 
Alternatively, NOVX can be expressed in insect cells using baculovirus expression 

5 vectors. Baculovirus vectors available for expression of proteins in cultured insect cells 
(e.g., SF9 cells) include the pAc series (Smith, et al, 1983. Mol Cell Biol. 3: 2156-2165) 
and the pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 

1 0 expression vectors include pCDM8 (Seed, 1 987. Nature 329: 840) and pMT2PC 
(Kaufman, et al, 1987. EMBOJ. 6: 187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. For 
example, commonly used promoters are derived from polyoma, adenovirus 2, 
cytomegalovirus, and simian virus 40. For other suitable expression systems for both 

15 prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et al, 
Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

In another embodiment, the recombinant mammalian expression vector is capable 
of directing expression of the nucleic acid preferentially in a particular cell type (e.g., 

20 tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert, et al, 
1987. Genes Dev. 1: 268-277), lymphoid-specific promoters (Calame and Eaton, 1988. 
Adv. Immunol. 43: 235-275), in particular promoters of T cell receptors (Winoto and 

25 Baltimore, 1989. EMBOJ. 8: 729-733) and immunoglobulins (Banerji, et al, 1983. Cell 
33: 729-740; Queen and Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle, 1989. Proc. Natl Acad. ScL USA 86: 
5473-5477), pancreas-specific promoters (Edlund, et al, 1985. Science 230: 912-916), and 
mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 

30 and European Application Publication No. 264, 1 66). Developmentally-regulated 

promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. 
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Science 249: 374-379) and the a-fetoprotein promoter (Campes and Tilghman, 1989. 
Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. 
5 That is, the DNA molecule is operatively-linked to a regulatory sequence in a manner that 
allows for expression (by transcription of the DNA molecule) of an RNA molecule that is 
antisense to NOVX mRNA. Regulatory sequences operatively linked to a nucleic acid 
cloned in the antisense orientation can be chosen that direct the continuous expression of 
the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or 

10 enhancers, or regulatory sequences can be chosen that direct constitutive, tissue specific or 
cell type specific expression of antisense RNA. The antisense expression vector can be in 
the form of a recombinant plasmid, phagemid or attenuated virus in which antisense 
nucleic acids are produced under the control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into which the vector is introduced. 

15 For a discussion of the regulation of gene expression using antisense genes see, e.g., 
Weintraub, et al, "Antisense RNA as a molecular tool for genetic analysis," Reviews- 
Trends in Genetics, Vol. 1(1) 1 986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 

20 "recombinant host cell" are used interchangeably herein. It is understood that such terms 
refer not only to the particular subject cell but also to the progeny or potential progeny of 
such a cell. Because certain modifications may occur in succeeding generations due to 
either mutation or environmental influences, such progeny may not, in fact, be identical to 
the parent cell, but are still included within the scope of the term as used herein. 

25 A host cell can be any prokaryotic or eukaryotic cell. For example, NOVX protein 

can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells 
(such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are 
known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 

30 conventional transformation or transfection techniques. As used herein, the terms 

"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
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calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al (Molecular Cloning: A 
Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 
5 Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker {e.g., resistance to antibiotics) is 

1 0 generally introduced into the host cells along with the gene of interest. Various selectable 
markers include those that confer resistance to drugs, such as G418, hygromycin and 
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host cell 
on the same vector as that encoding NOVX or can be introduced on a separate vector. 
Cells stably transfected with the introduced nucleic acid can be identified by drug 

15 selection {e.g., cells that have incorporated the selectable marker gene will survive, while 
the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 
can be used to produce {i.e., express) NOVX protein. Accordingly, the invention further 
provides methods for producing NOVX protein using the host cells of the invention. In 
20 one embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector encoding NOVX protein has been introduced) in a suitable 
medium such that NOVX protein is produced. In another embodiment, the method further 
comprises isolating NOVX protein from the medium or the host cell. 

25 Transgenic NOVX Animals 

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which NOVX protein-coding sequences have been 
30 introduced. Such host cells can then be used to create non-human transgenic animals in 
which exogenous NOVX sequences have been introduced into their genome or 
homologous recombinant animals in which endogenous NOVX sequences have been 
altered. Such animals are useful for studying the function and/or activity of NOVX 
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protein and for identifying and/or evaluating modulators of NOVX protein activity. As 
used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more 
preferably a rodent such as a rat or mouse, in which one or more of the cells of the animal 
includes a transgene. Other examples of transgenic animals include non-human primates, 
5 sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that 
is integrated into the genome of a cell from which a transgenic animal develops and that 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, a "homologous recombinant animal" is a non-human animal, preferably a 

10 mammal, more preferably a mouse, in which an endogenous NOVX gene has been altered 
by homologous recombination between the endogenous gene and an exogenous DNA 
molecule introduced into a cell of the animal, e.g. , an embryonic cell of the animal, prior 
to development of the animal. 

A transgenic animal of the invention can be created by introducing 

15 NOVX-encoding nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by 
microinjection, retroviral infection) and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The human NOVX cDNA sequences SEQ ID NO: 
2n-l , wherein n is an integer between 1 and 101 can be introduced as a transgene into the 
genome of a non-human animal. Alternatively, a non-human homologue of the human 

20 NOVX gene, such as a mouse NOVX gene, can be isolated based on hybridization to the 
human NOVX cDNA (described further supra) and used as a transgene. Intronic 
sequences and polyadenylation signals can also be included in the transgene to increase 
the efficiency of expression of the transgene. A tissue-specific regulatory sequence(s) can 
be operably-linked to the NOVX transgene to direct expression of NOVX protein to 

25 particular cells. Methods for generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866; 4,870,009; and 4,873,191; and 
Hogan, 1986. In: Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N. Y. Similar methods are used for production of other 

30 transgenic animals. A transgenic founder animal can be identified based upon the 

presence of the NOVX transgene in its genome and/or expression of NOVX mRNA in 
tissues or cells of the animals. A transgenic founder animal can then be used to breed 
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additional animals carrying the transgene. Moreover, transgenic animals carrying a 
transgene-encoding NOVX protein can further be bred to other transgenic animals 
carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
5 at least a portion of an NOVX gene into which a deletion, addition or substitution has been 
introduced to thereby alter, e.g., functionally disrupt, the NOVX gene. The NOVX gene 
can be a human gene (e.g., the cDNA of SEQ ID NO: 2n-l , wherein n is an integer 
between 1 and 101), but more preferably, is a non-human homologue of a human NOVX 
gene. For example, a mouse homologue of human NOVX gene of SEQ ID NO: 2n-l, 

10 wherein n is an integer between 1 and 101 can be used to construct a homologous 
recombination vector suitable for altering an endogenous NOVX gene in the mouse 
genome. In one embodiment, the vector is designed such that, upon homologous 
recombination, the endogenous NOVX gene is functionally disrupted (i.e., no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

15 Alternatively, the vector can be designed such that, upon homologous 

recombination, the endogenous NOVX gene is mutated or otherwise altered but still 
encodes functional protein (e.g., the upstream regulatory region can be altered to thereby 
alter the expression of the endogenous NOVX protein). In the homologous recombination 
vector, the altered portion of the NOVX gene is flanked at its 5 - and 3 -termini by 

20 additional nucleic acid of the NOVX gene to allow for homologous recombination to 
occur between the exogenous NOVX gene carried by the vector and an endogenous 
NOVX gene in an embryonic stem cell. The additional flanking NOVX nucleic acid is of 
sufficient length for successful homologous recombination with the endogenous gene. 
Typically, several kilobases of flanking DNA (both at the 5'- and 3 f -termini) are included 
• 25 in the vector. See, e.g., Thomas, et al, 1987. Cell 51: 503 for a description of homologous 
recombination vectors. The vector is ten introduced into an embryonic stem cell line (e.g., 
by electroporation) and cells in which the introduced NOVX gene has homologously- 
recombined with the endogenous NOVX gene are selected. See, e.g., Li, et al, 1992. Cell 
69: 915. 

30 The selected cells are then injected into a blastocyst of an animal (e.g. , a mouse) to 

form aggregation chimeras. See, e.g., Bradley, 1987. In: Teratocarcinomas AND 
Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 
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1 13-152. A chimeric embryo can then be implanted into a suitable pseudopregnant female 
foster animal and the embryo brought to term. Progeny harboring the homologously- 
recombined DNA in their germ cells can be used to breed animals in which all cells of the 
animal contain the homologously-recombined DNA by germline transmission of the 
5 transgene. Methods for constructing homologous recombination vectors and homologous 
recombinant animals are described further in Bradley, 1991. Cw/r. Opin. Biotechnol 2: 
823-829; PCT International Publication Nos.: WO 90/1 1354; WO 91/01 140; WO 92/0968; 
and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that 
10 contain selected systems that allow for regulated expression of the transgene. One 

example of such a system is the cre/loxP recombinase system of bacteriophage PL For a 
description of the cre/loxP recombinase system, See, e.g., Lakso, et al., 1992. Proc. Natl. 
Acad Sci. USA 89: 6232-6236. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae. See t O'Gorman, et aL 9 1991. Science 
15 251:1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
20 recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et al, 1 997. Nature 385: 810-813. In brief, 
a cell (e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit 
the growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through 
25 the use of electrical pulses, to an enucleated oocyte from an animal of the same species 
from which the quiescent cell is isolated. The reconstructed oocyte is then cultured such 
that it develops to morula or blastocyte and then transferred to pseudopregnant female 
foster animal. The offspring borne of this female foster animal will be a clone of the 
animal from which the cell (e.g., the somatic cell) is isolated. 

30 Pharmaceutical Compositions 

The NOVX nucleic acid molecules, NOVX proteins, and anti-NOVX antibodies 
(also referred to herein as "active compounds") of the invention, and derivatives, 
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fragments, analogs and homologs thereof, can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. As 
used herein, "pharmaceutically acceptable carrier" is intended to include any and all 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with pharmaceutical administration. 
Suitable carriers are described in the most recent edition of Remington's Pharmaceutical 
Sciences, a standard reference text in the field, which is incorporated herein by reference. 
Preferred examples of such carriers or diluents include, but are not limited to, water, 
saline, finger's solutions, dextrose solution, and 5% human serum albumin. Liposomes 
and non-aqueous vehicles such as fixed oils may also be used. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof in 
the compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 
(i.e„ topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid (EDTA); 
buffers such as acetates, citrates or phosphates, and agents for the adjustment of tonicity 
such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as 
hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in 
ampoules, disposable syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL 
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(BASF, Parsippany, NJ.) or phosphate buffered saline (PBS). In all cases, the 
composition must be sterile and should be fluid to the extent that easy syringeability 
exists. It must be stable under the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria and fungi. 
5 The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 
polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the 
like), and suitable mixtures thereof The proper fluidity can be maintained, for example, 
by the use of a coating such as lecithin, by the maintenance of the required particle size in 
the case of dispersion and by the use of surfactants. Prevention of the action of 

10 microorganisms can be achieved by various antibacterial and antifungal agents, for 

example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many 
cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols 
such as manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the 
injectable compositions can be brought about by including in the composition an agent 

15 which delays absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., an NOVX protein or anti-NOVX antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, followed 
by filtered sterilization. Generally, dispersions are prepared by incorporating the active 

20 compound into a sterile vehicle that contains a basic dispersion medium and the required 
other ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, methods of preparation are vacuum drying and 
freeze-drying that yields a powder of the active ingredient plus any additional desired 
ingredient from a previously sterile-filtered solution thereof. 

25 Oral compositions generally include an inert diluent or an edible carrier. They can 

be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 

30 applied orally and swished and expectorated or swallowed. Pharmaceutically compatible 
binding agents, and/or adjuvant materials can be included as part of the composition. The 
tablets, pills, capsules, troches and the like can contain any of the following ingredients, or 
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compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth 
or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic 
acid, Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; 

5 or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 

10 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 

15 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. ■ 

20 In one embodiment, the active compounds are prepared with carriers that will 

protect the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation 

25 of such formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be 
prepared according to methods known to those skilled in the art, for example, as described 

30 in U.S. Patent No. 4,522,81 1 . 

It is especially advantageous to formulate oral or parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as used 
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herein refers to physically discrete units suited as unitary dosages for the subject to be 
treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of the invention are dictated by and 
5 directly dependent on the unique characteristics of the active compound and the particular 
therapeutic effect to be achieved, and the limitations inherent in the art of compounding 
such an active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 

10 example, intravenous injection, local administration (see, e.g., U.S. Patent No. 5,328,470) 
or by stereotactic injection (see, e.g., Chen, et al., 1994. Proc. Natl. Acad. Sci. USA 91: 
3054-3057). The pharmaceutical preparation of the gene therapy vector can include the 
gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 

15 delivery vector can be produced intact from recombinant cells, e.g. , retroviral vectors, the 
pharmaceutical preparation can include one or more cells that produce the gene delivery 
system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

20 Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express NOVX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy 
applications), to detect NOVX mRNA (e.g., in a biological sample) or a genetic lesion in 
an NOVX gene, and to modulate NO VX activity, as described further, below. In addition, 

25 the NOVX proteins can be used to screen drugs or compounds that modulate the NOVX 
protein activity or expression as well as to treat disorders characterized by insufficient or 
excessive production of NOVX protein or production of NOVX protein forms that have 
decreased or aberrant activity compared to NOVX wild-type protein (e.g. ; diabetes 
(regulates insulin release); obesity (binds and transport lipids); metabolic disturbances 

30 associated with obesity, the metabolic syndrome X as well as anorexia and wasting 
disorders associated with chronic diseases and various cancers, and infectious 
disease(possesses anti-microbial activity) and the various dyslipidemias. In addition, the 
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anti-NOVX antibodies of the invention can be used to detect and isolate NOVX proteins 
and modulate NOVX activity. In yet a further aspect, the invention can be used in methods 
to influence appetite, absorption of nutrients and the disposition of metabolic substrates in 
both a positive and negative fashion. 
5 The invention further pertains to novel agents identified by the screening assays 

described herein and uses thereof for treatments as described, supra. 

Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
1 0 peptidomimetics, small molecules or other drugs) that bind to NOVX proteins or have a 
stimulatory or inhibitory effect on, e.g., NOVX protein expression or NOVX protein 
? activity. The invention also includes compounds identified in the screening assays 
described herein. 

In one embodiment, the invention provides assays for screening candidate or test 

1 5 compounds which bind to or modulate the activity of the membrane-bound form of an 

NOVX protein or polypeptide or biologically-active portion thereof. The test compounds 
of the invention can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 

20 deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 
non-peptide oligomer or small molecule libraries of compounds. See, e.g., Lam, 1997. 
Anticancer Drug Design 12: 145. 

25 A "small molecule" as used herein, is meant to refer to a composition that has a 

molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 
molecules can be, e.g. , nucleic acids, peptides, polypeptides, peptidomimetics, 
carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical 
and/or biological mixtures, such as fungal, bacterial, or algal extracts, are known in the art 

30 and can be screened with any of the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt, et al y 1993. Proc. Natl Acad. Sci. U.S.A. 90: 6909; Erb, et aU 
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1994. Proc. Natl Acad Set U.S.A. 91:1 1422; Zuckermann, et al, 1994. J. Med Chem. 
37: 2678; Cho, etal, 1993. Science 261: 1303; Carrell, etal, 1994. Angew. Chem. Int. Ed. 
Engl. 33: 2059; Carell, et al, 1994. Angew. Chem. Int. Ed. Engl 33: 2061; and Gallop, et 
al, 1994. J. Med. Chem. 37: 1233. 
5 Libraries of compounds may be presented in solution {e.g., Houghten, 1992. 

Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips 
(Fodor, 1993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores 
(Ladner, U.S. Patent 5,233,409), plasmids (Cull, et al, 1992. Proc. Natl Acad. ScL USA 
89: 1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. 

10 Science 249: 404-406; Cwirla, et al, 1990. Proc. Natl Acad. Sci. USA. 87: 6378-6382; 
Felici, 1991. J. Mol Biol 222: 301-310; Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses 
a membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the 
cell surface is contacted with a test compound and the ability of the test compound to bind 

15 to an NOVX protein determined. The cell, for example, can of mammalian origin or a 
yeast cell. Determining the ability of the test compound to bind to the NOVX protein can 
be accomplished, for example, by coupling the test compound with a radioisotope or 
enzymatic label such that binding of the test compound to the NOVX protein or 
biologically-active portion thereof can be determined by detecting the labeled compound 

20 in a complex. For example, test compounds can be labeled with I25 1, 35 S, 14 C, or 3 H, either 
directly or indirectly, and the radioisotope detected by direct counting of radioemission or 
by scintillation counting. Alternatively, test compounds can be enzymatically-labeled 
with, for example, horseradish peroxidase, alkaline phosphatase, or luciferase, and the 
enzymatic label detected by determination of conversion of an appropriate substrate to 

25 product. In one embodiment, the assay comprises contacting a cell which expresses a 
membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the 
cell surface with a known compound which binds NOVX to form an assay mixture, 
contacting the assay mixture with a test compound, and determining the ability of the test 
compound to interact with an NOVX protein, wherein determining the ability of the test 

30 compound to interact with an NOVX protein comprises determining the ability of the test 
compound to preferentially bind to NOVX protein or a biologically-active portion thereof 
as compared to the known compound. 
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In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of NOVX protein, or a biologically-active portion 
thereof, on the cell surface with a test compound and determining the ability of the test 
compound to modulate (e.g., stimulate or inhibit) the activity of the NOVX protein or 
biologically-active portion thereof. Determining the ability of the test compound to 
modulate the activity of NOVX or a biologically-acti ve portion thereof can be 
accomplished, for example, by determining the ability of the NOVX protein to bind to or 
interact with an NOVX target molecule. As used herein, a "target molecule" is a molecule 
with which an NOVX protein binds or interacts in nature, for example, a molecule on the 
surface of a cell which expresses an NOVX interacting protein, a molecule on the surface 
of a second cell, a molecule in the extracellular milieu, a molecule associated with the 
internal surface of a cell membrane or a cytoplasmic molecule. An NOVX target 
molecule can be a non-NOVX molecule or an NOVX protein or polypeptide of the 
invention. In one embodiment, an NOVX target molecule is a component of a signal 
transduction pathway that facilitates transduction of an extracellular signal (e.g. a signal 
generated by binding of a compound to a membrane-bound NOVX molecule) through the 
cell membrane and into the cell. The target, for example, can be a second intercellular 
protein that has catalytic activity or a protein that facilitates the association of downstream 
signaling molecules with NOVX. 

Determining the ability of the NOVX protein to bind to or interact with an NOVX 
target molecule can be accomplished by one of the methods described above for 
determining direct binding. In one embodiment, determining the ability of the NOVX 
protein to bind to or interact with an NOVX target molecule can be accomplished by 
determining the activity of the target molecule. For example, the activity of the target 
molecule can be determined by detecting induction of a cellular second messenger of the 
target (i.e. intracellular Ca 2+ , diacylglycerol, IP 3 , etc.), detecting catalytic/enzymatic 
activity of the target an appropriate substrate, detecting the induction of a reporter gene 
(comprising an NOVX-responsive regulatory element operatively linked to a nucleic acid 
encoding a detectable marker, e.g., luciferase), or detecting a cellular response, for 
example, cell survival, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay 
comprising contacting an NOVX protein or biologically-active portion thereof with a test 
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compound and determining the ability of the test compound to bind to the NOVX protein 
or biologically-active portion thereof. Binding of the test compound to the NOVX protein 
can be determined either directly or indirectly as described above. In one such 
embodiment, the assay comprises contacting the NOVX protein or biologically-active 
5 portion thereof with a known compound which binds NOVX to form an assay mixture, 
contacting the assay mixture with a test compound, and determining the ability of the test 
compound to interact with an NOVX protein, wherein determining the ability of the test 
compound to interact with an NOVX protein comprises determining the ability of the test 
compound to preferentially bind to NOVX or biologically-active portion thereof as 

1 0 compared to the known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting 
NOVX protein or biologically-active portion thereof with a test compound and 
determining the ability of the test compound to modulate (e.g. stimulate or inhibit) the 
activity of the NOVX protein or biologically-active portion thereof. Determining the 

1 5 ability of the test compound to modulate the activity of NOVX can be accomplished, for 
example, by determining the ability of the NOVX protein to bind to an NOVX target 
molecule by one of the methods described above for determining direct binding. In an 
alternative embodiment, determining the ability of the test compound to modulate the 
activity of NOVX protein can be accomplished by determining the ability of the NOVX 

20 protein further modulate an NOVX target molecule. For example, the catalytic/enzymatic 
activity of the target molecule on an appropriate substrate can be determined as described, 
supra. 

In yet another embodiment, the cell-free assay comprises contacting the NOVX 
protein or biologically-active portion thereof with a known compound which binds NOVX 

25 protein to form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with an NOVX protein, wherein 
determining the ability of the test compound to interact with an NOVX protein comprises 
determining the ability of the NOVX protein to preferentially bind to or modulate the 
activity of an NOVX target molecule. 

30 The cell-free assays of the invention are amenable to use of both the soluble form 

or the membrane-bound form of NOVX protein. In the case of cell-free assays comprising 
the membrane-bound form of NOVX protein, it may be desirable to utilize a solubilizing 
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agent such that the membrane-bound form of NOVX protein is maintained in solution. 
Examples of such solubilizing agents include non-ionic detergents such as 
n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 
5 Isotridecypoly(ethylene glycol ether) n , N-dodecyl--N,N-dimethyl-3-ammonio-l -propane 
sulfonate, 3-(3-cholamidopropyl) dimethylamminiol-1 -propane sulfonate (CHAPS), or 
3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l -propane sulfonate (CHAPSO). 

In more than one embodiment of the above assay methods of the invention, it may 
be desirable to immobilize either NOVX protein or its target molecule to facilitate 

10 separation of complexed from uncomplexed forms of one or both of the proteins, as well 
as to accommodate automation of the assay. Binding of a test compound to NOVX 
protein, or interaction of NOVX protein with a target molecule in the presence and 
absence of a candidate compound, can be accomplished in any vessel suitable for 
containing the reactants. Examples of such vessels include microliter plates, test tubes, 

1 5 and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided that 
adds a domain that allows one or both of the proteins to be bound to a matrix. For 
example, GST-NO VX fusion proteins or GST-target fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized 
microtiter plates, that are then combined with the test compound or the test compound and 

20 either the non-adsorbed target protein or NOVX protein, and the mixture is incubated 

under conditions conducive to complex formation (e.g., at physiological conditions for salt 
and pH). Following incubation, the beads or microtiter plate wells are washed to remove 
any unbound components, the matrix immobilized in the case of beads, complex 
determined either directly or indirectly, for example, as described, supra. Alternatively, 

25 the complexes can be dissociated from the matrix, and the level of NOVX protein binding 
or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the NOVX protein or its target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 

30 NOVX protein or target molecules can be prepared from biotin-NHS 

(N-hydroxy-succinimide) using techniques well-known within the art (e.g., biotinylation 
kit, Pierce Chemicals, Rockford, 111.), and immobilized in the wells of streptavidin-coated 

75 



WO 03/010327 



PCI7US02/14199 



96 well plates (Pierce Chemical). Alternatively, antibodies reactive with NOVX protein 
or target molecules, but which do not interfere with binding of the NOVX protein to its 
target molecule, can be derivatized to the wells of the plate, and unbound target or NOVX 
protein trapped in the wells by antibody conjugation. Methods for detecting such 

5 complexes, in addition to those described above for the GST-immobilized complexes, 
include immunodetection of complexes using antibodies reactive with the NOVX protein 
or target molecule, as well as enzyme-linked assays that rely on detecting an enzymatic 
activity associated with the NOVX protein or target molecule. 

In another embodiment, modulators of NOVX protein expression are identified in 

10 a method wherein a cell is contacted with a candidate compound and the expression of 
NOVX mRNA or protein in the cell is determined. The level of expression of NOVX 
mRNA or protein in the presence of the candidate compound is compared to the level of 
expression of NOVX mRNA or protein in the absence of the candidate compound. The 
candidate compound can then be identified as a modulator of NOVX mRNA or protein 

1 5 expression based upon this comparison. For example, when expression of NOVX mRNA 
or protein is greater (i.e., statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 
NOVX mRNA or protein expression. Alternatively, when expression of NOVX mRNA or 
protein is less (statistically significantly less) in the presence of the candidate compound 

20 than in its absence, the candidate compound is identified as an inhibitor of NOVX mRNA 
or protein expression. The level of NOVX mRNA or protein expression in the cells can be 
determined by methods described herein for detecting NOVX mRNA or protein. 

In yet another aspect of the invention, the NOVX proteins can be used as "bait 
proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; 

25 Zervos, et aL, 1993. Cell 72: 223-232; Madura, et a/., 1993. J. Biol Chem. 268: 

12046-12054; Bartel, et a/., 1993. Biotechniques 14: 920-924; Iwabuchi, etal, 1993. 
Oncogene 8: 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to 
or interact with NOVX ("NOVX-binding proteins" or "NOVX-bp") and modulate NOVX 
activity. Such NOVX-binding proteins are also likely to be involved in the propagation of 

30 signals by the NOVX proteins as, for example, upstream or downstream elements of the 
NOVX pathway. 
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The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for 
NOVX is fused to a gene encoding the DNA binding domain of a known transcription 

5 factor (e.g. , GAL-4). In the other construct, a DNA sequence, from a library of DNA 

sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene that 
codes for the activation domain of the known transcription factor. If the "bait" and the 
"prey" proteins are able to interact, in vivo, forming an NOVX-dependent complex, the 
DNA-binding and activation domains of the transcription factor are brought into close 

10 proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) that is 
operably linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected and cell colonies containing the functional 
transcription factor can be isolated and used to obtain the cloned gene that encodes the 
protein which interacts with NOVX. 

1 5 The invention further pertains to novel agents identified by the aforementioned 

screening assays and uses thereof for treatments as described herein. 

Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
20 corresponding complete gene sequences) can be used in numerous ways as polynucleotide 
reagents. By way of example, and not of limitation, these sequences can be used to: (/) 
map their respective genes on a chromosome; and, thus, locate gene regions associated 
with genetic disease; (ii) identify an individual from a minute biological sample (tissue 
typing); and (Hi) aid in forensic identification of a biological sample. Some of these 
25 applications are described in the subsections, below. 

Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the NOVX 
30 sequences, SEQ ID NO: 2n-l , wherein n is an integer between 1 and 101 , or fragments or 
derivatives thereof, can be used to map the location of the NOVX genes, respectively, on a 
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chromosome. The mapping of the NOVX sequences to chromosomes is an important first 
step in correlating these sequences with genes associated with disease. 

Briefly, NOVX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the NOVX sequences. Computer analysis of the 
5 NOVX, sequences can be used to rapidly select primers that do not span more than one 
exon in the genomic DNA, thus complicating the amplification process. These primers 
can then be used for PCR screening of somatic cell hybrids containing individual human 
chromosomes. Only those hybrids containing the human gene corresponding to the 
NOVX sequences will yield an amplified fragment. 

10 Somatic cell hybrids are prepared by fusing somatic cells from different mammals 

(e.g., human and mouse cells). As hybrids of human and mouse cells grow and divide, 
they gradually lose human chromosomes in random order, but retain the mouse 
chromosomes. By using media in which mouse cells cannot grow, because they lack a 
particular enzyme, but in which human cells can, the one human chromosome that 

15 contains the gene encoding the needed enzyme will be retained. By using various media, 
panels of hybrid cell lines can be established. Each cell line in a panel contains either a 
single human chromosome or a small number of human chromosomes, and a full set of 
mouse chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. See, e.g., D'Eustachio, et ah, 1983. Science 220: 919-924. Somatic cell 

20 hybrids containing only fragments of human chromosomes can also be produced by using 
human chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day 
using a single thermal cycler. Using the NOVX sequences to design oligonucleotide 

25 primers, sub-localization can be achieved with panels of fragments from specific 
chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose division has been blocked in 
30 metaphase by a chemical like colcemid that disrupts the mitotic spindle. The 

chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A pattern 
of light and dark bands develops on each chromosome, so that the chromosomes can be 
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identified individually. The FISH technique can be used with a DNA sequence as short as 
500 or 600 bases. However, clones larger than 1 5 000 bases have a higher likelihood of 
binding to a unique chromosomal location with sufficient signal intensity for simple 
detection. Preferably 1,000 bases, and more preferably 2,000 bases, will suffice to get 
5 good results at a reasonable amount of time. For a review of this technique, see, Verma, et 
aU Human Chromosomes: A Manual of Basic Techniques (Pergamon Press, New 
York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 

1 0 marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene families, thus increasing the chance 
of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 

1 5 position of the sequence on the chromosome can be correlated with genetic map data, 
Such data are found, e.g., in McKusick, Mendelian Inheritance in Man, available 
on-line through Johns Hopkins University Welch Medical Library). The relationship 
between genes and disease, mapped to the same chromosomal region, can then be 
identified through linkage analysis (co-inheritance of physically adjacent genes), described 

20 in, e.g., Egeland, et aL, 1987. Nature, 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the NOVX gene, can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 

25 Comparison of affected and unaffected individuals generally involves first looking for 
structural alterations in the chromosomes, such as deletions or translocations that are 
visible from chromosome spreads or detectable using PGR based on that DNA sequence. 
Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

30 Tissue Typing 

The NOVX sequences of the invention can also be used to identify individuals 
from minute biological samples. In this technique, an individual's genomic DNA is 
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digested with one or more restriction enzymes, and probed on a Southern blot to yield 
unique bands for identification. The sequences of the invention are useful as additional 
DNA markers for RFLP ("restriction fragment length polymorphisms" described in U.S. 
Patent No. 5,272,057). 

5 Furthermore, the sequences of the invention can be used to provide an alternative 

technique that determines the actual base-by-base DNA sequence of selected portions of 
an individual's genome. Thus, the NOVX sequences described herein can be used to 
prepare two PCR primers from the 5 - and 3'-termini of the sequences. These primers can 
then be used to amplify an individual's DNA and subsequently sequence it. 

1 0 Panels of corresponding DNA sequences from individuals, prepared in this 

manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
invention can be used to obtain such identification sequences from individuals and from 
tissue. The NOVX sequences of the invention uniquely represent portions of the human 

15 genome. Allelic variation occurs to some degree in the coding regions of these sequences, 
and to a greater degree in the noncoding regions. It is estimated that allelic variation 
between individual humans occurs with a frequency of about once per each 500 bases. 
Much of the allelic variation is due to single nucleotide polymorphisms (SNPs), which " 
include restriction fragment length polymorphisms (RFLPs). 

20 Each of the sequences described herein can, to some degree, be used as a standard 

against which DNA from an individual can be compared for identification purposes. 
Because greater numbers of polymorphisms occur in the noncoding regions, fewer 
sequences are necessary to differentiate individuals. The noncoding sequences can 
comfortably provide positive individual identification with a panel of perhaps 10 to 1,000 

25 primers that each yield a noncoding amplified sequence of 100 bases. If predicted coding 
sequences, such as those in SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101 
are used, a more appropriate number of primers for positive individual identification 
would be 500-2,000. 

Predictive Medicine 

30 The invention also pertains to the field of predictive medicine in which diagnostic 

assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 
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Accordingly, one aspect of the invention relates to diagnostic assays for determining 
NOVX protein and/or nucleic acid expression as well as NOVX activity, in the context of 
a biological sample (e.g., blood, serum, cells, tissue) to thereby determine whether an 
individual is afflicted with a disease or disorder, or is at risk of developing a disorder, 
5 associated with aberrant NOVX expression or activity. The disorders include metabolic 
disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated cachexia, 
cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 
disorders, and hematopoietic disorders, and the various dyslipidemias, metabolic 
disturbances associated with obesity, the metabolic syndrome X and wasting disorders 

10 associated with chronic diseases and various cancers. The invention also provides for 
prognostic (or predictive) assays for determining whether an individual is at risk of 
developing a disorder associated with NOVX protein, nucleic acid expression or activity. 
For example, mutations in an NOVX gene can be assayed in a biological sample. Such 
assays can be used for prognostic or predictive purpose to thereby prophylactically treat an 

1 5 individual prior to the onset of a disorder characterized by or associated with NOVX 
protein, nucleic acid expression, or biological activity. 

Another aspect of the invention provides methods for determining NOVX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate . 
therapeutic or prophylactic agents for that individual (referred to herein as 

20 "pharmacogenomics"). Pharmacogenomics allows for the selection of agents (e.g. , drugs) 
for therapeutic or prophylactic treatment of an individual based on the genotype of the 
individual {e.g., the genotype of the individual examined to determine the ability of the 
individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
25 drugs, compounds) on the expression or activity of NOVX in clinical trials. 

These and other agents are described in further detail in the following sections. 

Diagnostic Assays 

An exemplary method for detecting the presence or absence of NOVX in a 
biological sample involves obtaining a biological sample from a test subject and 
30 contacting the biological sample with a compound or an agent capable of detecting NOVX 
protein or nucleic acid (e.g., mRNA, genomic DNA) that encodes NOVX protein such that 
the presence of NOVX is detected in the biological sample. An agent for detecting NOVX 
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mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to NOVX 
mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length 
NOVX nucleic acid, such as the nucleic acid of SEQ ID NO: 2n-l, wherein n is an integer 
between 1 and 101, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 
5 1 00, 250 or 500 nucleotides in length and sufficient to specifically hybridize under 

stringent conditions to NOVX mRNA or genomic DNA. Other suitable probes for use in 
the diagnostic assays of the invention are described herein. 

An agent for detecting NOVX protein is an antibody capable of binding to NOVX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or 

10 more preferably, monoclonal. An intact antibody, or a fragment thereof (eg., Fab or 

F(ab ! )2) can be used. The term "labeled", with regard to the probe or antibody, is intended 
to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect 

15 labeling include detection of a primary antibody using a fluorescently-labeled secondary 
antibody and end-labeling of a DNA probe with biotin such that it can be detected with 
fluorescently-labeled streptavidin. The term "biological sample" is intended to include 
tissues, cells and biological fluids isolated from a subject, as well as tissues, cells and 
fluids present within a subject. That is, the detection method of the invention can be used 

20 to detect NOVX mRNA, protein, or genomic DNA in a biological sample in vitro as well 
as in vivo. For example, in vitro techniques for detection of NOVX mRNA include 
Northern hybridizations and in situ hybridizations. In vitro techniques for detection of 
NOVX protein include enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations, and immunofluorescence. In vitro techniques for detection of 

25 NOVX genomic DNA include Southern hybridizations. Furthermore, in vivo techniques 
for detection of NOVX protein include introducing into a subject a labeled anti-NOVX 
antibody. For example, the antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 

30 subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject. A preferred biological sample is 
a peripheral blood leukocyte sample isolated by conventional means from a subject. 
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In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent 
capable of detecting NOVX protein, mRNA, or genomic DNA, such that the presence of 
NOVX protein, mRNA or genomic DNA is detected in the biological sample, and 

5 comparing the presence of NOVX protein, mRNA or genomic DNA in the control sample 
with the presence of NOVX protein, mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of NOVX in a 
biological sample. For example, the kit can comprise: a labeled compound or agent 
capable of detecting NOVX protein or mRNA in a biological sample; means for 

10 determining the amount of NOVX in the sample; and means for comparing the amount of 
NOVX in the sample witlra standard. The compound or agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to detect 
NOVX protein or nucleic acid. 

1 5 Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
NOVX expression or activity. For example, the assays described herein, such as the 

20 preceding diagnostic assays or the following assays, can be utilized to identify a subject 
having or at risk of developing a disorder associated with NOVX protein, nucleic acid 
expression or activity. Alternatively, the prognostic assays can be utilized to identify a 
subject having or at risk for developing a disease or disorder. Thus, the invention provides 
a method for identifying a disease or disorder associated with aberrant NOVX expression 

25 or activity in which a test sample is obtained from a subject and NOVX protein or nucleic 
acid (e.g., mRNA, genomic DNA) is detected, wherein the presence of NOVX protein or 
nucleic acid is diagnostic for a subject having or at risk of developing a disease or disorder 
associated with aberrant NOVX expression or activity. As used herein, a "test sample" 
refers to a biological sample obtained from a subject of interest For example, a test 

30 sample can be a biological fluid (e.g. , serum), cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) to 
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treat a disease or disorder associated with aberrant NOVX expression or activity. For 
example, such methods can be used to determine whether a subject can be effectively 
treated with an agent for a disorder. Thus, the invention provides methods for determining 
whether a subject can be effectively treated with an agent for a disorder associated with 

5 aberrant NOVX expression or activity in which a test sample is obtained and NOVX 

protein or nucleic acid is detected {e.g., wherein the presence of NOVX protein or nucleic 
acid is diagnostic for a subject that can be administered the agent to treat a disorder 
associated with aberrant NOVX expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an 

10 NOVX gene, thereby determining if a subject with the lesioned gene is at risk for a 
disorder characterized by aberrant cell proliferation and/or differentiation. In various 
embodiments, the methods include detecting, in a sample of cells from the subject, the 
presence or absence of a genetic lesion characterized by at least one of an alteration 
affecting the integrity of a gene encoding an NOVX-protein, or the misexpression of the 

15 NOVX gene. For example, such genetic lesions can be detected by ascertaining the 

existence of at least one of: (/) a deletion of one or more nucleotides from an NOVX gene; 
(//) an addition of one or more nucleotides to an NOVX gene; (///) a substitution of one or 
more nucleotides of an NOVX gene, (/v) a chromosomal rearrangement of an NOVX 
gene; (v) an alteration in the level of a messenger RNA transcript of an NOVX gene, (yi) 

20 aberrant modification of an NOVX gene, such as of the methylation pattern of the 

genomic DNA, (vn) the presence of a non-wild-type splicing pattern of a messenger RNA 
transcript of an NOVX gene, (yiii) a non-wild-type level of an NOVX protein, (ix) allelic 
loss of an NOVX gene, and (x) inappropriate post-translational modification of an NOVX 
protein. As described herein, there are a large number of assay techniques known in the 

25 art which can be used for detecting lesions in an NOVX gene. A preferred biological 
sample is a peripheral blood leukocyte sample isolated by conventional means from a 
subject. However, any biological sample containing nucleated cells may be used, 
including, for example, buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer 

30 in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 

4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain 
reaction (LCR) (see, e.g., Landegran, et aU 1988. Science 241: 1077-1080; and 
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Nakazawa, etal, 1994. Proc. Natl Acad. Sci. USA91: 360-364), the latter of which can 
be particularly useful for detecting point mutations in the NOVX-gene (see, Abravaya, et 
al., 1995. Nucl. Acids Res. 23: 675-682). This method can include the steps of collecting a 
sample of cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from 
the cells of the sample, contacting the nucleic acid sample with one or more primers that 
specifically hybridize to an NOVX gene under conditions such that hybridization and 
amplification of the NOVX gene (if present) occurs, and detecting the presence or absence 
of an amplification product, or detecting the size of the amplification product and 
comparing the length to a control sample. It is anticipated that PCR and/or LCR may be 
desirable to use as a preliminary amplification step in conjunction with any of the 
techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication (see, 
Guatelli, et al, 1990. Proc. Natl. Acad. Sci. USA 87: 1874-1878), transcriptional 
amplification system (see, Kwoh, et al, 1989. Proc. Natl. Acad. Sci. USA 86: 1173-1177); 
Qp Replicase (see, Lizardi, et al, 1988. BioTechnology 6: 1 197), or any other nucleic acid 
amplification method, followed by the detection of the amplified molecules using 
techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 
numbers. 

In an alternative embodiment, mutations in an NOVX gene from a sample cell can 
be identified by alterations in restriction enzyme cleavage patterns. For example, sample 
and control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and 
compared. Differences in fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use of sequence specific 
ribozymes (see, e.g., U.S. Patent No. 5,493,53 1) can be used to score for the presence of 
specific mutations by development or loss of a ribozyme cleavage site. 

In other embodiments, genetic mutations in NOVX can be identified by 
hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays 
containing hundreds or thousands of oligonucleotides probes. See, e.g., Cronin, et al, 
1996. Human Mutation 7: 244-255; Kozal, et al, 1996. Nat. Med. 2: 753-759. For 
example, genetic mutations in NOVX can be identified in two dimensional arrays 
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containing light-generated DNA probes as described in Cronin, et aL 9 supra. Briefly, a 
first hybridization array of probes can be used to scan through long stretches of DNA in a 
sample and control to identify base changes between the sequences by making linear 
arrays of sequential overlapping probes. This step allows the identification of point 
5 mutations. This is followed by a second hybridization array that allows the 

characterization of specific mutations by using smaller, specialized probe arrays 
complementary to all variants or mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild-type gene and the other complementary 
to the mutant gene. 

10 In yet another embodiment, any of a variety of sequencing reactions known in the 

art can be used to directly sequence the NOVX gene and detect mutations by comparing 
the sequence of the sample NOVX with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on techniques developed by Maxim 
and Gilbert, 1977. Proc. Natl Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl Acad. 

15 Set USA 74: 5463. It is also contemplated that any of a variety of automated sequencing 
procedures can be utilized when performing the diagnostic assays (see, e.g., Naeve, et aL 9 
1995. Biotechniques 19: 448), including sequencing by mass spectrometry (see, e.g., PCT 
International Publication No. WO 94/16101; Cohen, et al., 1996. Adv. Chromatography 
36: 127-162; and Griffin, etal, 1993. Appl Biochem. Biotechnol. 38: 147-159). 

20 Other methods for detecting mutations in the NOVX gene include methods in 

which protection from cleavage agents is used to detect mismatched bases in RNA/RNA 
or RNA/DNA heteroduplexes. See, e.g., Myers, et al. y 1985. Science 230: 1242. In 
general, the art technique of "mismatch cleavage" starts by providing heteroduplexes of 
formed by hybridizing (labeled) RNA or DNA containing the wild-type NOVX sequence 

25 with potentially mutant RNA or DNA obtained from a tissue sample. The double-stranded 
duplexes are treated with an agent that cleaves single-stranded regions of the duplex such 
as which will exist due to basepair mismatches between the control and sample strands. 
For instance, RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids 
treated with Si nuclease to enzymatically digesting the mismatched regions. In other 

30 embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 

hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 
regions. After digestion of the mismatched regions, the resulting material is then 
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separated by size on denaturing polyacrylamide gels to determine the site of mutation. 
See, e.g., Cotton, etal, 1988,/Voc. Natl. Acad. Sci. USA 85: 4397; Saleeba, etal, 1992. 
Methods Enzymol. 217: 286-295. In an embodiment, the control DNA or RNA can be 
labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations 
in NOVX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. 
coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells 
cleaves T at G/T mismatches. See, e.g., Hsu, era/., 1994. Carcinogenesis 15: 1657-1662. 
According to an exemplary embodiment, a probe based on an NOVX sequence, e.g., a 
wild-type NOVX sequence, is hybridized to a cDNA or other DNA product from a test 
cell(s). The duplex is treated with a DNA mismatch repair enzyme, and the cleavage 
products, if any, can be detected from electrophoresis protocols or the like. See, e.g., U.S. 
Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to 
identify mutations in NOVX genes. For example, single strand conformation 
polymorphism (SSCP) may be used to detect differences in electrophoretic mobility 
between mutant and wild type nucleic acids. See, e.g., Orita, et al, 1989. Proc. Natl. 
Acad Sci. USA: 86: 2766; Cotton, 1993. Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. 
Anal. Tech. Appl. 9: 73-79. Single-stranded DNA fragments of sample and control NOVX 
nucleic acids will be denatured and allowed to renature. The secondary structure of 
single-stranded nucleic acids varies according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a single base change. The DNA 
fragments may be labeled or detected with labeled probes. The sensitivity of the assay 
may be enhanced by using RNA (rather than DNA), in which the secondary structure is 
more sensitive to a change in sequence. In one embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility. See, e.g., Keen, et al., 1991. Trends Genet. 7: 5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE). See, e.g., Myers, et al, 1985. Nature 313: 495. 
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When DGGE is used as the method of analysis, DNA will be modified to insure that it 
does not completely denature, for example by adding a GC clamp of approximately 40 bp 
of high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient 
is used in place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA. See, e.g., Rosenbaurh and Reissner, 1987. Biophys. Chem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
that permit hybridization only if a perfect match is found. See, e.g., Saiki, et al, 1986. 
Nature 324: 163; Saiki, et al, 1989. Proc. Natl. Acad. Sci. USA 86: 6230. Such allele 
specific oligonucleotides are hybridized to PCR amplified target DNA or a number of 
different mutations when the oligonucleotides are attached to the hybridizing membrane 
and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective 
PCR amplification may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization; see, e.g., Gibbs, et al, 1989. Nucl. Acids Res. 17: 2437-2448) or at the 
extreme 3'-terminus of one primer where, under appropriate conditions, mismatch can 
prevent, or reduce polymerase extension {see, e.g., Prossner, 1 993. Tibteck 1 1 : 238). In 
addition it may be desirable to introduce a novel restriction site in the region of the 
mutation to create cleavage-based detection. See, e.g., Gasparini, et a!., 1992. Mol. Cell 
Probes 6:1. It is anticipated that in certain embodiments amplification may also be 
performed using Taq ligase for amplification. See, e.g., Barany, 1991. Proc. Natl. Acad. 
Sci. USA 88: 189. In such cases, ligation will occur only if there is a perfect match at the 
3'-terminus of the 5' sequence, making it possible to detect the presence of a known 
mutation at a specific site by looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by utilizing 
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, e.g., in clinical settings to 
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diagnose patients exhibiting symptoms or family history of a disease or illness involving 
anNOVXgene. 

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
NOVX is expressed may be utilized in the prognostic assays described herein. However, 
5 any biological sample containing nucleated cells may be used, including, for example, 
buccal mucosal cells. 

Pharmacogenomics 

10 * Agents, or modulators that have a stimulatory or inhibitory effect on NOVX 

activity (e.g., NOVX gene expression), as identified by a screening assay described herein 
can be administered to individuals to treat (prophylactically or therapeutically) disorders 
(The disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, 
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, 

15 Parkinson's Disorder, immune disorders, and hematopoietic disorders, and the various 
dyslipidemias, metabolic disturbances associated with obesity, the metabolic syndrome X 
and wasting disorders associated with chronic diseases and various cancers.) In 
conjunction with such treatment, the pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound or 

20 drug) of the individual may be considered. Differences in metabolism of therapeutics can 
lead to severe toxicity or therapeutic failure by altering the relation between dose and 
blood concentration of the pharmacologically active drug. Thus, the pharmacogenomics 
of the individual permits the selection of effective agents (e.g., drugs) for prophylactic or 
therapeutic treatments based on a consideration of the individual's genotype. Such 

25 pharmacogenomics can further be used to determine appropriate dosages and therapeutic 
regimens. Accordingly, the activity of NOVX protein, expression of NOVX nucleic acid, 
or mutation content of NOVX genes in an individual can be determined to thereby select 
appropriate agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 

30 response to drugs due to altered drug disposition and abnormal action in affected persons. 
See e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol. Physiol, 23: 983-985; Under, 1997. 
Clin, Chem. 9 43: 254-266, In general, two types of pharmacogenetic conditions can be 
differentiated. Genetic conditions transmitted as a single factor altering the way drugs act 
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on the body (altered drug action) or genetic conditions transmitted as single factors 
altering the way the body acts on drugs (altered drug metabolism). These 
pharmacogenetic conditions can occur either as rare defects or as polymorphisms. For 
example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common inherited 
5 enzymopathy in which the main clinical complication is hemolysis after ingestion of 
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of 
fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 

10 genetic polymorphisms of drug metabolizing enzymes (e.g. , N-acetyltransferase 2 (NAT 
2) and cytochrome PREGNANCY ZONE PROTEIN PRECURSOR enzymes CYP2D6 
and CYP2C19) has provided an explanation as to why some patients do not obtain the 
expected drug effects or show exaggerated drug response and serious toxicity after taking 
the standard and safe dose of a drug. These polymorphisms are expressed in two 

1 5 phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). 
The prevalence of PM is different among different populations. For example, the gene 
coding for CYP2D6 is highly polymorphic and several mutations have been identified in 
PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 
and CYP2C19 quite frequently experience exaggerated drug response and side effects 

20 when they receive standard doses. If a metabolite is the active therapeutic moiety, PM 
show no therapeutic response, as demonstrated for the analgesic effect of codeine 
mediated by its CYP2D6-formed metabolite morphine. At the other extreme are the so 
called ultra-rapid metabolizers who do not respond to standard doses. Recently, the 
molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 

25 amplification. 

Thus, the activity of NOVX protein, expression of NOVX nucleic acid, or mutation 
content of NOVX genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding 
30 drug-metabolizing enzymes to the identification of an individual's drug responsiveness 
phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse 
reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency 
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when treating a subject with an NOVX modulator, such as a modulator identified by one 
of the exemplary screening assays described herein. 

Monitoring of Effects During Clinical Trials 

5 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of NOVX (e.g., the ability to modulate' aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical trials. 
For example, the effectiveness of an agent determined by a screening assay as described 

1 0 herein to increase NOVX gene expression, protein levels, or upregulate NOVX activity, 
can be monitored in clinical trails of subjects exhibiting decreased NOVX gene 
expression, protein levels, or downregulated NOVX activity. Alternatively, the 
effectiveness of an agent determined by a screening assay to decrease NOVX gene 
expression, protein levels, or downregulate NOVX activity, can be monitored in clinical 

1 5 trails of subjects exhibiting increased NOVX gene expression, protein levels, or 

upregulated NOVX activity. In such clinical trials, the expression or activity of NOVX 
and, preferably, other genes that have been implicated in, for example, a cellular 
proliferation or immune disorder can be used as a "read out" or markers of the immune 
responsiveness of a particular cell. 

20 By way of example, and not of limitation, genes, including NOVX, that are 

modulated in cells by treatment with an agent (e.g., compound, drug or small molecule) 
that modulates NOVX activity (e.g., identified in a screening assay as described herein) 
can be identified. Thus, to study the effect of agents on cellular proliferation disorders, for 
example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the 

25 levels of expression of NOVX and other genes implicated in the disorder. The levels of 
gene expression (i.e., a gene expression pattern) can be quantified by Northern blot 
analysis or RT-PCR, as described herein, or alternatively by measuring the amount of 
protein produced, by one of the methods as described herein, or by measuring the levels of 
activity of NOVX or other genes. In this manner, the gene expression pattern can serve as 

30 a marker, indicative of the physiological response of the cells to the agent. Accordingly, 
this response state may be determined before, and at various points during, treatment of 
the individual with the agent. 
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In one embodiment, the invention provides a method for monitoring the 
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, 
peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the steps of (i) obtaining a 
5 pre-administration sample from a subject prior to administration of the agent; (//) detecting 
the level of expression of an NOVX protein, mRNA, or genomic DNA in the 
preadministration sample; (Hi) obtaining one or more post-administration samples from 
the subject; (z'v) detecting the level of expression or activity of the NOVX protein, mRNA, 
or genomic DNA in the post-administration samples; (v) comparing the level of expression 

10 or activity of the NOVX protein, mRNA, or genomic DNA in the pre-administration 
sample with the NOVX protein, mRNA, or genomic DNA in the post administration 
sample or samples; and (vz) altering the administration of the agent to the subject 
accordingly. For example, increased administration of the agent may be desirable to 
increase the expression or activity of NOVX to higher levels than detected, i.e., to increase 

15 the effectiveness of the agent. Alternatively, decreased administration of the agent may be 
desirable to decrease expression or activity of NOVX to lower levels than detected, i.e., to 
decrease the effectiveness of the agent. 

Methods of Treatment 

20 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with 
aberrant NOVX expression or activity. The disorders include cardiomyopathy, 
atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial septal defect 

25 (ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, 
subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, 
scleroderma, obesity, transplantation, adrenoleukodystrophy, congenital adrenal 
hyperplasia, prostate cancer, neoplasm; adenocarcinoma, lymphoma, uterus cancer, 
fertility, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 

30 immunodeficiencies, graft versus host disease, AIDS, bronchial asthma, Crohn's disease; 
multiple sclerosis, treatment of Albright Hereditary Osteodystrophy, and other diseases, 
disorders and conditions of the like. 

These methods of treatment will be discussed more fully, below. 
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Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that 
may be utilized include, but are not limited to: (/) an aforementioned peptide, or analogs, 
derivatives, fragments or homologs thereof; (if) antibodies to an aforementioned peptide; 
(Hi) nucleic acids encoding an aforementioned peptide; (rv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion 
within the coding sequences of coding sequences to an aforementioned peptide) that are 
utilized to "knockout" endogenous function of an aforementioned peptide by homologous 
recombination (see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators ( 
i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the 
invention or antibodies specific to a peptide of the invention) that alter the interaction 
between an aforementioned peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that 
may be utilized include, but are not limited to, an aforementioned peptide, or analogs, 
derivatives, fragments or homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide and/or 
RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in 
vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or 
mRNAs of an aforementioned peptide). Methods that are well-known within the art 
include, but are not limited to, immunoassays (e.g., by Western blot analysis, 
immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, and the like). 



Prophylactic Methods 



93 



WO 03/010327 



PCT/US02/14199 



In one aspect, the invention provides a method for preventing, in a subject, a 
disease or condition associated with an aberrant NOVX expression or activity, by 
administering to the subject an agent that modulates NOVX expression or at least one 
NOVX activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
5 NOVX expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 
agent can occur prior to the manifestation of symptoms characteristic of the NOVX 
aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending upon the type of NOVX aberrancy, for example, an NOVX 

10 agonist or NOVX antagonist agent can be used for treating the subject. The appropriate 
agent can be determined based on screening assays described herein. The prophylactic 
methods of the invention are further discussed in the following subsections. 
Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating NOVX 

1 5 expression or activity for therapeutic purposes. The modulatory method of the invention 
involves contacting a cell with an agent that modulates one or more of the activities of 
NOVX protein activity associated with the cell. An agent that modulates NOVX protein 
activity can be an agent as described herein, such as a nucleic acid or a protein, a 
naturally-occurring cognate ligand of an NOVX protein, a peptide, an NOVX 

20 peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or 
more NOVX protein activity. Examples of such stimulatory agents include active NOVX 
protein and a nucleic acid molecule encoding NOVX that has been introduced into the 
cell. In another embodiment, the agent inhibits one or more NOVX protein activity. 
Examples of such inhibitory agents include antisense NOVX nucleic acid molecules and 

25 anti-NOVX antibodies. These modulatory methods can be performed in vitrp (e.g., by 
culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent 
to a subject). As such, the invention provides methods of treating an individual afflicted 
with a disease or disorder characterized by aberrant expression or activity of an NOVX 
protein or nucleic acid molecule. In one embodiment, the method involves administering 

30 an agent (e.g., an agent identified by a screening assay described herein), or combination 
of agents that modulates (e.g. , up-regulates or down-regulates) NOVX expression or 
activity. In another embodiment, the method involves administering an NOVX protein or 
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nucleic acid molecule as therapy to compensate for reduced or aberrant NOVX expression 
or activity. 

Stimulation of NOVX activity is desirable in situations in which NOVX is 
abnormally downregulated and/or in which increased NOVX activity is likely to have a 
5 beneficial effect One example of such a situation is where a subject has a disorder 

characterized by aberrant cell proliferation and/or differentiation (e.g., cancer or immune 
associated disorders). Another example of such a situation is where the subject has a 
gestational disease (e.g., preclampsia). 

1 0 Determination of the Biological Effect of the Therapeutic 

In various embodiments of the invention, suitable in vitro or in vivo assays are 
performed to determine the effect of a specific Therapeutic and whether its administration 
is indicated for treatment of the affected tissue. 

1 5 In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic 
exerts the desired effect upon the cell type(s). Compounds for use in therapy may be 
tested in suitable animal model systems including, but not limited to rats, mice, chicken, 
cows, monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in 

20 vivo testing, any of the animal model system known in the art may be used prior to 
administration to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

25 The NOVX nucleic acids and proteins of the invention are useful in potential 

prophylactic and therapeutic applications implicated in a variety of disorders including, 
but not limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia, 
cancer-associated cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 
Disorder, immune disorders, hematopoietic disorders, and the various dyslipidemias, 

30 metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 
disorders associated with chronic diseases and various cancers. 

As an example, a cDNA encoding the NOVX protein of the invention may be 
useful in gene therapy, and the protein may be useful when administered to a subject in 
need thereof. By way of non-limiting example, the compositions of the invention will 
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have efficacy for treatment of patients suffering from: metabolic disorders, diabetes, 
obesity, infectious disease, anorexia, cancer-associated cachexia, cancer, 
neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 
disorders, hematopoietic disorders, and the various dyslipidemias. 

Both the novel nucleic acid encoding the NOVX protein, and the NOVX protein of 
the invention, or fragments thereof, may also be useful in diagnostic applications, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed. A further use 
could be as an anti-bacterial molecule (i.e., some peptides have been found to possess anti- 
bacterial properties). These materials are further useful in the generation of antibodies, 
which immunospecifically-bind to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 

Sequence Analyses 

The sequence of NOVX was derived by laboratory cloning of cDNA fragments, by 
in silico prediction of the sequence. cDNA fragments covering either the full length of the 
DNA sequence, or part of the sequence, or both, were cloned. In silico prediction was 
based on sequences available in CuraGen's proprietary sequence databases or in the public 
human sequence databases, and provided either the full length DNA sequence, or some 
portion thereof. 

The laboratory cloning was performed using one or more of the methods 
summarized below: 

SeqCalIing™TechnoIogy: cDNA was derived from various human samples 
representing multiple tissue types, normal and diseased states, physiological states, and 
developmental states from different donors. Samples were obtained as whole tissue, 
primary cells or tissue cultured primary cells or cell lines. Cells and cell lines may have 
been treated with biological or chemical agents that regulate gene expression, for example, 
growth factors, chemokines or steroids. The cDNA thus derived was then sequenced using 
CuraGen Corporation's SeqCalling technology which is disclosed in fuliin U. S. Ser. Nos. 
09/417,386 filed Oct. 13, 1999, and 09/614,505 filed July 1 1, 2000. Sequence traces were 
evaluated manually and edited for corrections if appropriate. cDNA sequences from all 
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samples were assembled together, sometimes including public human sequences, using 
bioinformatics programs to produce a consensus sequence for each assembly. Each 
assembly is included in CuraGen Corporation's database. Sequences were included as 
components for assembly when the extent of identity with another component was at least 

5 95% over 50 bp. Each assembly represents a gene or portion thereof and includes 

information on variants, such as splice forms single nucleotide polymorphisms (SNPs), 
insertions, deletions and other sequence variations. 

Variant sequences are also included in this application. A variant sequence can 
include a single nucleotide polymorphism (SNP). A SNP can, in some instances, be 

10 referred to as a "cSNP" to denote that the nucleotide sequence containing the SNP 

originates as a cDNA. A SNP can arise in several ways. For example, a SNP may be due 
to a substitution of one nucleotide for another at the polymorphic site. Such a substitution 
can be either a transition or a transversion. A SNP can also arise from a deletion of a 
nucleotide or an insertion of a nucleotide, relative to a reference allele. In this case, the 

1 5 polymorphic site is a site at which one allele bears a gap with respect to a particular 

nucleotide in another allele. SNPs occurring within genes may result in an alteration of the 
amino acid encoded by the gene at the position of the SNP. Intragenic SNPs may also be 
silent, when a codon including a SNP encodes the same amino acid as a result of the 
redundancy of the genetic code. SNPs occurring outside the region of a gene, or in an 

20 intron within a gene, do not result in changes in any amino acid sequence of a protein but 
may result in altered regulation of the expression pattern. Examples include alteration in 
temporal expression, physiological response regulation, cell type expression regulation, 
intensity of expression, and stability of transcribed message. 

Presented information includes that associated with genomic clones, public genes 

25 and ESTs sharing sequence identity with the disclosed sequence and CuraGen 
Corporations Electronic Northern bioinformatic tool. 

EXAMPLES 

Example A: Polynucleotide and Polypeptide Sequences, and Homology Data 

30 The NOV 1 clone was analyzed, and the nucleotide and encoded polypeptide 

sequences are shown in Table 1 A. 
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Table 1 A. NO VI Sequence Analysis 




SEQ ID NO: 1 j 750 bp 


NOV la, 
CGI 00 126-01 DNA Sequence 

j 
; 


TCCAAACCC^GAACTTCACTCTCTTGGAAACCCACCCAGATCCTCCCCGTTCTGACA 
CCATGGTCAGCTCCTGTTGTGGCTCCGTGTGCTCTGACCAGGGCTGCGGCCAAGACCT 
CTGTCAGGAGACCTGCTGCCGCCCCAGCTGCTGTGAGACCACCTGCTGCAGGACCACC 
TGCTGCCGCCCCAGCTGTTGTGTATCCAGCTG CTG CAGGCCC CAGTGCTGCCAGTCTG 
TGTGCTGCCAACCCACTTGTTCCCGCCCCAGCTGCTGTCAGACCACCTGTTGCAGGAC 
CACCTGCTACCGCCCC^GCTGTTGTGTGTCCAGCTGCrGCAGGCCCCAGTGCTGCCAG 
CCTGTGTGCTGCCAACCCACCTGCTGTCGCCCCAGCTGCTGTGAGACGACCTGCTGCC 
ACCCTAGGTGCTGCATCTCCAGCTGCTGTCGCCCCAGCTGCTGTGTGTCCAGCTGCTG 
CAAGCCCC^GTGCTGCCAGTCTGTGTGCTGCCAGCCCAACTGCTGCCGCCCCAGCTGC 
AGCATCTCCAGCTGCTGCCGCCCCTCTTGCTGTGAATCCAGCTGCTGCCGCCCCTGCT 
GCTGCGTGCGTCCAGTCTGTGGCCGAGTCTCCTGCCACACCACTTGCTATCGCCCAAC 
CTGTGTCATCTCCAGCTGCCCCCGCCCCTTGTGCTGTGCCTCCTCTTCCTGCTGAGCC 
CACTGCCCTGGCTTATCTCCCCCTTCACCACTGGCCCACAGATGTAGACCCTTC 




ORF Start: ATG at 61 


ORF Stop: TGA at 691 




SEQ ID NO: 2 


210 aa MW at 22433. OkD 


NOV la, 
CG 1 00 1 26-0 1 Protein Sequence 


MVSSCCGSVCSDQGCGQDLCQETCCRPSCCETTCCRTTCCRPSCCVSSCCRPQCCQSV 
CCQPTCSRPSCCQTTCCRTTCYRPSCCVSSCCRPQCCQPVCCQPTCCRPSCCETTCCH 
PRCCISSCCRPSCCVSSCCKPQCCQSVCCQPNCCRPSCSISSCCRPSCCESSCCRPCC 
CVRPVCGRVSCHTTCYRPTCVISSCPRPLCCASSCC 



Further analysis of the NOV la protein yielded the following properties shown in 
Table IB. 



Table IB. Protein Sequence Properties NOVla 



Psort analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in mitochondrial 
matrix space; 0.1000 probability located in lysosome (lumen); 0.0000 probability 
located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOVla protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1C. 
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Table 1C. Geneseq Results for NOVla 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOVla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


! AAE02058 


Human four disulfide core domain 
(FDCD)-containing protein - Homo 
sapiens, 230 aa. [WO200I40249-A1, 07- 
JUN-2001] 


1..209 
1..204 


149/218(68%) 
167/218(76%) 


e-102 


ABG29368 


Novel human diagnostic protein #29359 - 
Homo sapiens, 206 aa. [WO200 175067- 
A2, ll-OCT-2001] 


1..2I0 
12..206 


134/210(63%) 
158/210(74%) 


2e-94 


ABG29368 


"Novel human diagnostic protein #29359 - 
Homo sapiens, 206 aa. [WO200175067- 
A2, ll-OCT-2001] 


I..210 
12..206 


134/210(63%) 
158/210(74%) 


2e-94 


ABB 12277 


Human hair keratin associated protein 
homoiogue, SEQ ID NO:2647 - Homo 
sapiens, 120 aa. [WO200157188-A2, 09- 
AUG-2001] 


1..120 
1..120 


119/120 (99%) 
119/120(99%) 


8e-83 


\ AAM79986 

i 


Human protein SEQ ID NO 3632 - Homo 
sapiens. 30 1 aa. [ WO200 1 57 1 90-A2, 09- 
AUG-2001] 


2..210 
80..301 


110/227(48%) 
141/227 (61%) 


2e-70 



In a BLAST search of public sequence datbases, the NOVla protein was found to 
have homology to the proteins shown in the BLASTP data in Table ID. 
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Table ID. Public BLASTP Results for NOVla 


Protein 
Accession 
Number 

i 


Protein/Organism/Length 


NOVla 
Residues/ 

Match 
Residues 


Identities/ 

Cim i laritipc 
ollllllanilcS 

for the 
Matched 
Portion 


Expect 
Value 


Q9BYQ8 

i 

i 

i 


KERATIN ASSOCIATFD PROTFIN 4 9 
- Homo sapiens (Human), 191 aa 
(fragment). 


oo 9 in 
L.191 


1Q 1 /I oi 

(100%) 
191/191 
(100%) 


e-i j / 


Q9BQ66 


KERATIN A^OPI ATFH PROTFIM 

4.12 (SIMILAR TO RIKEN CDNA 
1 1 10054P19 GENE) - Homo sapiens 
(Human), 201 aa. 


i 9 1 n 

1 ..A i U 

1..201 


(87%) 
191/210 
(90%) 


e-128 


Q9BYR0 


KERATIN ASSOCIATED PROTEIN 4.7 
- Homo sapiens (Human), 210 aa. 


1..210 
1..210 


179/215 
' (83%) 
188/215 
(87%) 


e-127 


Q9BYQ5 

i 

i 


KERATIN ASSOCIATED PROTEIN 
4.15 - Homo sapiens (Human), 193 aa 
(fragment). 


13.210 
1..193 


171/207 
(82%) 
175/207 
(83%) 


e-114 


Q9BYQ6 

i 

i 


KERATIN ASSOCIATED PROTEIN 
4.14 - Homo sapiens (Human), 195 aa. 


I ..210 
1..195 


161/210 
(76%) 
172/210 
(81%) 


e-112 



PFam analysis predicts that the NOVla protein contains the domains shown in the 
Table IE. 



Table IE. Domain Analysis of NOVla 


Pfam Domain 


NOVla Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Keratin_B2 


2.. 108 


57/177 (32%) 
92/177 (52%) 


6.5e-12 


Keratin_B2 


109..210 


54/177 (31%) 
82/177(46%) 


3.4e-05 



Example 2. 



The NO V2 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 2A. 
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Table 2A. NOV2 Sequence Analysis 



SEQ ID NO: 3 



3005 bp 



NOV2a, 
CGI 00 146-01 DNA Sequence 



CCAAACACACTAAAATAAATATGAGGTCATCAATCTTTTGTTGGTCTCCTTGGCATGC 
AC CTATT CAG ACTGTTAGTATTATGT ATTT ACTTCAAATTTTAG CAGTT ATATTTT AA 
CTTGATTGATTTTTCCTCAGATATAAGTATGAGAAATGACAGAAAGAAACAACAACTG 
GAAAAG AAG C ATTGCAT AAG ACCAGGAT GT CTCTG AAATGG ACG T CAGT CTTTCTG CT 



GATACAG CTCAGTTTTTACTTTAG CTCTGGGAGTTGTGG AAAGGTG CTGGTATGGCCC 

ACAGAATACAGCCTTTGGATGAATATGAAGACAATCCTGAAAGAACTTGTTCAGAGAG 

GTCATG AGGTG ACTGTACTGG CAT CTTCAG CTTCCATTCTTTTTG ATCCCAACGACTC 

ATCCACTCTTAAACITGAAGTTTATCCTACATCTTTAACTAAAAATGAATTTGAGAAT 

ATCTATC^TGCAATTGGTTAAGAGATGGATATATGGTGTTTCAAAAAGATGCATTTTGG 

TTACCTTTCACAAGAACAAGAAATCCTGTGGGCAATATTATGACATAATTAGAAACTT 

CTGTAAAGATGTAGTTTCAAATAAGAAACTTATGAAAAAACTACAAGAGTCAAGATTT 

GACATCGTTTTTGCAG ATG CTTATTTACCCTGTGGTGAG CTGCTGGCTGAGCTATTTA 

ACATACCCTTTGTGTACAGTCTCCGCrTCTCTGTTGGCTACACAGTTGAGAAGAATGG 

TGGAGGATTTCTGTTCCCTCCTTCCTATGTACCTGTTGTTATGTCAGAATTAAGTGAT 

CAAATGACTTTCATGGAGAGGGTAAAAAATATGATCTATGTGC^^ 

GGTTCGAAATATTTGACATGAAGAAGTGGGATCAGTTTTATAGTGAAGTTCTAGGAAG 

ACCCACTACATTATCTGAGACAATGGGGAAAGCrcACGTATGGCTTATTCGAAACTAC 

TGGGATTTTCAATTTCCTCACCCACTCTTACCAAATGTTGAGTTCGTTGGAGGACTCC 

ACTGCAAACCTGCCAAACCCCrrACCGAAGGAAATGGAAGAGTTTGTCCAGAGCTCTGG 

AGAAAATGGTGTTGTGGTGTTTTCTCTGGGGTCGATGGTCAGTAACATGTCAGAAGAA 

AGTGCCAACATGATTGCATCAGCCCTTGCCCAGATCCCACAAAAGGTTCTATGGAGAT 

TTGATGGCAAGAAGCCAAATACTTTAGGTTCCAATACTCGACTGTACAAGTGGATACC 

CCAGAATGACCTTCTAGGTCATCCAAAGACCAAAGCrTTTATAACTCATGGTGGAACC 

AATGG CATCTATGAGGCAATCTACCATGGG ATCCCTATGGTGGG CATTCCCTTGTTTG 

CGGATCAACATGATAACATTGCTCACATGAAAGCCAAGGGAGCAGCCCTCAGTGTGGA 

CATCAGGACCATGTCAAGTAGAGATTTGCTCAATGCATTGAAGTCAGTCATTAATGAC 

CCTATCTATAAAGAGAATATCATGAAATTATCAAGAATTCATCATGATCAACCAATGA 

AG CCCCTGGATCG AGC^GTCTT CTGG ATTGAGTTTGTCATG CGCCACAAAGG AGCCAA 

GCACCTTCGAGTCGCAGCCCACAACCTCACCrGGTTCCAGTACCACTCTrTGGATGTG 

ATAGCATTCCTGCTGGCCTGCGTGGCAACTATGATATTTATGATCACAAAATGTTGCC 

TGTTTTGTTTCCGAAAGTTTGCTAGAAAAGCAAAGAAGGGAAAAAATGATTA GTTATA 



TCTGAGATTTGAAGCTGGGGAATTCCGTTTATTGAAGATTCAGGTTAACCTGAATCAA 
GTTAACCCAGTCTCAAATGCTCACTTATCCrTTCT 

CTGGATTGCCAAAGAAAATTCAAATTATTCTTCAATTAGTCAGGATGATTTGACTATC 

AGCAGTTCATAGTACCCATCTTCATAACTAAGCCACCTAGGGATCCAGCAGAAAAAAA 

AGGGATGAGGGGAGTCATCATACAGGAGGTGGATCATTTACCAGGATCCACACTTCCT 

ACAAAGCGGTTGTAATGTTAAATAACAAAACTGTTTTTTAT^ 

ACAGGAATAATTGTATACITTCTTACrAATGTGTTCCATGGAGTT^ 

GTGGCTTAGGGGAAAATGAGCCCCAGTAATGCTT^ 

GACCCTTTGACTTTCTGCCCCAGCCCCTCrrTAGTTCTCCTAGAATTAGGACrAAGGTT 
AAGTGCCCTCTTGGGATATGACTTCCTTCCCTTCCTCITGATACAAAAAGAGCCrATT 
AC(2AACCCTCATACA<^CAAGAGTTCCCTTCCTAGTTGCAGACrCTTCTGCrrCCAGCT 
GGACTCCCCTAGCTCTGGACTCCCACTAGATCACACAGGGGTCCCrGCATGTCAGTAA 
AC!TTTGGATGACCTTGGGAGACGAAAAAATGGAATAT(^TTTTTTGATCTAAACAAAA 
TAGTTTCOTGATTTAACACTGGCC^GGAAGGTGGGCTGCACCCTCAGTCrCTCTCTCC 
CAT CATGGTTTTCACATG ATATCAAAG GACT CT CAT AACAGT CTG ATt'Cl'TATG AGTT 
GGGCATCCTGTGTTTCCCTTTAGGGGCCTGCTTCCTTCAAATAGAGGAGATGGGTGCT 
ATGAAACCrATTCACrCTGGACTTGGGATGGCTCTTCTCCATCTTCCCAAGTCTGAGC 
TGGAGCCTCCATGCCCiACTCTGCTCTGCTCTTCTATTTCCTGACAGC^GCTAAGGCA 
TGTTCCZTGTTCTGCCCCC1AAATTGACCTTACTAACAGTGAG 

GGGTCTTGGGAAATCCAAGTTTTCCCGGAAACGTTTTGTTGTAAACAGTGTCCACACT 
CTTTGCTCCAATAAAGCTCGGTTCCTTAAGCCAAAAAAAAAAAAAAA 





ORF Start: ATG at 201 


ORF Stop: TAG at 1791 




SEQ ID NO: 4 


530 aa 


MW at 60923.7kD 


NOV2a, 
CGI 00 146-01 Protein Sequence 


MS LKWTSVFLLI QLS FYFS SGS CGKVLVWPTE YS LWMNMKTI LKELVQRGHEVTVIiAS 
SASILFDPNDSSTLKLEVYPTSLTKNEFENI IMQLVKRWIYGVS KRCILVTFHKNKKS 
CGQYYDI IRNFCKDWSNKKLMKKLQESRFDIVFADAYLPCGELLAELFNI PFVYSLR 
FSVGYTVEKNGGGFLFPPSYVPVVMSELSDQMTFMERVKNMIYVLYFDFWFEIFDMKK 
WIX3FYSEVI^PTTIiSETMGKADVWLIRNYWDFQFPH^ 

KEMEEFVQS SGENGVWFSLGSMVSNMSEES ANMI ASAXAQI PQKVLWRFDGKKPNTL 
GSNTRLYKWI PQNDLLGHPKTKAF I THGGTNG I YEAI YHG I PMVGI PLFADQHDNIAH 
MKAKGAALSVDIRTMSSRDLLNALKSVINDPIYKENIMKLSRIHHDQPMKPLDRAVFW 
IEFVMRHKGAKHLRVAAHNLTWFQYHSLDVIAFL1ACVATMIFMITKCCLFCFRKFAR 
KAKKGKND 
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Further analysis of the NOV2a protein yielded the following properties shown in 
Table 2B. 



Table 2B. Protein Sequence Properties NOV2a 


PSort analysis: 


0.8200 probability located in endoplasmic reticulum (membrane); 0.4600 probability 
located in plasma membrane; 0.2000 probability located in lysosome (membrane); 
0.1790 probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 25 and 26 



A search of the NOV2a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 2C. 



1 

Table 2C. Geneseq Results for NOV2a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV2a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


| AAY78935 


Human UDP- 
glucuronosyltransferase 2B15 amino 
acid sequence - Homo sapiens, 530 
aa. [WO200006776-A1, 10-FEB- 
2000] 


1..530 
1..530 


447/530 (84%) 
485/530 (91%) 


0.0 


AAW47126 


Uridine diphospho- 
glucuronosyltransferase 2B17 
(UGT2B17) enzyme - Homo 
sapiens, 530 aa. [W09744466-A1, 
27-NOV-1997] 


. 1..530 
1..530 


445/530 (83%) 
480/530 (89%) 


0.0 


ABG05523 


Novel human diagnostic protein 
#5514 - Homo sapiens, 533 aa. 
[WO200175067-A2, ll-OCT-2001] 


1..530 
5..533 


442/530 (83%) 
477/530 (89%) 


0.0 


ABG05523 


Novel human diagnostic protein 
#5514 - Homo sapiens, 533 aa. 
[WO2001 75067- A2, ll-OCT-2001] 


L.530 
5..533 


442/530 (83%) 
477/530 (89%) 


0.0 


ABG05524 


Novel human diagnostic protein 
#55 1 5 - Homo sapiens, 532 aa. 
[WO200175067-A2, ll-OCT-2001] 


1..530 
4..532 


446/531 (83%) 
479/531 (89%) 


0.0 



In a BLAST search of public sequence datbases, the NOV2a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 2D. 
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Table 2D. Public BLASTP Results for NOV2a 


Protein 
Accession 
Number 

1 
i 


Protein/Organism/Length 


NOV2a 
Residues/ 
Match Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


! P54855 


UDP-glucuronosyltransferase 2B 1 5 
precursor, microsomal (EC 2.4. 1.17) 
(UDPGT) (UDPGTH-3) (HLUG4) - 
Homo sapiens (Human), 530 aa. 


1..530 
1..530 


447/530 
(84%) 

485/530 
(91%) 


0.0 


075795 


UDP-glucuronosyltransferase 2B 1 7 
precursor, microsomal (EC 2.4.1.17) 
(UDPGT) (C19-steroid specific UDP- 
glucuronosyltransferase) - Homo sapiens 
(Human), 530 aa. 


1..530 
1 ..530 


445/530 
(83%) 

480/530 
(89%) 


0.0 


P36537 


UDP-glucuronosyltransferase 2B10 
precursor, microsomal (EC 2.4.1.17) 
(UDPGT) - Homo sapiens (Human), 528 
aa. 


1..530 
I..528 


451/530 
(85%) 

484/530 
(91%) 


0.0 


PI 6662 

; 


UDP-glucuronosyltransferase 2B7 
precursor, microsomal (EC 2.4.1.17) 
(UDPGT) (3,4-catechol estrogen specific) 
(UDPGTH-2) - Homo sapiens (Human), 
529 aa. 


1..530 
I. .529 


442/530 
(83%) 

477/530 
(89%) 


0.0 


; 002663 

! 


UDP-glucuronosyltransferase 2B9 
precursor, microsomal (EC 2.4.1.17) 
(UDPGT) - Macaca fascicularis (Crab 
eating macaque) (Cynomolgus monkey), 
529 aa. 


L.530 
L.529 


441/530 
(83%) 

479/530 
(90%) 


0.0 



PFam analysis predicts that the NOV2a protein contains the domains shown in the 
Table 2E. 



Table 2E. Domain Analysis of NO V2a , 


Pfam Domain 


NOV2a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


UDPGT 


24..528 


356/507 (70%) 
491/507(97%) 


0 



Example 3. 



The NOV3 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 3 A. 



103 



WO 03/010327 PCT/US02/14199 



Table 3A. NOV3 Sequence Analysis 




SEQ ID NO: 5 


2310 bp 


NOV3a, 
CGI 00 179-01 DNA Sequence 

1 
i 

! 


TGGCCGGCGATAAGGGTTT(^CCTTCAGGACTGGAGCTCCACAAGATTCAACCGTTAT 
ATCAACCTCCCCGGCCC^GGTCTTACCATCACAGCGTCACAAACTCCAGGTCGCCTAG 
GCGCTGCGCAGGAAGCGCTTGCCAGCCCCGGACTTCTGCGCGCGCTGCATGCCCATTG 
GATGTGCTCATTGCCACCCCAGCCAATCCCGAAACTCGCTCGGACGCTGACAGAAGAC 
TGCGCCGCTGCTTTGGGATTGGTAGCTGAGTTTTGTGTCGCGCCTTTTCTGACGATGC 
GAACAACATGGCGGCGGAAAGTGGTAGCGATTTTCAGCAGAGACGTAGAAGGCGCCGG 
GACCCX5GAGGAACCGGAAAAAACAGAACTCAGCGAAAGAGAGCTGGCAGTAGCAGTGG 
CGGTGTCCCAGGAGAACGATGAGGAGAACGAAGAGCGCTGGGTTGGACCTTTACCTGT 
GGAGGCAACACTGGCCAAGAAGAGGAAAGTCTTAGAGTTTGAAAGAGTCTATCTTGAT 
AATCTCCCCAGTGCATCCATGTATGAGCGCAGTTACATGCATAGAGATGTTATCACCC 
ATGTGGTATGCACCAAGACAGATTTTATTATTACTGCCAGTCATGATGGACATGTCAA 
GTTCTGGAAAAAAATAGAAGAGGGAATTGAATTTGTTAAACATTTTCGTAGTCACCTG 


i 
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ATGATAAAGCAATGAAGGTGTTTGATGTAGTGAACTTTGACATGATCAACATGCTGAA 
ACTTGGGTATTTTCCTGGACAGTGTGAGTGGATCTATTGCCCAGGGGATGCAATTTCT 
TCAGTTGCTGCTTCCGAAAAGAGTACAGGAAAAATTTTCATTTATGATGGCCGAGGAG 
ATAACCAGCCACITCATATTTTTGACAAACTCCATACATCACCTCTTACTCAGATACG 
GCTAAACCCAGTTTACAAAGCAGTAGTGTCTTCTGACAAATCTGGGATGATTGAATAC 
TGGACTGGGCCTCCTCATGAATATAAATTCCCCAAAAATGTGAACTGGGAATATAAAA 
CTGACACTGA1TTATATGAATTTGCCAAGTGTAAGGCTTATCCAACCAGCGTATGTTT 
TTCACCAGATGGGAAGAAAATAGCTACTATTGGTTCTGATAGAAAAGTTAGAATTTTC 
AGATTTGTAACTGGAAAACTCATGAGAGTCTTTGATGAATCACTAAGCATGTTTACTG 
AACTGCAACAGATGAGGCAACAGTTACCAGACATGGAATTTGGCCGACGAATGGCTGT 
AGAACGTGAGTTGGAGAAGGTTGATGCTGTAAGATTAATTAATATAGTTTTTGATGAA 
ACTGGACACTTCGTGCTGTATGGAACAATGCTGGGCATTAAAGTTATAAATGTAGAGA 
CAAACAGGTGTGTGCGGATTTTAGG CAAAC AAG AAAATATT AG AGTG ATGCAATTGG C 
TTTGTTCCAGGGGATAGCCAAAAAGCATCGTGCTGCAACTACTATAGAAATGAAAGCT 
TCTGAAAATCCTGTTCTTCAGAATATTCAAGCTGACCCAACAATAGTCTGTACATCAT 
TCAAAAAGAATAGATTTTATATGTTTACCAAACGAGAACCAGAAGATACGAAAAGTGC 
AGATTCTGATCGAGATGTTTTTAATGAGAAACCTTCTAAAGAAGAAGTCATGGCAGCT 
ACTCAAGCTGAAGGACCTAAACGAGTTTCGGACAGTGCCATTATCCACACCAGCATGG 
GAGACATTCACACCAAACTTTTTCCTGTTGAGTGCCCTAAGACAGTGGAAAACTTCTG 
TGTTCACAGC AGAAATGGTT ATT ATAATGGG C ATACATTTCACCGTATAATTAAGGG C 
TTTATGATTCAGACTGGAGATCCAACAGGTACTGGTATGGGAGGAGAAAGCATATGGG 
GAGGAGAATTTGAAGATGAATTTCATTCAACATTACGACATGACAGGCCGTACACACT 
CAGCATGGCTAACGCGGGATCAAATACTAATGGATCCCAGTTTTTCATAACGGTAGTA 
CCAACGCCTTGGCTTGATAATAAGCATACAGTATTTGGACGAGTGACTAAAGGAATGG 
AAGTTGTACAGAGGATCTCCAACGTCAAAGTCAATCCCAAAACAGATAAGCCCTATGA 
GG ATGTCAGCAT CAT AAATATTACTGT CAAGT AAAAT AAG ATTTGTTTTAATGT ACTT 
GCAAATAAAAATACAATATTAAACAGATTATTTTACATTAGGAAGCTT 




ORF Start: ATG at 298 


ORF Stop: TAA at 2236 




SEQ ID NO: 6 


646 aa 


MW at 735742kD 


NOV3a, 
CGI 00 179-01 Protein Sequence 


MAAESGSDFQQRRRKRRDPEEPEKTELSERELAVAVAVSQENDEENEERWVGPLPVEA 
TI^KKRKVLEFERVYLDNLPSASMYERSYMHRDVITHWCTKTDFIITASHDGHVKFW 
KKI EEGI EFVKHFRSHLGVI ES I AVSSEGALFCSVGDDKAMKVFDVVNFDMINMLKLG 
YFPGQCEWI YCPGDAI S SVAAS EKSTGKI FI YDGRGDNQPLHI FDKLHTSPLTQ IRLN 
PVYKAVVSSDKSGMIEyWTGPPHEYKFPKNVNWEyKTDTDLYEFAKCKAYPTSVCFSP 
DGKKIATIGSDRJCVRIFRFVTGKLMRVFDESLSMFTELQO^QQLPDMEFGRRMAVER 
ELEKVDAVRLINIVFDETGHFVLYGTMLGIKVINV^ 

QGI AKKHRAATTI EMKAS EN PVLQNIQADPT I VCTS FKKNRFYMFTKREPEDTKS ADS 
DRDVFNEKPSKEEVMAATQAEGPKRVSDSAI I HTSMGDIHTKLFPVECPKTVENFCVH 
SRNGYYNGHTFHRIIKGFMIQTGDPTGTGMGGESIWGGEFEDEFHSTLRHDRPYTLSM 
ANAG SNTNGS Q F F I TWPTPWLDN KHTVFGRVTKGMEWQRI SNVKVN PKTD KPYEDV 
SIINITVK 



Further analysis of the NOV3a protein yielded the following properties shown in 
Table 3B. 
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Table 3B. Protein Sequence Properties NOV3a 


PSort analysis: 


0.7000 probability located in nucleus; 0.4969 probability located in microbody 
(peroxisome); 0.3600 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV3a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 3d 



Table 3C. Geneseq Results for NOV3a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
«, Date] 


NOV3a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAM79775 


Human protein SEQ ID NO 342 1 
- Homo sapiens, 660 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..646 
15..660 


644/646 (99%) 
645/646(99%) 


0.0 


AAM78791 


Human protein SEQ ID NO 1453 
- Homo sapiens, 565 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


82..646 
1..565 


565/565(100%) 
565/565(100%) 


0.0 


AAM43579 


Human polypeptide SEQ ID NO 
257 - Homo sapiens, 483 aa. 
[WO200155308-A2, 02-AUG- 
2001] 


165..646 
2..483 


478/482 (99%) 
478/482(99%) 


0.0 


AAM43649 


Human polypeptide SEQ ID NO 
327 - Homo sapiens, 420 aa. 
[WO200155308-A2, 02-AUG- 
2001] 


227..646 
1..420 


420/420 (100%) 
420/420(100%) 


0.0 


ABB59961 


Drosophila melanogaster 
polypeptide SEQ ID NO 6675 - 
Drosophila melanogaster, 342 aa. 
[WO200171042-A2, 27-SEP- 
2001] 


7.343 
2.335 


' 178/342(52%) 
241/342(70%) 


e-100 



In a BLAST search of public sequence datbases, the NOV3a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 3D. 
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Table 3D. Public BLAST? Results for NOV3a 


Protein 
Accession 
j Number 


Protein/Organism/Length 


NOV3a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q96BP3 

t 

j 


HYPOTHETICAL 73.6 KDA 
PROTEIN - Homo sapiens (Human), 
646 aa. 


1..646 
1..646 


646/646(100%) 
646/646(100%) 


0.0 


| Q 15002 


KIAA0073 PROTEIN - Homo 
sapiens (Human), 645 aa (fragment). 


2..646 
1..645 


645/645(100%) 
645/645(100%) 


0.0 


Q960Q8 


LD39850P - Drosophiia 
melanogaster (Fruit fly), 637 aa. 


7..643 
2..633 


376/642(58%) 
487/642 (75%) 


0.0 


Q8W4D0 

* 
i 

j 


HYPOTHETICAL 71.0 KDA 
PROTEIN - Arabidopsis thaliana 
(Mouse-ear cress), 63 1 aa. 


25..645 
10..628 


334/633 (52%) 
446/633 (69%) 


0.0 


! Q9LXM7 

1 
i 


HYPOTHETICAL 70.7 KDA 
PROTEIN - Arabidopsis thaliana 
(Mouse-ear cress), 629 aa. 


25..645 
10..626 


334/633 (52%) 
444/633 (69%) 


0.0 



PFam analysis predicts that the NOV3a protein contains the domains shown in the 
Table 3E. 



f "• — — — — — . 

Table 3E. Domain Analysis of NOV3a 


Pfam Domain 


NOV3a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


WD40 


125.J61 


7/37(19%) 
29/37 (78%) 


0.27 


WD40 


272..308 


13/37 (35%) 
30/37(81%) 


0.94 


pro_isomerase 


501. .643 


80/164(49%) 
115/164(70%) 


1.2e-61 



Example 4. 



The NOV4 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 4A. 
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Table 4A. NOV4 Sequence Analysis 




SEQIDNO:7 


1060 bp 


NOV4a, 
CGI 002 12-01 DNA Sequence 


CAATGATTGrrrCAAAGAGTGGTATTGAATTCTCGACCTGGTAAAAATGGTAATCCAGT 
GG CAG AG AATTTC CG AATGG AAG AAG TCT ATTT ACC AGAT AAT ATTAATG AAGG ACAA 
GTACAAGTTAGAACTCTTTATCTTTCTGTGGATCCTTACATGCGTTGTAGAATGAATG 
AAGACACTGGCACTGATTATATAACACCTTGGCAGCTATCTCAAGTCGTTGATGGAGG 
AGGTATTGGAATTATAGAAGAAAGCAAACACACAAATTTGACTAAAGGCGATTTTGTG 
ACnTCTTTCTATTGGCCCTGGCAAACCAAGGTTATTCTGGATGGAAATAGCCTTGAAA 
AGGTGGACCCACAACTTGTGGATGGACACCTTTCATATTTTCTTGGAGCTATAGGTAT 
GCCTGGTTTGACTTCCTTGATTGGGATACAGGAAAAAGGTCATATAACTGCTGGATCT 
AATAAGACAATGGTTGTCAGTGGGGCCGCAGGTGCCTGTGGATCTGTGGCTGGGCAGA 
TTGGCCATTTCTTAGGTTGTTCCAGAGTGGTGGGAATTTGTGGAACACATGAGAAATG 
CATCCTCTTGACCTCAGAACTGGGCTTTGATGCTGCAATTAATTATAAAAAAGACAAT 
GTGGCAGAACAGCTCCGTGAATC^TGCCCAGCTGGAGTGGATGTTTATTTTGACAATG 
TTGGTGGTAACATCAGTGATACAGTGATAAGTCAGATGAATGAGAACAGCCACATCAT 
CCTGTGTGGTC^U^TTTCTCAGTACAACAAAGATGTGCCTTATCCTCCCCCGCTATCC 
C CTG CT AT AG AGG CAAT C CAG AAAGAAAG AAACAT CAC AAGGG AAAGATTTCTGGT AT 
TAAATTATAAAGACAAATTTGAGCCTGGCATTCTACAGCTGAGTCAGTGGTTTAAAGA 
AGGAAAGCTAAAGGTAAAAGAGACGGTAATAAATGGGTTGGAAAACATGGGAGCATTC 
C AGTC C ATG ATG ACAGG AG GTAACATTGGAAAG C AG AT AGT TTG CATTTCAG AAGAAA 
TCTCTTTGTAATTGCT 




ORF Start: ATG at 3 


ORF Stop: TAA at 1053 




SEQ ID NO: 8 


350 aa 


MWat38413.5kD 


NOV4a, 
CGI 002 12-01 Protein Sequence 


MIVQRVVI.NSRPGKNGNPVAENFRMEEVYLPDNINEGQVQVRTLYLSVDPYMRCRMNE 
DTGTDYITPWQLSQWDGGGIG 1 1 EESKHTNLTKGDFVTS FYWPWQTKVI LDGNSLEK 
VD PQLVDGHLSYFLGAI GMPGLTS LI G I QEKGH I TAG SNKTMWS GAAG ACGSVAGQ I 
GHFLGCSRWG I CGTHEKC I LLTS ELG FD AAINYKKDNVAEQLRES C PAGVDVYFDNV 
GGNISDTVISQMNENSHI ILCGQI SQYNKDVPYPPPLSPAI EAIQKERNITRERFLVL 
NYKDKFEPGILQLSQWFKEGKLKVKETVINGLENMGAFQSMMTGGNIGKQIVCISEEI 
SL 




SEQ ID NO: 9 


853 bp 


1NUV4D, 

CG 100212-02 DNA Sequence 


CMTGAT^GTTC^^GAGTGGTATTGAATTCTCGACCTGGTAAAAATGGTAATCCAGT 

GGCAGAGAATTTCCGAATGGAAGAAGTCTATTTACCAGATAATATTAATGAAGGACAA 

GTACAAGTTAGAACTCTTTATCTT^CTGTGGATCCTTACATGCGTTGTAGAATGAATG 

AAGAC ACTGG CACTG ATTAT ATAACACCTTGG CAG CTATCT CAAGTCGTTG ATGG AGG 

AGGTATTGGAATTATAGAAGAAAGCAAACAG&GAAATT^ 

ACTTCTTTCTATTGGCCCTGGCAAACCAAGGTTAT^ 

AGGT AGACCCACAACTTGTGG ATGGACACCTT T CAT ATTTT CTTGGAG CTATAGGT AT 
GCCTGGTTTGACTTCCTTGATTGGGATACAGGAAAAAGGTCATATAACTGCTGGATCT 
AATAAGACAATGGTTGTCAGTGGGGCCGCAGGTGCCTGTGGATCTGTGGCTGGGCAGA 
TGAATGAGAACAGCCACATCATCCTGTGTGGTCAAATTTCTCAGTACAACAAAGATGT 
GCCTTATCCTCCCCCGCTATCCCCTGCTATAGAGGCAATCCAGAAAGAAAGAAACATC 
ACAAGGGAAAGAT TT CTGGT ATT AAATT AT AAAGACAAATTTG AG C CTG GCATTCT AC 
AGCTG AGTCAGTG GTTT AAAGAAGGAAGG CTAAAG ATT AAAG AG ACGGT AATAAATGG 
GTTGGAAAACATG GG AG CTG CATT CCAGTCCATGATGAC AG GAGGTAACATTGGAAAG 
CAGATAGTTTGCATTTCAGAAGAAATCTCITTGTAATTG CT 




ORF Start: ATG at 3 


ORF Stop: TAA at 846 




SEQ ID NO: 10 


281 aa 


MWat31055.3kD 


NOV4b, 
CGI 002 12-02 Protein Sequence 


MIVQRVVLNSRPGKNGNPYAENFRMEEVYLPDNINEGQVQWTLYLSVDPYMRORMNE 
DTGTDYI TPWQLS QWDGGG I G 1 1 EESKHTNLTKGDFVTS FYWPWQTKVILDGNSLEK 
VDPQLVDGHLSYFLGAI GMPGLTS LI GI QEKGH I TAGSNKTMWSGAAGACGSVAGQM 
NENS H 1 1 LCGQI S QYNKDVPYPPPLS PAI EAI QKERN ITRERFLVLNYKDKFE PGI LQ 
LSQWFKEGRLKIKETVINGLENMGAAFQSMMTGGNIGKQIVCISEEISL 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 4B. 
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Table 4B. Comparison of NOV4a against NOV4b. 


Protein Sequence 


NOV4a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV4b 


1..183 
1..186 


154/186 (82%) 
158/186 (84%) 



Further analysis of the NOV4a protein yielded the following properties shown in 
Table 4C. 



| Table 4C. Protein Sequence Properties NOV4a 


PSort analysis: 


0.6500 probability located in cytoplasm; 0.1572 probability located in lysosome 
(lumen); 0.1000 probability located in mitochondrial matrix space; 0.0000 
probability located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV4a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 4D. 



i 

Table 4D. Geneseq Results for NOV4a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV4a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG64386 


Human alcohol dehydrogenase 39 - 
Homo sapiens, 351 aa. [ WO200 155404- 
A1.02-AUG-2001] 


1..350 
1..351 


349/351 (99%) 
350/351 (99%) 


0.0 


AAU33607 


Pseudomonas aeruginosa cellular 
proliferation protein #51 - Pseudomonas 
aeruginosa, 345 aa. [WO2001 70955- 
A2,27-SEP-2001] 


4..347 
9.344 


131/347(37%) 
202/347(57%) 


4e-58 


AAU33607 


Pseudomonas aeruginosa cellular 
proliferation protein #5 1 - Pseudomonas 
aeruginosa, 345 aa. [WO200 170955- 
A2, 27-SEP-2001] 


4..347 
9.344 


131/347 (37%) 
202/347 (57%) 


4e-58 


AAU34567 


E. coli cellular proliferation protein 

#148 - Escherichia coli, 376 aa. 
[WO200170955-A2, 27-SEP-2001] 


4.345 
40.373 


133/350 (38%) 
203/350 (58%) 


2e-56 


AAU34567 


E. coli cellular proliferation protein 

#148 - Escherichia coli, 376 aa. 
[WO200170955-A2, 27-SEP-2001] 


4.345 
40.373 


133/350(38%) 
203/350(58%) 


2e-56 
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In a BLAST search of public sequence datbases, the NOV4a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 4E. 



Table 4E. Public BLASTP Results for NOV4a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV4a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q8VDQ1 


SIMILAR TO RIKEN CDNA 
B830026H24 GENE - Mus musculus 
(Mouse), 351 aa. 


1..350 
1..351 


308/351 (87%) 
328/351 (92%) 


0.0 


Q9D1W8 


B830026H24RIK PROTEIN - Mus 
musculus (Mouse), 35 1 aa. 


1..350 
1..351 


304/351 (86%) 
327/351 (92%) 


0.0 


Q09593 

i 


HYPOTHETICAL 59.0 KDA 
PROTEIN Ml 06.3 IN 
CHROMOSOME II - Caenorhabditis 
elegans, 529 aa. 


. 6..345 
27..374 


161/348(46%) 
218/348 (62%) 


6e-83 


034812 


YFMJ PROTEIN - Bacillus subtilis, 
339 aa. 


4..346 
6..337 


146/346 (42%) 
208/346 (59%) 


7e-61 


Q9HR85 

i 
1 


QUINONE OXIDOREDUCTASE - 
Halobacterium sp. (strain NRC-1), 380 
aa. 


7.346 
53..380 


144/343(41%) 
200/343 (57%) 


6e-60 



PFam analysis predicts that the NOV4a protein contains the domains shown in the 
Table 4F. 



Table 4F. Domain Analysis of NOV4a 


Pfam Domain 


NOV4a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


adhjrinc 


22..344 


77/472(16%) 
216/472(46%) 


3e-16 



Example 5. 



The NOV5 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 5 A. 
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Table 5A. NOV5 Sequence Analysis 




SEQ1DNO: 11 


978 bp 


NOV5a, 
CG100222-01 DNA Sequence 


GATGATTCATGCTGAGATGGACCCTGAAGAAGCACTTTGTTGGCAATCCTACTAATAG 
TGACTTTGAGTTGAAGACAGCTGAGCTCCCACCCTTAAAAAATAGAGAGGTCCTGCTT 
GAAGCTTTGTTCCTCACTGGGGACCCTTACATGAGAGTGGCAGCCAAAAGACTGAAGG 
AAGGTGATACAATGATGGGGCAGCAAGTGGCCAGATTTGTGGAAAGTAAAAACGCAGC 
CTTACCAAACAGAACTATTGTACTGGCTCCTTCAGGCTGGACAGCACACTCCATTGCT 
CATGGAAAAGATCTGGAAAACCTGCCAACAGAATGGCCTGACACAGTACCACTGTCTC 
TGGCTCTGGGG AC AGTTAGCATGACAGG CCTCACTG CCTACTTTGGCCTACTTGACAT 
CTGTGGTGTGAAGGGTGGAGAAACAGTGATGGTTAATGCAGCAGCTGGAGCTGTGGGC 
TCTGTTTTGGG G CAGATT ACAAAGCTCAAGGGCTGCAAAGTGGTTGGAGCAGTAGGGT 
CTGACGAAAAGACTGCCTGCCTTAAAAAGCTTGGATCTGATGTCTTTAACTACAAGAC 
AGTAGAGTCTTTGGAAGAAACTTTGAAGAAAGCCTCTCCTGATGGTTACAATTGTTAT 
TTTGGTAATGTAAGTGGATGGTTTTCAAACACTGTTATCTCCCAGATGAAGAAATTTG 
GAAGAATTGCCACATGTGGAGCTATCTCTATATATAACACCAGCCCACTTCCCCCAGG 
CCCAATCCCAGAGATTGTTATGTATCAGGAGCTTCACATGGAAGGGTTCATTGTGACC 
TGCTGGCC^GGAGATGCCCACCAAAAAGCTCTAAAGGACTTGCTGAAATGGGTCTCAG 
AGGGTCAAATCCAGTACCAGGAATATATCATTGAAGGATATGAAAACATGCCAGCTGC 
ATTTATGGGAATGCTGAAAGGAGATAATTTGGAAAAACAATAGTGAAAGC 




ORF Start: ATG at 9 


ORF Stop: TAG at 969 




SEQ ID NO: 12 


320 aa 


M Wat 34932.1 kD 


NOV5a, 
CGI 00222-01 Protein Sequence 


MLRWTLKKHFVGNPTNSDFELKTAELPPLKNREVLLEALFLTGDPYMRVAAKRLKEGD 
TMMGQQVARFVESKNAALPNRTIVLAPSGWTAHSIAHGKDLENLPTEWPDTVPLSLAL 
GTVSMTGLTAYFGLLD I CGVKGGETVMVNAAAGAVGSVLGQITKLKGCKWGAVGSDE 
KTACLKKIX3SDVFNYKTVESLEETLKKASPDGYKCYFGNVSGWFSNTVISQMKKFGRI 
ATCGAI S I YNTS PL P PG P I PE XVMYQELHM EG F I VTCWPGDAHQKALKDLLKWVS EGQ 
IQYQEYI I EGYENMPAAFMGMLKGDNLEKQ 



Further analysis of the NOV5a protein yielded the following properties shown in 
Table 5B. 



Table 5B, Protein Sequence Properties NOV5a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.1644 probability located in microbody 
(peroxisome); 0.1620 probability located in lysosome (lumen); 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV5a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 5C. 
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Table SC. Geneseq Results for NOV5a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, Date] 


NOV5a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM39I69 


Human polypeptide SEQ ID NO 2314 - 
Homo sapiens, 329 aa. [WO2001533 12- 
AI,26-JUL-2001] 


4..319 
7..324 


269/318(84%) 
281/318(87%) 


e-151 


AAM40955 


Human polypeptide SEQ ID NO 5886 - 
Homo sapiens, 417 aa. [WO200153312- 
A1,26-JUL-2001] 


4..319 
60..412 


267/353 (75%) 
280/353 (78%) 


e-145 


AAU37324 


Staphylococcus aureus cellular 
proliferation protein #1494 - 
Staphylococcus aureus, 334 aa. 
[WO200170955-A2, 27-SEP-2001] 


12..320 
15.328 


120/319(37%) 
175/319(54%) 


5e-49 


AAU37324 


Staphylococcus aureus cellular 
proliferation protein #1494 - 
Staphylococcus aureus, 334 aa. 
t WO200 1 70955-A2, 27-SEP-200 1 ] 


12..320 
15..328 


120/319(37%) 
175/319(54%) 


5e-49 


AAU36657 


Staphylococcus aureus cellular 

proliferation protein #827 - 
Staphylococcus aureus, 335 aa. 
[WO200170955-A2, 27-SEP-2001] 


12..320 
15..328 


120/319(37%) 
174/319(53%) 


6e-49 



In a BLAST search of public sequence datbases, the NOV5a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 5D. 
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Table 5D. Public BLASTP Results for NOVSa 



Protein 
Accession 
Number 


< 

Protein/Organism/Length 


NOV5a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


Q9H1X6 


BAI6L21.1.I (NADP-DEPENDENT 
LEUKOTRIENE B4 12- 
HYDROXYDEHYDROGENASE 
(ISOFORM 1)) - Homo sapiens (Human), 329 
aa. 


4.319 
7.324 


269/318 
(84%) 

281/318 
(87%) 


e-151 


Q14914 


NADP-dependent Ieukotriene B4 12- 
hydroxydehydrogenase (EC 1.1.1.-)- Homo 
sapiens (Human), 31 1 aa (fragment). 


4.305 
7.310 


256/304 
(84%) 

268/304 
(87%) 


e-143 


062642 


15-OXOPROSTAGLANDIN 13- 
REDUCTASE - Sus scrofa (Pig), 329 aa. 


4.319 
7.324 


248/318 
(77%) 

272/318 
(84%) 


e-141 


Q29073 


NADP-dependent Ieukotriene B4 12- 
hydroxydehydrogenase (EC 1.1.1.-)- Sus 
scrofa (Pig), 329 aa. 


4.319 v 
7.324 


247/318 
(77%) 

271/318 
(84%) 


e-141 


P97584 


NADP-DEPENDENT LEUKOTRIENE B4 
1 2-HYDROXYDEHYDROGENASE (EC 
1.1.1.-) (DITHIOLETHIONE-INDUCIBLE 
GENE-1) - Rattus norvegicus (Rat), 329 aa. 


4.319 
7.324 


239/318 
(75%) 

266/318 
(83%) 


e-136 



PFam analysis predicts that the NOV5a protein contains the domains shown in the 
Table 5E. 



Table 5E. Domain Analysis of NOVSa 


Pfam Domain 


NOV5a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


adh_zinc 


18.319 


63/471 (13%) 
206/471 (44%) 


1.2e-12 



Example 6. 



The NOV6 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 6A. 
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Table 6A, NOV6 Sequence Analysis 




SEQ ID NO: 13 


696 bp 


NOV6a, 
CGI 00266-01 DNA Sequence 


GTACTACATTTAAATCAATGTGACAAAGTCATTTTGTCTTTATGCAAACAATCAACTT 
GGT ATTTTCGTATAGTAACAG AAAAAATGGG CTTTG CAGACG CCGCTG C CACCGGGAG 
CCCTCTACTATCAGCCATGGTCAACCCCACCGTGTTCCTCGACATCGCTGTCCACGGT 
G AGCCCTTGGGTG GTATCCCCTT CGAGCTGTTTGCAGACAAGATCCCAAAG ACACCAG 
AAAACTTTCATGCTCTAAGCACTGGAGAAAAAGGATTTGGTTATAAGGGTTCCTGCTT 
TCACAGAATTATTCCCGGGGTTATGTGTCAGCGTGGTGACTTCACAAGTCATAATGGC 
ACTGGTGGCAAGTCCATCC^GGGGAGAAATTTGATGSACAAGAACTTCATCCTAAAGC 
AQaCAGGTCCTGGCATCTTGTCCATGGCAAATACTGGACCCAACACAAAC^ 
GTGTTTCATCTGCACTGCCAAGGCTGAGTGGTTGGATGGCAAGCATGTGGTCTTTGGC 
AAGGTGAAAGAAGGCAT AAAT AT TGTCGAGG CCATGGAGCGCTTTGGGTCCAGGAATG 
GCAAGACCAGCAAGAAGATCACCATTTCTGACTGTGGAAAACTCTAATACGTTTGACT 
TGTGTTTG ATATTTTATCTTAACCAC CAG ACT ATTCCrrCTGTAGCTCAGGAG AG CAC 




ORF Start: ATG at 85 


ORF Stop: TAA at 625 




SEQ ID NO: 14 


180 aa 


MW at 19224.9kD 


NOV6a, 
CGI 00266-01 Protein Sequence 


MGPADAAATGSPLLSAMVNPTVFLDIAVHGEPLGGIPFELFADKIPKTPENFHALSTG 
EKGFGYKGSCFHRIIPGVMCQRGDFTSHNGTGGKSIQGEKFDDKNFILKHAGPGILSM 
ANTGPNTNSFQCFICTAKAEWLDGKHVVFGKVKEGINIVEAMERFGSRNGKTSKKITI 
SDCGKL 




SEQ ID NO: 15 


545 bp 


NOV6b, 
! CGI 00266-02 DNA Sequence 

I 

; 

i 


AAATGGGCTTTG C AGACGCCG CTG CCACCGGG AGCCCTCTACTATCAGTCATGGTCAA 
CCCCACCGTGTTCCTCGACATCGCTGTCCACGGTGAGCCCTTGGGTGGTATCCCCTTC 
G AG CTGTTTG CAG ACAAG ATC C C AAAG AC AC C AG AAAACTTTC ATG CT CT AAG CACTG 
GAGAAAAAGGATTTGGTTATAAGGGTTCCTGCTTTCACAGAATTATTCCCGGGGTTAT 
GTGTC^GCGTGGTGACTTCACAAGTCATAATGGCACTGGTGGCAAGTCCATCCAAGGG 
GAGAAATTTGATGACAAGAACTTCATCCTAAAGCACGCAGGTCCTGGCATCTTGTCCA 
TGGCAAATACTGGACCCAACACAAACAGTTTCCAGTGTTTCATCTGCACTGCCAAGGC 
TGAGTX3GTTGGATGGCAAGCATGTGGTCITTGGCAAGGTGAAAGAAGGCATAAATATT 
GTCG AGG CCATGG AG CG CTTTG GGTC CAGGAATGGCAAGACCAGCAAG AAG ATCACCA 
TTTCTGACTGTGGAAAACTCTAA 




ORF Start: ATG at 3 


ORF Stop: TAA at 543 




SEQ ID NO: 16 


180 aa 


MWat 19252.9kD 


NOV6b, 
CG 100266-02 Protein Sequence 


MGFADAAATGSPLLSVMVNPTVFLD I AVHGEPLGGI PFELFADKIPKTPENFHALSTG 
EKGFGYKGSCFHRIIPGVMCQRGDFTSHNGTGGKSIQGEKFDDKNFILKHAGPGILSM 
ANTGPNTNS FQCFICTAKAEWLDGKHWFG KVKEG I NI VEAMERFGSRNGKTS KKI T I 
SDCGKL 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 6B. 



Table 6B. Comparison of NOV6a against NOV6b. 


Protein Sequence 


NOV6a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV6b 


1..180 
1..180 


179/180 (99%) 
179/180 (99%) 



Further analysis of the NOV6a protein yielded the following properties shown in 
Table 6C. 
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Table 6C. Protein Sequence Properties NOV6a 


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in iysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV6a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 6D. 



Table 6D. Geneseq Results for NOV6a 



1 

Geneseq 
Identifier 

! 


Protein/Organism/Length [Patent 
#, Date] 


NOV6a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


; AAU01I95 

i 

i 

i 


Human cyclophilin A protein - 

Homo sapiens, 165 aa. 
[WO200132876-A2, 10-MAY- 
2001] 


17.. 180 
1..164 


141/164 (85%) 
148/164 (89%) 


2e-81 


, AAW56028 


Calcineurin protein - Mammalia, 
165 aa. [WO9808956-A2, 05- 
MAR-1998] 


17..I80 
1..164 


141/164 (85%) 
148/164 (89%) 


2e-81 


AAR 13726 


Bovine cyclophilin - Bos taurus, 
163 aa. [US5047512-A, 10-SEP- 
1991] 


18..180 
1..163 


140/163 (85%) 
147/163 (89%) 


4e-81 


AAG65275 


Haematopoietic stem cell 
proliferation agent related human 
protein #2 - Homo sapiens, 164 aa. 
[JP2001163798-A, 19-JUN-2001] 


18.. 180 
1..I63 


140/163 (85%) 
147/163 (89%) 


7e-81 


AAP90431 


Cyclophilin - Homo sapiens 
(human), 164 aa. [EP326067-A, 
02-AUG-1989] 


18.. 180 
1.163 


140/163 (85%) 
147/163 (89%) 


7e-81 



In a BLAST search of public sequence datbases, the NOV6a protein was found to 
5 have homology to the proteins shown in the BLASTP data in Table 6E. 
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Table 6E. Public BLASTP Results for NOV6a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV6a Residues/ 
Match Residues 


Identities/ 

Similarities for 
the Matched 
Portion 


Value 


CAC39529 


SEQUENCE 26 FROM PATENT 
WO0 132876 - Homo sapiens 
(Human), 165 aa. 


17..180 
L.164 


141/164 (85%) 

1 to/ 1 VJ*t ^07 /O j 


5e-81 


P04374 


Peptidyl-prolyl cis-trans 
isomerase A (EC 5.2.1.8) 
(PPIase) (Rotamase) (Cyclophilin 
A) (Cyclosporin A-binding 
protein) - Bos taurus (Bovine), 
and, 163 aa. 


18.. 180 
I.. 163 


140/163 (85%) 
147/163 (89%) 


1e-80 


Q9BRU4 


PEPTIDYLPROLYL 
ISOMERASE A 
(CYCLOPHILIN A)- Homo 
sapiens (Human), 165 aa. 


17..180 
1..164 


140/164(85%) 
147/164 (89%) 


2e-80 


P05092 


Peptidyl-prolyl cis-trans 
isomerase A (EC 5.2. 1 .8) 
(PPIase) (Rotamase) (Cyclophilin 
A) (Cyclosporin A-binding 
protein) - Homo sapiens 
(Human),, 164 aa. 


18..180 
L.163 


140/163(85%) 
147/163(89%) 


2e-80 


Q96IX3 


PEPTIDYLPROLYL 
ISOMERASE A 
(CYCLOPHILIN A) - Homo 
sapiens (Human), 1 65 aa. 


1 7.. 180 
1..164 


140/164(85%) 
147/164(89%) 


5e-80 



PFam analysis predicts that the NOV6a protein contains the domains shown in the 
Table 6F. 



Table 6F. Domain Analysis of NOV6a 


Pfam Domain 


NOV6a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


projsomerase 


21..180 


104/179(58%) 
140/179(78%) 


3e-85 



Example 7. 



The NOV7 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 7A. 
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Table 7A. NOV7 Sequence Analysis 




SEQ ID NO: 17 


665 bp 


NOV7a, 
CGI 00427-01 DNA Sequence 


GTAWUVTAAAATAAAGTTAAATTAATTAAAATCAAATAAAATTAJUU^CTTTAGGCT^ 
G CAGACACTGC CACCACTGGGCX3CCTCATACTG CCAGCCATGGTCAACCCTACTGTGT 
TCTTCGACATCACCGTCGATGGCGAGCCCTTGGGCAGCGTCTCCTTCGAGCTGTTTGC 
AGAGAAGTTTCCAAAGACAGAAGAAAACTTTCATCTTCTGAGCACTGGAGAGAAAGGA 
TTTGGTTATAAAAGTTCCTGCTTTCACAGAATTATTCCAGGGTTTATGTGTCGGGGTG 
ATGACTTCACATGCCATAACAGCACTGGTGGCAAGTCCGTCTATGGGGAGAAGTTTGT 
TGACGAGAACTTCGTCCTGAAGCATACAGATCCTGGCATCTTGTCCAGGGCAAATGCT 
GGACCCAGTACAAATGGTTCCCAGTTTTTTACCTGCACTGCTAAGACCGAGTGGTTCG 
GCAAGGTGAAAGAACGCATGATTATTGTGGAGGCCATGGGGCGCTTTGCGGGGAATGG 
GAAGACCAGCAAGAAGATCGCTGTTGCTGACTGTGGGCTACTCTAATATGTTTGACTT 
GGGTTTATCTTAACCAGCAGACCATTCTTTCTGTAGCTCCAGAGAGCACCCCTCTACC 
CCATTTGCTCG CAGTATCCTAGAATCT 




ORF Start: ATG at 98 


ORF Stop: TAA at 566 




SEQ ID NO: 18 


156 aa 


MWat !7112.3kD 


NOV7a, 
| CGI 00427-01 Protein Sequence 


MVNPTVFFDITVDGEPLGSVSFELFAEKFPKTEENFHLLSTGEKGFGYKSSCFHRIIP 
GFMCRGDDFTCHNSTGGKSVYGEKFVDENFVLKHTDPGILSRANAGPSTNGSQFFTCT 
AKTEWFGKVKERM 1 1 VEAMGRFAGNGKTS KKI AVADCGLL 



Further analysis of the NOV7a protein yielded the following properties shown in 
Table 7B. 



[ 

Table 7B. Protein Sequence Properties NOV7a 


' PSort analysis: 


0.6500 probability located in plasma membrane; 0.6400 probability located in 
microbody (peroxisome); 0.4500 probability located in cytoplasm; 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV7a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 7C. 
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Table 7C. Geneseq Results for NOV7a 


Geneseq 
Identifier 

i 

i 


Protein/Organism/Length [Patent #, 
Date] 


NOV7a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU01195 


Human cyclophilin A protein - Homo 
sapiens, 165 aa. [WO200132876-A2, 
10-MAY-2001] 


1..156 
I..I64 


1 29/164 (78%) 
135/164(81%) 


5e-70 


AAW56028 


Calcineurin protein - Mammalia, 165 aa. 
[WO9808956-A2, 05-MAR- 1 998] 


1 1 <£. 
I.. I JO 

K.164 


100/ 1/^/1 f1QO/\ 

135/164 (81%) 


je-/u 


AAG65275 


Haematopoietic stem cell proliferation 
agent related human protein U2 - Homo 
sapiens, 164 aa. [JP2001 163798-A, 19- 
JUN-2001] 


2.. 1 56 
1..163 


128/163 (78%) 
134/163 (81%) 


2e-69 


AAP90431 


Cyclophilin - Homo sapiens (human), 
164 aa. [EP326067-A, 02-AUG-1989] 


2.. 156 
1..163 


128/163 (78%) 
134/163 (81%) 


2e-69 


AAG03831 


Human secreted protein, SEQ ID NO: 
7912 - Homo sapiens, 165 aa. 
[EP1 033401 -A2, 06-SEP-2000] 


1..156 
1..164 


128/164(78%) 
134/164 (81%) 


3e-69 



In a BLAST search of public sequence datbases, the NOV7a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 7D. 
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Table 7D. Public BLASTP Results for NOV7a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV7a 
Residues/ 

Match 
Residues 


Identities/ 
• Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC39529 


SEQUENCE 26 FROM PATENT 
WO01 32876 - Homo sapiens (Human), 
165 aa. 


1..156 
1..164 


129/164(78%) 
135/164(81%) 


le-69 


Q9BRU4 


PEPTIDYLPROLYL ISOMERASE A 
(CYCLOPHILIN A) - Homo sapiens 
' (Human), 165 aa. 


1..156 
1..164 


129/164(78%) 
134/164(81%) 


3e-69 


Q96IX3 


PEPTIDYLPROLYL ISOMERASE A 
(CYCLOPHILIN A) - Homo sapiens 
(Human), 165 aa. 


1..156 
1..164 


129/164(78%) 
134/164 f81%> 


3e-69 


P05092 


Peptidyl-prolyl cis-trans isomerase A 
(EC 5.2. 1 .8) (PPIase) (Rotamase) 
(Cyclophilin A) (Cyclosporin A-binding 
protein) - Homo sapiens (Human),, 1 64 
aa. 


2..156 
L.163 


128/163(78%) 
134/163 (81%) 


4e-69 


P04374 


Peptidyl-prolyl cis-trans isomerase A 
(EC 5.2.1.8) (PPIase) (Rotamase) 
(Cyclophilin A) (Cyclosporin A-binding 
protein) - Bos taurus (Bovine), and, 163 
aa. 


2..154 
I..161 


127/161 (78%) 
133/161 (81%) 


le-68 



PFam analysis predicts that the NOV7a protein contains the domains shown in the 
Table 7E. 



Table 7E. Domain Analysis of NOV7a 


Pfam Domain 


NOV7a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


pro_isomerase 


5..I54 


94/177(53%) 
126/177(71%) 


7.9e-61 



Example 8. 



The NOV8 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 8A. 
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Table 8A. NOV8 Sequence Analysis 



SEQ ID NO: 19 



2646 bp 



NOV8a, 
CG 100456-0 1 DNA Sequence 



ATGTGGCTGCAGCAGCGGCTCAAGGGGCTGCCGGGACTGCTGTCGAGCAGCTGGGCCC 
GCCGCCTCCTCTGCCTGCTTGGCCTCCTGCTGCTGCTTCTGTGGTTTGGGGGGTCCGG 
CGCGCGGCGGGCGGCGGGCGGCCTGCACCTGCTGCCCTGGTCCCGCGGTGAGCCGGGC 
GCCGCCGAGCCGTCTGCCTGCCTGGAGGCGGCCACCCGCGCCTGGCGCGGCCTGCGGG 
AGCGCGGTGAGGTGGTACCGCTGGGTCCTGGAGTGCCGGCCCTGGTGGCCAACGGCTT 
CCTGGCCCTGGACGTGGCTGCCAATCGGCTGTGGGTGACTCCCGGGGAGCGGGAGCCC 
GC CGTGGCGCCGG ACTTTGTGCCC TTCGTGC AG CTGCGC CCGCTG AG CGCG CTGG CTG 
AAGCTGGAGAGG CGGTG CTGCTG CTGCGGGAGGGGCTTCTG CGCCGCGTGCGTTG CCT 
GCAG CTGGGGTC CCCAGGTCCTGG CCCCGTGG CCGCCGG CCCCGGGCCCGCCTCCGTC 
TCTGGCCTTGCCGCGGGGTCCGGCCGCGACTGCGTGCTGCTGCAAGAGGACTTTCTGG 
CGCACAGGGGCCGACCCCACGTCTACCTGCAGCGCATCCAGCTCAACAACCCCACGGA 
GCGCGTGGCCGCGCTGCAGACrrGTGGGGCCCACTGCCGGCCCAGCCCCCAATGCCrrTC 

CGCCTACGCCCACTGGGTTGGTGCACCTGGTGGTGGTGGCCGCCAAGAAGCTGGTGAA 

CCGCCTCCAAGTGGCTCCCAAGACXSCAGCrcGATGAGACGGTGCTGTGGGTGG^^ 

GTCTCTGGCCCCATTAACCCCCAGGTGCTCAAAAGCAAAGCAGCCAAGGAGCTCAAGG 

CG CTGCAGGACTTGG CACGGAAGG AAATG CTGG AGCTCTTGGACATG CCAG CGGCGGA 

GCTGCITCAAGACCACCAGCTCCTCTGGGCTCAGCTCTTCAGCCCAGGTGTGGA 

AAG AAGAT CACTG ACAC C CACACG C CGT CTGG C CT CAC CGTG AAC CTG ACGCT CT AT T 

AC^TGCTCTCCTGCTCGCCAGCCCCACTGCTCAGCCCCTCCCTGAGCCACAGGGAGCG 

AGACC^ATGGAGTCGACGCTCAACTATGAAGATCACTGCTTCAGCGGGCACGCCACC 

ATGCACGCCGAGAACCTGTGGCCGGGGCGGCTGTCCTCCGTCCAGCAGATCCTGCAGC 

TCTCTGACCTGTGGAGGCTGACCCTCCAGAAGCGTGGCTGCAAGGGGCTGGTGAAGGT 

GGGTGCCCCAGGCATCCTGCAGGGGATGGTGCTCAGCTTTGGGGGGCTGCAGTTCACA 

GAGAACCACCTCCAGTTCCAGGCCGACCCCGACGTGCTGCACAACAGCTATGCATTGC 

ATGGCATCCGCTACAAGAACGACCATATCAACCTGGCCGTGCTGGCGGATGCCGAGGG 

C^GCCCTACCTACACGTGTCCGTGGAGTCCCGTGGCCAGCCTGTCAAGATCrATGCC 

TGCAAGGCAGGCTGCCTGGACGAGCCAGTGGAGCTGACCTCGGCGCCCACGGGCCACA 

CCTTCTCGGTCATGGTGACACAGCCCATCACGCCACTGCTCTACATCTCCACCGACCT 

CACACACCTGCAGGACCTGCGGCACACGCTGCACCTCAAGGCCATCCTGGCCCATGAT 

CCCT AAT C ACC CT CTT C CAC C T CTT C CT CTT C AAG CT C ATCT ACAAC G AGTACTGTGG 
GCCTGGAGCCAAGCCCCTCTTCAGGAGTAAGGAAGATCCCAGTGTCTG AGTGAACTAA 
CAGTCCTGCITTCAGCCACC^TTTGCACAAGACACCCAGC^CTGAAAGTCCCGCTGCC 
AGGAGCAAGGGATCCTTTGGAAGCACCCGCCCTTTGTGCCTTGTTGGGGGAAACCGGT 
GACG C AGAAGTGAGTGTGGATACACCAG AG TTTGCATTGGAAGGAATG AGTGT CACGT 
GGGGAGGGAAGGGGCCAGTGGACCTTTTGTAAGCTTTCCACTCAATAAAATGAACCT 

AGGAAATAGATCATCTTTTGAAAAGACTCTTGTCTAGGAAAAGTTGTGTCCTTTTCCT 
AATTTAACGTGTTCTTTCTTAATGAAGTTTTAATTTATTT^ 

ATWCTTTTGCATCCCCTGTAGATGGTGAGTGTTGGCGGTGATGTCCGTCTCGGCGTT 
CGGAGGCCCCACGGTCCCGAGGCTGGGCCGGGGCCCCCCAGGGTGGCTGTGCTGCTGC 
CTGTAGGAGGGTGCGGGTTGTGCTGTCATCCTCGGGTTTGCACGCCC1TTTTTAGGAG 
CCTGTGGACATCTGTGGTTTTGTACrriTGGGGCITCAGGG 

GTGATTGATGATTGTCAG^TTTTG AAAT AC CAAAG CTTTTTTGTTCTG 1 1' i AAAT A 
AATATCrrTCAAACTTTCAAAAAAAAAAAAAAAAAA 





ORF Start: ATG at 1 


ORF Stop: TGAat 1903 




SEQ ID NO: 20 


634 aa 


MWat69142.3kD 


NOV8a, 
CGI 00456-01 Protein Sequence 


MWLQQRLKGLPGLItS S SWARRLLCI1LGLLLLLLWFGGSGARRAAGGI1HLLPWSRGE PG 
AAEPSACLEAATRAWRGLRERGEVVPI^PGVPALVANGFLALDVAANRLWVTPGEREP 
AVAPDFVPFVQLRPLSAIAEAGEAVLIJiREGL^ 

S GLAAGSGRD CVLLQEDFLAHRGRPHVYLQR I QLNN PTERVAALQTVGPTAG PAPNAF 

TSTLEKVGDHQFIjLYSGRSPPTPTGLVHLVWAAKKLVNRI^VAPKTQLDETVLWVVH 

VSG P INPQVLKS KAAKELKALQDLARKEMLELLDM PAAELIiQDHQLLWAQLFS PGVEM 

KKITDTHTPSGLTVNLTIiYYMLSCSPAPLLSPSLSHRERDQMESTIOT 

MHAENLWPGRliSSVCKJII^LSDLWRLTLQKRGCKGLVKVGAPGILQGMVLSFGGLQFT 

EOTII^FQADPDVLHNSYALHGIRYKNDHINIAVIJU)AEGKPYLHVSVESRGQPV 

ckagcxdepveltsaptghtfsvmvtqpitpij^ 
ehmaqqdpglpflfwfsvaslitlfhlflfkliyneycgpgakplfrskedpsv 



Further analysis of the NOV8a protein yielded the following properties shown in 
Table 8B. 
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Table 8B. Protein Sequence Properties NOV8a 


PSort analysis: 


0.4600 probability located in plasma membrane; 0.3000 probability located in 
lysosome (membrane); 0.2800 probability located in endoplasmic reticulum 
(membrane); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 41 and 42 



A search of the NOV8a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 8C. 



Table 8C. Geneseq Results for NOV8a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV8a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB43028 


Human ORFX ORF2792 polypeptide 
sequence SEQ ID NO:5584 - Homo 
sapiens, 454 aa. [WO200058473-A2, 
05-OCT-2000] 


182..634 
2..454 


433/453 (95%) 
433/453 (95%) 


0.0 


ABB63158 

i 


Drosophila melanogaster polypeptide 
SEQ ID NO 16266 - Drosophila 
melanogaster, 606 aa. 
[ WO200 1 7 1 042-A2, 27-SEP-200 1 ] 


111..627 
96..602 


155/557 (27%) 
259/557 (45%) 


5e-55 


AAB27712 


Human secreted protein #3 i - Homo 
sapiens, 108 aa. [WO200055201-A1, 
21-SEP-2000] 


456..524 
30..98 


69/69(100%) 
69/69(100%) 


2e-34 


AAB27756 


Protein fragment encoded by gene 3 1 

- Homo sapiens, 65 aa. 
[ WO20005520 1 -A 1 , 2 1 -SEP-2000] 


524..585 
4..65 


61/62(98%) 
62/62 (99%) 


5e-29 


AAB27755 


Sequence homologous to protein 
fragment encoded by gene 3 1 - Homo 
sapiens, 65 aa. [WO200055201-A1, 
21 -SEP-2000] 


523..585 
2..65 


27/66 (40%) 
37/66 (55%) 


3e-05 



In a BLAST search of public sequence datbases, the NOV8a protein was found to 
5 have homology to the proteins shown in the BLASTP data in Table 8D. 
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Table 8D. Public BLASTP Results for NOV8a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV8a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q91X21 


RIKEN CDNA 25 1 00j9O 1 8 
GENE - Mus musculus (Mouse), 
634 aa. 


1 A 

1..oj4 
1..634 


607/634 (94%) 




Q9CY11 


2510039O18RIK PROTEIN - Mus . 
musculus (Mouse), 634 aa. 


1..634 
1..634 


581/634 (91%) 
607/634(95%) 


0.0 


Q9BSY1 


SIMILAR TO RIKEN CDNA 
2510039018 GENE -Homo 
sapiens (Human), 238 aa 
(fragment). 


400..634 
4..238 


235/235 (100%) 
235/235(100%) 


e-138 


Q9VQ60 


CG7289 PROTEIN - Drosophila 
melanogaster (Fruit fly), 606 aa. 


111. .627 
96..602 


155/557 (27%) 
259/557 (45%) 


le-54 


Q95RT9 


LD12115P- Drosophila 
melanogaster (Fruit fly), 637 aa. 


111. .627 
96..633 


155/566 (27%) 
269/566 (47%) 


2e-54 



PFam analysis predicts that the NOV8a protein contains the domains shown in the 
Table 8E. 



I 1 — 

Table 8E. Domain Analysis of NOV8a 



Pfam Domain 


NOV8a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 





Example 9. 



The NOV9 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 9 A. 
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Table 9 A NOV9 Sequence Analysis 




SEQIDNO:21 905 bp ^ 


NOV9a 
CGI 00466-01 DN A Sequence 


CCTGGCAGGTGGAGTGGCCGAAGCCATCTCCAAGACAGCGGTAGCGCCCATCGAGCAG 
GTCAAGCTGCTGCTGCAGGTG CAGCATGCCAG CAAGCAG ATCACCTCAG CTAAGCAAT 
ACAAGGGCATTATAGACTGCGTGGTCCGTATTCCCAAGAAGCAGGGAGTCCTGTCCTG 
GCGCGGTAACCTGGCCAATGCCATCAGATACTTCCCCACCCAGGCTTTTAACTTCGCC 
TTCAAAGATAAATACAAGCAGATCTTCCTGGGTGGTGTGGACAAAAGAACTCAGTTTG 
GGCGCTACTTTGCAGGGAATCTGGCATCAGGAGGTGCCGCTGGGGCCACATACTTGTG 
TTTTGTGTACCCTCTTGATTTTGCCCGTACCTGTCTAGCAGCTAATGTGGGTAAAGCT 
GAAGCTGAAAGGGAATTCCGAGGCCTCTGTGACTGCCTGGTTAAGATCTACAAATCTG 
ATGGGATTAAGGGCCTGTACCAAGGCTTTAACGTGTCTATGCAGGGTATTATCCGAGC 
TG CCTACTTCGGTATCTATGACAC CGCAAAGGGAATGCTTCCGGATCCCAAGGACACT 
CACATCGTCATCAGCTGGATGACCACACAGACTGTCACTGCCTTTTCTGGGTTGACTT 
CCTATTCATTTGACATCGTTCG CGTG ATGATTCAGTCAGGG CGCAAAGTAACTGACAT 
CATGTACACAGGCACACTTGACTGCTGGAGGAAGATTGCTGGTGArGAAGGAGGCAAA 
GCTTTTTTCAAGGGTTCATGGTCCAGTGTTCTCAGAGGCATGGGTGGTGCTTTTGTGC 
TTGTCTTGTTTGATGAAATCAAGAAGTACAGGTAA 




ORF Start: ATG at 24 


ORF Stop: TAA at 903 




SEQ ID NO: 22 


293 aa MW at 32215.1kD 


NOV9a, 
CGI 00466-01 Protein Sequence 


MTDAAVSFAKDFLAGGVAEAIS KTAVAPI EQVKLLLQVQHASKQITSAKQYKGI IDCV 

VRIPKKQGVLSWRGNLANAIRYFPTQAFNFAFKDKYKQIFLGGVDKRTQFGRYFAGNL 

ASGGAAGATYLCFVYPLDFARTCLAANVGKAEAEREFRGLCDCHjVKIYKSDGIKGLYQ 

GFNVSMQG 1 1 RAAYFG I YDTAKGMLPDPKDTH I VI S WMTTQTVTAFSGLTSYSFDI VR 

VMIQSGRKVTDIMYTGTLDCWRKIAGDEGGKAFFKGSWSSVl^GMGGAFVLVLFDEIK 
KYR 



Further analysis of the NOV9a protein yielded the following properties shown i: 
Table 9B. 



Table 9B. Protein Sequence Properties NOV9a 


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.2400 probability located in lysosome (lumen); 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV9a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 9C. 
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Table 9C. Geneseq Results for NOV9a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV9a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAU 10379 

! 
1 


Human adenine nucleotide 
translocator 2 (ANT2) - Homo 
sapiens, 298 aa. [ WO200 1 85944-A2, 
15-NOV-200I] 


1..292 
1..297 


262/297 (88%) 
275/297 (92%) 


e-145 


AAU01199 


Human adenine nucleotide 
translocator-2 (ANT-2) protein - 
Homo sapiens, 298 aa. 
[WO200132876-A2, 10-MAY-2001] 


1..292 
1 ..297 


262/297 (88%) 
275/297 (92%) 


e-145 


AAY71032 


Human adenine nucleotide 
translocator ANT2 - Homo sapiens, 
298 aa. [WO200026370-A2, 1 1- 
MAY-2000] 


I. .292 
1..297 


262/297 (88%) 
275/297 (92%) 


e-145 


AAU 10380 


Human adenine nucleotide 
translocator 3 (ANT3) - Homo ■ 
sapiens, 298 aa. [WO200185944-A2, 
I5-NOV-2001] 


1..291 
1..296 


242/296 (81%) 
267/296 (89%) 


e-137 


AAU01200 

I 


Human adenine nucleotide 
transIocator-3 (ANT-3) protein - 
Homo sapiens, 298 aa. 
[WO2001 32876- A2 7 10-MAY-2001] 


1..29I 
1..296 


242/296 (81%) 
267/296 (89%) 


e-137 



In a BLAST search of public sequence datbases, the NOV9a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 9D. 
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Table 9D. Public BLAST? Results for NOV9a 


Protein 
Accession 

t INUI11UCI 

! 


Protein/Organism/Length 


NOV9a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


\ P05I4I 


A DP, ATP carrier protein, fibroblast 
isoform (ADP/ATP transiocase 2) 
(Adenine nucleotide translocator 2) 
(ANT 2) - Homo sapiens (Human), 
298 aa. 


1 70? 
I..297 


ZOH/Zy / {po /o) 

276/297 (92%) 


e-I4o 


A29132 


ADP,ATP carrier nrotein T9 - human 
298 aa. 


1..297 


zoz/zy/ (oo%) 
275/297 (92%) 


e-145 


Q09073 

i 
t 


ADP,ATP carrier protein, fibroblast 
isoform (ADP/ATP transiocase 2) 
(Adenine nucleotide translocator 2) 
(ANT 2) - Rattus norvegicus (Rat), 
298 aa. 


1..292 
1..297 


261/297 (87%) 
275/297 (91%) 


e-145 


' P51881 

J 


ADP,ATP carrier protein, fibroblast 
isoform (ADP/ATP transiocase 2) 
(Adenine nucleotide translocator 2) 
(ANT 2) - Mus musculus (Mouse), 
298 aa. 


I ..292 
I. .297 


260/297 (87%) 
274/297 (91%) 


e-144 


1 BAB84673 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 2 - Bos taurus 
(Bovine), 298 aa. 


1..292 
1..297 


259/297 (87%) 
274/297 (92%) 


e-144 



PFam analysis predicts that the NOV9a protein contains the domains shown in the 
Table 9E. 



r - 1 — ; — — ■ ' ■■ ■ . 

Table 9E. Domain Analysis of NOV9a 


Pfam Domain 


NOV9a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mitotan* 


7.. 104 


35/125 (28%) 
86/125 (69%) 


6.8e-28 


mito_carr 


111.205 


37/125 (30%) 
80/125 (64%) 


1.2e-18 


mito_carr 


206.293 


22/125(18%) 
61/125 (49%) 


7.6e-07 



Example 10. 



The NOV1 0 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 10A. 
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Table 10A. NOV10 Sequence Analysis 




SEQ ID NO: 23 


858 bp 


MOV 10a, 
CGI 00609-01 DNA Sequence 


ACATAGGCCC^GGTCTGTGGCTGGCOVGCTGCAGCTGTGGGGCCTGGCATGTGTCTCA 
ACATGGCCCTGG AGCTCTACATGGAC CTGCTGTCAG CACCCTG CCGTGCCGTCTACAT 
CTTCTCGAAGAAGCATGAC^TCCAGTTCMCTTTCAGTTTGTGGATCrcCTGAAAGGT 
CACCACCACAGCAAAGAATACATTGACATCAACCCCCTCAGGAAGCTTCCCAGCCTCA 
AAGATG GGAAATTT AT CTT AAG TG AAAG CGCGG C C AT C CTTTACT ACCTGTG C CG CAA 
GT ACAGCGCACC AT CGCACTGGTG CCCG CCAGACCTGCACG CACGTGCCCGTGTGG AT 
GAGTTCGTGGCTTGGCAACACACGGCCTTTCAGCTGCCCATGAAGAAGATAGTCTGGC 
TCAAGTTGCTGATCCCAAAG AT AACAGGGG AGGAAGTTT C AG CTG AGAAGATGG AGCA 
TGC AGTGG AAGAGGTGAAGAAC AG CCTG CAGCTCTTTGAGGAGTATTTTCTGCAGGAT 
AAGATGTTCATCACCGGGAACCAAATCT CACTGG CTGACCTGGTGGCCGTGGTGGAGA 
TGATG CAGCCCATGGCAGCCAACTATAATGTCTTCCTCAACAGCT CCAAG CT AGCTGA 
GTGGCGTATGCAGGTGGAGCTGAATATTGGCTCTGGCCTCTTTAGGGAGGCCCATGAT 
CGACTAATGCAGTTGGCCGACTGGGACTTTTCAACATTGGATTCAATGGTCAAGGAGA 
ATATTTCTGAGTTGCTGAAGAAGAGCAGGTGACCCTAAGCGCAGCCTGTCCCGCAGGG 
CCTGGCTGGCTTAGCAATCTGAG CCACCTTCCTT AAAGGAAATGTT 




ORF Start: ATG at 49 


ORF Stop: TGA at 784 




SEQ ID NO: 24 


245 aa MW at 28490.9kD 


NOV 10a, 
CGI 00609-01 Protein Sequence 


MCIJWALELYMDLLSAPCMVYI FSKKHDIQ^ 

PSLKDGKFI LS ESAAI L YYLCRKYS APSHWCP PDLHARARVDE FVAWQHTAFQLPM KK 
IVWLKLLIPKITGEEVSAEKMEHAVEEVKNSLQLPEEYFLQDKMFITGNQISIiADLVA 
VVEMMQPMAANYWFLNSSKl^WRMQVELNIGSGLFREAHDRLMQI^WDFSTLDSM 
VKENI SELLKKSR 



Further analysis of the NOV 10a protein yielded the following properties shown in 
Table 10B. 



Table 10B. Protein Sequence Properties NOVlOa 


PSort analysis: 


0.4826 probability located in microbody (peroxisome); 0.4708 probability located in 
mitochondrial matrix space; 0.1732 probability located in mitochondrial inner 
membrane; 0.1732 probability located in mitochondrial intermembrane space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 10a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 10C. 



125 



WO 03/010327 



PCT/US02/14199 





Table IOC. Geneseq Results for NOVlOa 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV 10a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


j AAB85772 


Human drug metabolizing enzyme 
(ID No. 7472055CD1) - Homo 
sapiens, 241 aa. [WO200159127-A2, 
16-AUG-2001] 


5..245 
I. .241 


237/241 (98%) 
238/241 (98%) 


C"l JO 


AAY07034 


Breast cancer associated antigen 
precursor sequence - Homo sapiens, 
240 aa. [WO9904265-A2, 28-JAN- 
1999] 


5..237 
1..231 


116/233(49%) 
156/233(66%) 


9e-63 


AAB54309 


Human DanCreatlV mnrpr antirran 

protein sequence SEQ ID NO:761 - 

Homo sapiens, 255 aa. 
[WO200055320-A1.21-SEP-2000] 


5. .237 
15.. 246 


111/233 (47%) 
152/233 (64%) 


5e-59 


AAU30356 

j 


Novel human secreted protein #847 - 

Homo sapiens, 420 aa. 
[WO200179449-A2, 25-OCT-200 1 ] 


5..223 
107..33I 


93/226(41%) 
132/226 (58%) 


le-38 


AAM03015 

i 

i 

*. 

I 


Peptide #1697 encoded by probe for 
measuring breast gene expression - 

Homo sapiens, 64 aa. 
[WO200157270-A2, 09-AUG-2001] 


181. .244 
1..64 


62/64 (96%) 
63/64 (97%) 


5e-29 



In a BLAST search of public sequence datbases, the NOVlOa protein was found to 
have homology to the proteins shown in the BLASTP data in Table 10D. 
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Table 10D. Public BLASTP Results for NOVlOa 


Protein 
Accession 
Number 

t 
< 

i 


Protein/Organism/Length 


inw v i ua 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


? Q9D4P7 


4930583C14RIK PROTEIN - Mus 
musculus (Mouse), 240 aa. 


5..243 
1..238 


181/239 (75%) 
210/239 (87%) 


e-I03 


1 Q969K8 

1 


GLUTATHIONE S-TRANSFERASE 
TT1 (GLUTATHIONE S- 
TRANSFERASE THETA 1) 
(GLUTATHIONE TRANSFERASE 
TM) (EC 2.5. 1. 18) - Homo sapiens 
(Human), 240 aa. 


5..237 
1..231 


117/233 (50%) 
157/233 (67%) 


3e-63 


Q961Y3 


GLUTATHIONE S-TRANSFERASE 
THETA 1 - Homo sapiens (Human), 
240 aa. 


5..237 
1..231 


117/233 (50%) 
156/233(66%) 


Ie-62 


S44358 


glutathione transferase (EC 2.5.1.18) 
theta-1 [validated] - human, 240 aa. 


5..237 
I. .231 


116/233(49%) 
156/233 (66%) 


2e-62 


P30711 

! 

j 


Glutathione S-transferase theta I (EC 
2.5. 1.18) (GST class-theta) - Homo 
sapiens (Human), 239 aa. 


7..237 
2..230 


115/231 (49%) 
155/231 (66%) 


9e-62 



PFam analysis predicts that the NOVlOa protein contains the domains shown i: 
the Table 10E. 



Table 10E. Domain Analysis of NOVlOa 


Pfam Domain 


NOVlOa Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


GST_N 


4..80 


26/90 (29%) 
57/90 (63%) 


2.7e-12 


GSTC 


90..199 


23/115(20%) 
75/115(65%) 


5.2e-09 



Example 11. 



The NOV1 1 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 1 1 A. 
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ORF Start: ATG at 127 


ORF Stop: TAG at 721 




SEQ ID NO: 28 


198 aa f MW at21674.4kD 


NOV lib, 
CGI 0063 1-02 Protein Sequence 


MAELDPFGAPAGAPGGP ALGNGVAGAGEEDPAAAFLAQQESE I AG I ENDEAFAILDGG 
APGPQPHGEPPGGPDAVDGVMNGEYYQBSNGPTDSYAAI SQVDRLQSEPES IRKWREE 
QMERLEALDANSRKQEAEWKEKAIKELEEWYARQDEQLQKTKANNSTNINHPCYSIiEQ 
A/UiisAijHSHWINYVEFFWTKPFCL 




SEQ ID NO: 29 735 bp 


*~ NOVllc, 

j CG 10063 1-04 DNA Sequence 


CCCAGTCGGCACCACAGCGGTGGCTGCCGGGCGTGGTGTCGGTGGGTTGGTTGGTTTT 
TGTCTCACCGTTGGTGTCCGTGCCGTTCAGTTGCCCGCCATGGCTGAGCTGGATCCGT 
TCGGCGCCCCTGCCGGCGCCCCTGGCGGTCCCGCGCTGGGGAACX3GAGTGGCCGGCGC 
CGG CGAAGAAG ACCCGG CCGCGGCCTTCTTGGCGCAGCAAG AG AG CGAG ATTG CGGGC 
ATCGAGAACGACGAGGCCTTCGCCAATTCTCGGAAGCAAGAAGCAGAGTGGAAAGAAA 
AG GCAATAAAGGAGCT AGAAGAATGGTATG CAAGACAGGACGAGC AG CTACAG AAAAC 
AAT^AGCAAACAACAGGGCAGCAGAAGAAGCCTTTGTAAATGACATTGACGAGTCGTCC 
CC^GGCACTGAGTGGGAACGGGTGGCTCGGCTGTGTGACTTTAACCCCAAGTCTAGCA 
AG CAGGCCAAAG ATGTCTCCCGCATGCGCTCAGTCCTCATCTCCCTCAAG CAGGCCCC 
GCTGGTGCACTGAAGAGCCACCCTGTGGAAACACTACATCTGCAATATCTTAATCCTA 
CTCAGTGAAGCTCTTCACAGTCATTGGATTAATTATGTTGAGTTCTTTTGGACCAAAC 
CTTTTTGTCTTTAGAGTTGTTCATTGTTTGTGATTGCATGTTTCCTTCCTTCAACTGT 
GTTCTGCCTGGCATTCAGAGAGGAGGGAGAGGAGGAAGA 




ORF Start: ATG at 98 


ORF Stop: TGA at 533 




SEQ ID NO: 30 


145 aa I MW at 15769.2kD 


NOVllc, 
CGI 0063 1 -04 Protein Sequence 


MAELDPFGAPAGAPGGPALGNGVAGAGEEDPAAAFLAQQESE IAGI ENDEAFANSRKQ 
EAEWKEKAIKELEEWYARQDEQLQKTKANNRAAEEAFVNDIDESSPGTEWERVARLCD 
FNPKSSKQAKDVSRMRSVLISLKQAPLVH 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 1 IB. 



Table 11B. Comparison of NOVlla against NOVllb and NOVllc. 


Protein Sequence 


NOVlla Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOVllb 


1..161 
1..161 


125/161 (77%) 
125/161 (77%) 


NOVllc 


126..236 
53..145 


93/1 1 1 (83%) 
93/1 1 1 (83%) 



Further analysis of the NOV1 la protein yielded the following properties shown in 
Table 11C. 



Table 11C. Protein Sequence Properties NOVlla 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV1 la protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 ID. 





Table 11D. Geneseq Results for NOVlla 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOVlla 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB7I527 


Drosophila melanogaster 
polypeptide SEQ ID NO 4 1 373 - 
Drosophila melanogaster, 219 aa. 
[WO200171042-A2, 27-SEP-2001] 


20..234 
4..213 


77/231 (33%) 
118/231 (50%) 


2e-27 


AAB57I65 

< 


Human prostate cancer antigen 
Drotein semienrp QFH IP* "MO* 1 7/1 ^ 

Homo sapiens, 116aa. 
[WO200055174-AI, 21-SEP-2000] 


10..79 
7..77 


39/71 (54%) 
42/71 (58%) 


9e-I I 


AAY 0/249 


Human signal peptide containing 
protein HSPP-26 SEQ ID NO:26 - 

Homo sapiens, 82 aa. 
[WO2000006 1 0-A2, 06-J AN-2000] 


161..180 
18..37 


19/20 (95%) 
20/20(100%) 


0.001 


ABB6263I 


Drosophila melanogaster 
polypeptide SEQ ID NO 14685 - 
Drosophila melanogaster, 305 aa. 
[ WO200 1 7 1 042-A2, 27-SEP-200 1 ] 


89.. 17 1 
199..282 


26/86 (30%) 
47/86 (54%) 


0.007 


AAY71045 


Streptococcus pyogenes strain AP49 
partial GRAB protein - 
Streptococcus pyogenes, 271 aa. 
[WO200026240-A2, 11-MAY- 
2000] 


73..219 
98..253 


46/160(28%) 
69/160(42%) 


0.025 



In a BLAST search of public sequence datbases, the NOV1 la protein was found to 
5 have homology to the proteins shown in the BLASTP data in Table 1 IE. 
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Table HE. Public BLASTP Results for NOVlla 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOVlla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


P09496 


v_icuiu in llglll Llldin /\ yL,Ca) - 

Homo sapiens (Human), 248 
aa. 


I..248 


2 j 6/248 (95%) 
236/248 (95%) 


e-133 


LRRTA1 


clathrin light chain Al - rat, 
286 aa (fragment). 


1..236 
39..286 


231/248 (93%) 
234/248 (94%) 


e-130 


P08081 


Clathrin light chain A (Lea) - 
Rattus norvegicus (Rat), 248 
aa. 


1 ..236 
L.248 


231/248 (93%) 
234/248 (94%) 


e-130 


008585 


Clathrin light chain A (Lea) - ! 
Mus musculus (Mouse), 235 
aa. 


1..236 
I. .235 


220/236 (93%) 
223/236 (94%) 


e-123 


| P04973 


Clathrin light chain A (Lea) - 
Bos taurus (Bovine), 243 aa. 


1..236 
1..243 


223/248 (89%) 
226/248 (90%) 


e-121 



PFam analysis predicts that the NOV1 la protein contains the domains shown in 
the Table 11 F. 



Table 11F. Domain Analysis of NOVlla 


Pfam Domain 


NOVlla Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Clathrinjg^ch 


3..235 


154/257(60%) 
226/257 (88%) 


5.2e-142 



Example 12. 



The NOV 12 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 12 A. 
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Table 12A. NOV12 Sequence Analysis 




SEQ ID NO: 31 


1849 bp 


NOV12a, 
CG 1 007 1 0-0 1 DNA Sequence 

| 

i 


CATGTCGTGGCTCTTCGGCGTTAACAAGGGCCCCAAGGGTGAAGGCXaCGGGGCCGCCG 

CCGCCTTTGCCGCCCGCGCAGCCCGGGGCCGAGGGCGGCGGGGACCGCGGTTTGGGAG 

ACCGGCCGGCGCCCAAGGACAAATGGAGCAACTTCGACCCCACCGGCCTGGAGCGCGC 

CGCCAAGGCGGCGCGCGAGCTGGAGCACTCGCGTCAGACACAGGAGCGGCTGTCAGGC 

AGTGCCAGCCCTGAGCAAGTGCCAGCTGGTGAGTGCTGTGCTCTGCAGGAGTATGAGG 

CCGCCGTGGAGCAGCTCAAGAGCGAGCAGATCCGGGCGCAGGCTGAGGAGAGGAGGAA 

GACCCTGAGCGAGGAGACCCGGCAGCACCAGGCCAGGGCCCAGTATCAAGACAAGCTG 

GCCCC^CAGCGCTACGAGGACCMCrGAAGCAGCAGCAACTTCTCAATGAGGAGAATT 

TACGGAAGCAGGAGGAGTCCGTGCAGAAGCAGGAAGCCATGCGGCGAGCCACCGTGGA 

GCGGG AGATGGAGCTG CGGCACAAGAATGAGATG CTG CGAGTGG AGACCG AGGCCCGG 

GCGCGCGCCAAGGCCGAG CGGGAGAATGCAGACATC ATCCGCGAG CAGATCCG CCTGA 

AGGCGTCCGAGCACCGTCAGACCGTCTTGGAGTCCATCAGGACGGCTGGCACCTTGTT 

TGGGGAAGGATTCCGTGCCTTTGTGACAGACCGGGACAAAGTGACAGCCACGGTAGCT 

GGGCTGACGCTGCTGGCTGTCGGGGTCTACTCAGCCAAGAATGCGACAGCCGTCACTG 

^u^GCTTCATCGAGGCTCGGCTGGGGAAGCCGTCCCTAGTGAGGGAGACGTCCCGCAT 

CACGGTGCTGGAGGCGCTGCGGCACCCCATCCAGGTAAGCCGGCGGCTCCTCAGTCGA 

CCCCAGGACGTGCTGGAGGGTGTTGTGCTTAGTCCCAGCCTGGAAGCACGGGTGCGCG 

ACATCGCCATAGC^CC^GGAACACCAAGAAGAACCGGGGCCTGTACAGGCACATCCT 

GCTGTATGGGCCACC^GGCACCGGGAAGACGCTGTTTGCCAAGAAACTCGCCCTGCAC 

TCAGGCATGGACTACGCCATCATGACAGG CGGGG ACGTGG CCCCCATGGGGCGGG AAG 

GCGTGAC03CCATGCACAAGCTCTTTGACTGGGCCAATACCAGCCGGCGCGGGCTCCT 

GCTCTTCATGGATGAAGCAGACGCCTTCCTTCGGAAGCGAGCCACTGAGGAGATAAGC 

AAGGACCTCAGAGCCACACTGAACGCCTTCCTGTACCACATGGGCCAACACAGCAACA 

AGTTCATGCTGGTCCTGGCCAGCAATCTGCCTGAGCAGTTCGACTGTGCCATCAACAG 

CCG CATTGACGTGATGGTCCACTTCGACCTGCCG CAGCAGG AGGAGCGGGAG CGCCTG 

GTGAGACTGCATTTTGACAACTCTGTTCTTAAGCCGGCCACAGAAGGAAAACGGCGCC 

TG AAGCTGG CCCAGTTTG ACTACGGG AGG AAGTGCTCGGAGGTCG CTCGGCTGACGGA 

GGGCATGTCGGGCCGGGAGATCGCTCAGCTGGCCGTGTCCTGGCAGGCCACGGCGTAT 

GCCTCCGAGGACGGGGTCCTGACCGAGGCCATGATGGACACCCGCGTGCAAGATGCTG 

TCCAGCAGCACCAGCAGATGATGCGCTGGCTGAAGAGGGGGAGGCCTGGGCCCGAGGA 

CGAGCAACCCTCATCCXGAGTCCATGGGGAGACCACACCTCACGGAGCCTGGCCGCGG 

ACCCCTCCCACCCCTGCCTTTGCGGCCCCGCACATTTAGGAAATArTrrrr 


J ORF Start: ATG at 2 


ORF Stop: TGA at 1757 




SEQ ID NO: 32 


585 aa I MW at 65861. IkD 


NOV 12a, 
CGI00710-01 Protein Sequence 

1 


MSWLFGVNKGPKGEGAGPPPPLPPAQPGAEGGGDRGLGDRPAPKDKWSNFDPTGLERA 
AKAARELEHSRQTQERLSGS AS PEQVPAGECCALQE YE AAVEQL KS EQIRAQAEERRK 

TLSEETRQHQARAQYQDKIARQRYEDQLKQQQLLNEENLRKQEESVQKQEAMRRATVE 

REMELRHKNEMLRVETEARARAKAERENADIIREQIRLKASEHRQTVLESIRTAGTLF 

GEGFRAFVTDRDKVTATVAGLTLLAVGVYSAKNATAVTGRFIEARLGKPSLVRETSRI 

TVLEALRHPIQVSRRLLSRPQDVLEGWLS PSLEARVRDI AIATRNTKKNRGLYRHI L 

LYGPPGTGKTLFAKKIJU,HSGMDYAIMTGGDVAPMGREGVTAMHKLFDWANTSRRGLL 

LFMDEADAFLRKRATEEISKDLRATLNAFLYHMGQHSNKFMLVLASNLPEQFDCAINS 

RIDVMVHFDLPQQEERERLVRLHFDNCVLKPATEGKRRLKIAQFDYGRKCSF^ARLTE 

3MSGREIAQLAVSWQATAYASEDGVLTEAMMITC 

2QPSS 



Further analysis of the NOV12a protein yielded the following properties shown i: 
Table 12B. 



Table 12B. Protein Sequence Properties NO VI 2a 


PSort analysis: 


, 0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0. 1 524 probability located in iysosome (lumen); 0. 1 000 probability 
located in mitochondrial matrix space 


SignalP analysis: j 


No Known Signal Sequence Predicted 
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A search of the NOV 12a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 12C. 

i — — 



Table 12C Geneseq Results for NOV12a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOVI2a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB93086 


Human protein sequence SEQ ID 
NO: 1 1 926 - Homo sapiens, 586 aa. 
[EP 1 0746 1 7-A2, 07-FEB-200 1 ] 


1..585 
1..586 


529/586 (90%) 
547/586 (93%) 


0.0 


AAM39035 


Human polypeptide SEQ ID NO 21 80 
- Homo sapiens, 602 aa. 
[WO200153312-A1, 26-JUL-2001] 


70..585 
25..540 


502/516(97%) 
505/516(97%) 


0.0 


AAB94853 


Human protein sequence SEQ ID 
NO: 1 6038 - Homo sapiens, 480 aa. 
[EP 1 0746 1 7-A2, 07-FEB-200 1 ] 


168..585 
1..418 


404/418 (96%) 
407/418(96%) 


0.0 


ABB71478 

l 


Drosophila melanogaster polypeptide 
SEQ ID NO 41226- Drosophila 
melanogaster, 599 aa. 
[WO200I71042-A2, 27-SEP-200I] 


L.570 
1..576 


342/579(59%) 
423/579(72%) . 


0.0 


AAB63264 


Human breast cancer associated 
antigen protein sequence SEQ ID 
NO:626 - Homo sapiens, 208 aa. 
[WO200073801-A2, 07-DEC-2000] 


279..474 
3.. 198 


181/196 (92%) 
187/196(95%) 


8e-99 



In a BLAST search of public sequence datbases, the NOV 12a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 12D. 
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Table 12D. Public BLASTP Results for NOV12a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV 12a 
Residues/ 
Match Residues. 


identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BUK4 

* 

i 


SIMILAR TO HYPOTHETICAL 
PROTEIN FLJ 10709 
(HYPOTHETICAL 72.6 KDA 
PROTEIN) - Homo sapiens (Human), 
648 aa. 


L.585 

1 ..JOO 


551/586 (94%) 
Do i/joo 


0.0 

1 


Q9NVI7 


CDNA FLJ10709 FIS, CLONE 
NT2RP3000869 - Homo sapiens 
(Human), 586 aa. 


1..585 

1 ..JoO 


529/586 (90%) 

J4//D50 {yoVo) 


0.0 


Q96A50 


HYPOTHETICAL 66.3 KDA PROTEIN 
(UNKNOWN) (PROTEIN FOR 
MGC: 14291) (PROTEIN FOR 
MGC:20264) - Homo sapiens (Human), 
586 aa. 


1..585 
1..586 


528/586 (90%) 
546/5S6 (93%) 


0.0 


Q96T67 


TOB3 - Homo sapiens (Human), 578 aa. 


1..572 
1..572 


527/573 (91%) 
541/573 (93%) 


0.0 


Q925I1 

! 


TOB3 - Mus musculus (Mouse), 591 aa. 


I..583 
1..583 


500/584 (85%) 
540/584 (91%) 


0.0 



PFam analysis predicts that the NOV 12a protein contains the domains shown i 
the Table 12E. 



Table 12E. Domain Analysis of NOV12a 


Pfam Domain 


NOV12a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


AAA 


346..542 


60/221 (27%) 
140/221 (63%) 


2.6e-32 



Example 13. 



The NOV13 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 13 A. 
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Table 13A. NOV13 Sequence Analysis 




SEQ ID NO: 33 


1705 bp 


NOV 13a, 
CG 100730-01 DNA Sequence 


GATAAATGGAGGAAGCTAAGACCTCACTGAGCCTGTGG CTCCGG CCG CTGAGTGATGG 
TCCCrrCroTCTGCCAGGGTGGGATCGGGCACTCATCCAGTTGCAGGTGCGAGCACTA 
TTGAGTAGC^CAGGGTCTOUIGCTGTGGCCATGGACTTGGGCAACAGGAAATTAGAAA 
TATCCTCTGGAAAGCTGG CCAGATTTGCAAATGG CTCTGCTGTAAT ACAGTCAGGTG A 
CACTGCGGTAATGGCCACAGCCGTCAGTAAAACAAAACCTTCCCCTTCCCAGTTTATG 
CCTTTGGTGGTTG ACTACGGACAAAAGGCTG CTG CAGT AGGTAG AATTCCCAC AAACC 
CTCTGAGAAGAGAGATTAGTACTTCTGATAAAGAAATTCTTACACGTGGAATAGTAGA 
TTGTTCAGTTAGACCTATCTTTCCAGCTGGCTACTTCTGTGATATACAGGTTCTGTGT 
AATCTGTTAGCAGTAGATGGTATTAATGAACTTGATGTCCTAGCAATTAATGGCAACT 
CTGTAGCCCTCTCATTATCAGATATTCTTTGGAATGGACCTGTTGGGACAGTAAGAAT 
AGGAATG ACTG ATGG AG AATG TGTTG TTAACC CAAC AAG AAAAG AAATG TCTTCT AG T 
ACTTTAAATGTAGTGGTTGCTGGAG CACCTAAAAGTCAG ACTGTCATGTTGGAAG CCT 
CTGCAGAGAACATTTTACAGCAGGACTTTTGCCACGCTATCAAAGTGGGAGTGAAATA 
T ACCCAG CAAAT AATT CAG GG C AT C CAG CAG CTGGT AAAAG AAATTGGTGTTACCAAG 
AGGACACCTGAGAAGTTATTTACCCCTTCACAAGAGATTGTGAAACATGCTCATAAAC 
TCACTATGGAG AG ACTCT ATG CAGTTTTT ACAG AT T ATG AACATG AT AAAATTTC CAG 
AGATGAAGCTOTTAACAAAATAAGATTAGATAC^GAGGAACAACTAAAAGAAATATTT 
CC^GAGGTCGATCTATATGAGATAATAGAATCCTTO^TACTGTTGCy^GAAAGTTT 
TTAGT^GTATTATTTTGAATGAATACA^ 

TCXSGAATATAAGTTGTGAGGTAGATATGTTTAAAACCCTTCATGGATC^GAATTATTT 
CAAAGAGGACAAACACAGCTGCTTTGTGCTGTTACATTTGATTCATTAGAATCCAGTA 
TTAAGTTGGATCGAGTTATAACAACTATAAATGGGATAAAAGATAAAAATTTCATGCT 
GC^CTATGAGTTTCCTCCTTATGCAACTAATGAAATTGTCAAAGTCACCGGTATAAAT 
AGAAGAGAACTTGGGCCTG ATG CTCTTG CTGAG AAAG CTTTGTATCCTGTTATrCCCA 
AAGATTTTCCTTTTGCCATAAGAGTTACATCTGAAGTCCTAGAGTCAAATGGGTCATC 
TTCTATGGCATCTGCATGTGGTGGAAGTTTAGCATTAATGGATTCAGGGCTTCCAATT 
AAATCTGCTGTCACAGGTGTAG CAATGGG ATTGG CCACCAAAACCGATCTTGAGAAGG 
GTGAAATAGAAGATTATCATTTGCTAACAGATATTCTGGGAATTGAAGCTTACAATGG 
TTGACATGGACTTCAAAATAGCTGGCACTAATAAAGGAATAACTGCATTACAGGCAGA 
TATTAAATT ACCTGG AATAACAA 




ORF Start: ATG at 6 


ORF Stop: TGA at 1626 




SEQ ID NO: 34 


540 aa 


MWat 59I97.4kD 


NOV 13a, 
CGI 00730-0 1 Protein Sequence 


MEEAKTSLSLWLRPLSDGPFCLPGWDRALIQLQVRALLSSTGSQAVAMDLGNRKLEIS 

SGKLARFANGSAVIQSGDTAVMATAVSKTKPSPSQFMPLVVDYGQKAAAVGRIPTNPL 

RREISTSDKEILTRGIVDCSVRPIFPAGYFCDIQVLCNLLAVDGINELDVIiAINGNSV 

ALSLSDILWNGPVGTWIGMTIX3ECVWPTRK£MSSSTLNVWAG 

ENILQQDFCHAIKVGVKYTGX3IIQGIQQLVKEIGVTKRTPQKLFTPSQEIVKHAH 

MERLYAVFTDYEHDKISRDEAVNKIRLDTEEQ 

SIII^EYKRODGRDLTLiaiNISCEVDMFKTIiHGSELFQRGQTQLLCAVTFDSLESSIK 
LDRVITTINGIKI)KNFMIiHYEFPPYATNEIVK^GINRRELGPDALAEKALYPVIPKD 
FPFAIRVTSEVLESNGSSSMASACGGSLALMDSGLPIKSAVTGVAMGLATKTDIjEKGE 
I ED YHLLTD I LG I EAYNG 



Further analysis of the NOV 13a protein yielded the following properties shown in 
Table 13B. 



Table 13B. Protein Sequence Properties NOV13a 


PSort analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0. 1 000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV1 3a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 13C. 





Table 13C. Geneseq Results for NOV13a 


Genesen 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV 13a 
Residues/ 
Match Residues 


Identities/ 
oimitariues lor 
the Matched 
Region 


Expect 
Value 


AAB08712 


Amino acid sequence of a human 
OLD-35 polypeptide - Homo sapiens, 
705 aa. [WO200046391-A2, 10- 
AUG-2000] 


17..540 
1..524 


455/524 (86%) 
477/524 (90%) 


0.0 


AAB92684 


Human protein sequence SEQ ID 
NO: 1 1 065 - Homo sapiens, 504 aa. 
[EP 1 0746 1 7-A2, 07-FEB-200 1 ] 


10..466 
11. .467 


396/457 (86%) 
416/457 (90%) 


0.0 


ABGI7275 


Novel human diagnostic protein 
#17266 - Homo sapiens, 899 aa. 
[WO200175067-A2, Il-OCT-2001] 


192..540 
522..852 


328/349 (93%) 
330/349 (93%) 


0.0 


ABG08546 


Novel human diagnostic protein 
#8537 - Homo sapiens, 899 aa. 
[ WO200 1 75067-A2, 1 1 -OCT-200 1 ] 


192..540 
522..852 


328/349 (93%) 
330/349 (93%) 


0.0 


ABGI7275 


Novel human diagnostic protein 
' # 1 7266 - Homo sapiens, 899 aa. 
[ WO200 1 75067-A2, 1 1 -OCT-200 1 ] 


192..540 
522..S52 


328/349 (93%) 
330/349 (93%) 


0.0 



In a BLAST search of public sequence datbases, the NOV 13a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 13D. 
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Table 13D. Public BLASTP Results for NOV13a 


Protein 
Accession 
j Number 


Protein/Organism/Length 


NOV 13a 
Residues/ 
Match Residues 


i Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


09DC52 


izuuuujr jzkjk r t\\J i JbllN - ivius 
musculus (Mouse), 540 aa. 


13.. 534 


1 4j4/522 (83%) 
467/522 (89%) 


0.0 


Q96T05 


CDNA FU1453I FIS, CLONE 
NT2RM2000371, WEAKLY SIMILAR 

TO POLYRIBONUCLEOTIDE 
NUCLEOTIDYLTRANSFERASE (EC 
2.7.7.8) - Homo sapiens (Human), 504 aa. 


10..466 
1 1..467 


396/457 (86%) 
416/4*57 


0.0 


Q9V9X7 


CGI 1337 PROTEIN - Drosophila 
melanogaster (Fruit fly), 748 aa. 


35..540 
29..540 


262/512(51%) 
354/512(68%) 


e-143 


Q95RX7 


LD03255P - Drosophila melanogaster 
(Fruit fly), 720 aa. 


55..540 
1..489 


257/489 (52%) 
342/489 (69%) 


e-142 


Q9S7G6 


POLYNUCLEOTIDE 
PHOSPHORYLASE - Arabidopsis 
thaliana (Mouse-ear cress), 99 1 aa. 


49..540 
58..545 


201/494(40%) 
307/494(61%) 


e-104 



PFam analysis predicts that the NOV 13a protein contains the domains shown in 
the Table 13E. 



Table 13E. Domain Analysis of NOV13a 


Pfam Domain 


NOV 13a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


RNaseJPH 


48..250 


58/255 (23%) 
137/255 (54%) 


4.5e-20 


RNase_PH 


357..537 


58/260 (22%) 
136/260 (52%) 


3.2e-15 



Example 14. 



The NOV14 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 14A. 
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Table 14A. NOV14 Sequence Analysis 




SEQ ID NO: 35 


2626 bp 


NOV 14a, 
CGI 008 19-0 1 DN A Sequence 

! 

i 
i 


CCTGCGTTGACCGCGTGCCGGGTGTCATGGCGGCCTGCAGGTACTGCTGCTCGTGCrT 

CCGGCTCCGG CCCCTGAG CGATGGTCCTTTCCTTCTGCCACGGCGGGATCGGGCACTC 

ACCCAGTTGCAAGTGCGAGCACTATGGAGTAGCGCAGGGTCTCGAGCTGTGGCCGTGG 

ACTTAGGCAACAGGAAATTAGAAATATCTTCTGGAAAGCTGGCCAGATTTGCAGATGG 

CTCTGCTGTAGTACAGTCAGGTGACACTGCAGTAATGGTCACAGCGGTCAGTAAAACA 

AAACCTTCCCCTTCCCAGTTTATGCCTTTGGTGGTAGACTACAGACAAAAAGCTGCTG 

CAGCAGGTAGAATTCCCACAAACTATCTGAGAAGAGAGATTGGTACTTCTGATAAAGA 

AATTCTAACAAGTCGAATAATAGATCGTTCAATTAGACCGCTCTTTCCAGCTGGCTAC 

TTCTATGATACACAGGTACTGTGTAATCTGTTAGCAGTAGATGGTGTAAATGAGCCTG 

ATGTCCTAG CAATTAATGG CGCTTCCGTAGCCCTCTCATTATCAGATATTCCTTGGAA 

TGGACCTGTTGGTGCAGTACGAATAGGAATAATTGATGGAGAATATGTTGTTAACCCA 

ACAAGAAAAGAAATGTCTTCTAGTACTTTAAATTTAGTGGTTGCTGGAGCACCTAAAA 

GTC^GATTGTCATGTTGGAAGCCTCTGCAGAGAAC^TTTTACAGCAGGACTTTTGCCA 

TGCTATCAAAGTGGGAGTGAAATATACCCAACAAATAATTCAGGGCATTCAGCAGTTG 

GTAAAAGAAACnSGTGTTACCAAGAGGACACCTCAGAAGTTATTTACCCCTTCGCCAG 

AGATTGTGAAATATAOTCATAAACTTGCTATGGAGAGACTCTATG(^GTTTTTACAGA 

TTACGAGCATGACAAAGTTTCCAGAGATGAAGCTGTTAACAAAATAAGATTAGATACG 

GAGGAACAACTAAAAGAAAAATTTCCAGAAGCCGATCCATATGAAATAATAGAATCCT 

TCAATGTTGTTGCAAAGGAAGTTTTTAGAAGTATTGTTTTGAATGAATACAAAAGGTG 

CGATGGTCGGGATTTGACTTCACTTAGGAATGTAAGTTGTGAGGTAGATATGTTTAAA 

ACCCTTCATGGATCAGCATTATTTCAAAGAGGACAAACACAGGTGCTTTGTACCGTTA 

CATTTGATTCATTAGAATCTGGTATTAAGTCAGATCAAGTTATAACAGCTATAAATGG 

GATAAAAGATAAAAATTTCATGCTGCACTACGAGTTTCCTCCTTATGCAACTAATGAA 

ATTGG CAAAGTCACTGGTTTAAATAGAAG AGAACTTGGG CATGGTGCTCTTGCTGAGA 

AAGCTTTGTATCCTGTTATTCCCAGAGATTTTCCTTTCACCATAAGAGTTACATCTGA 

AGTCCTAGAGTCAAATGGGTCATCTTCTATGGCATCTGCATGTGGCGGAAGTTTAGCA 

TTAATGGATTCAGGGGTTCCAATTTCATCTGCTGTTGCAGGCGTAGCAATAGGATTGG 

TCACCAAAACCGATCCTGAGAAGGGTGAAATAGAAGATTATCGTTTGCTGACAGATAT 

TTTGGG AATTG AAG ATTACAATGGTGACATGGACTTCAAAAT AG CTGG CACTAATAAA 

GGAATAACTGCATTACAGGCTGATATTAAATTACCTGGAATACCAATAAAAATTGTGA 

TGGAGGCTATTCAACAAGCTTCAGTGGCAAAAAAGGAGATATTACAGATCATGAACAA 

AACTATTTCAAAACCTCGAGCATCTAGAAAAGAAAATGGACCTGTTGTAGAAACTGTT 

CAGGTTCCATTATCAAAACGAGCAAAATTTGTTGGACCTGGTGGCTATAACTTAAAAA 

AACTTCAGGCTGAAACAGGTGTAACTATTAGTCAGGTGGATGAAGAAACGTTTTCTGT 

ATTTGC ACCAACACCCAGTGTT ATGCATGAGG CAAGAGACT TCATTACTGAAATCTG C 

AAGGATGATCAGGAGCAGCAATTAGAATTTGGAGCAGTATATACCGCCACAATAACTG 

AAAT CAGAGATACTG GTG TAATGGTAAAATTAT AT CCAAATATGACTGCGGT ACTG CT 

TCATAACACACAACrTTGATAACGAAAGATTAAACATCCTACTGCCCTAGGATTAGAAG 

TTGGCC^AGAAATTCAGGrGAAATACTTTGGACGTGACCCAGCCGATGGAAGAATGAG 

GCTl w i l Cu v L:ciAAAAGTGCriTCAGTCGCCAGCTACAACCGTGGTCAGAACT 

AGAAG T AGTATTGT AATCGGAG AACCTATTTCACAGTCATCATCTAATT CTCAG TGAT 

TTTTTTTTTTTAAAGAGAATTCTAGAATTCrATTTTGTCT 

GCAACATTTTAGTAGATCTTCCATTGTGTAGATTTCTATATAATATAAATACATTTTA 
ATTATTTGTACTAAAATGCTCATTTACATGTGCCATTTTTTTAATTCGAGTAACCCAT 
ATTTG TTT AATTGTATTTACATTATAAAT CAAGAAAT ATTT ATTATT AAAAGT AAGTC 
ATTTATACATCTTAGA 




ORF Start: ATG at 27 


ORF Stop: TAG at 2 193 




SEQ ID NO: 36 


722 aa 


MWat79307.2kD 


NOV 14a, 
CGI 008 19-01 Protein Sequence 


MAACRYCCSCLRLRPLSDGPFLLPRRDRALTQLQVRALWSSAGSRAVAVDLGNRKLEI 
S SGKIiARFADGSAWQS GDTAVMVTAVS KTKPS PSQFMPLWDYRQKAAAAGRI PTNY 
LRREIGTSDKEILTSRIIDRSIRPLFPAGYFYDTQVLCNLLAVDGVNEPDVLAINGAS 
VALSLSDI PWNGPVGAVtflGI IDGEYWNPTRKEMSSSTLNLWAGAPKSQIVMLEAS 
AENI LQQDFCHAI KVGVKYTQQ 1 1 QGIQQLVKETGVTKRTPQKLFTPS PE I VKYTHKL 
AMERLYAVFTDYEHDKVSRDEAVNKI RLDTEEQLKEKFPEADPYE 1 1 ESFNWAKEVF 
RSIVLNEYKRCDGRDLTSLRNVSCEVDMFKTLHGSALFQRGQTQVLCTVTFDSLESGI 
KSDQVI TAING I KDKNFMLHYE FP PYATNE I GKVTGLNRREIjGHGALAEKALYPVI PR 
DFPFTIRVTS EVL ESNGS S SMAS ACGGSLALMDSGVP I S SAVAGVAIGLVTKTDPEKG 
E I EDYRLLTD I LG IEDYNGDMDFKI AGTNKGI TALQADI KLPG I P I KIVMEAI QQASV 
AJOCEILQIMNKTISKPRASRKENGPVVETVQVPLSKRAKFVGPGGYNLKKIiQAETGVT 
I SQVDEETFSVFAPTPSVMHEARDFITEI CKDDQEQQLEFGAVYTATITEIRDTGVMV 
KLYPNMTAVLLHNTQLDNERLNILLP 
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Further analysis of the NOV 14a protein yielded the following properties shown in 
Table 14B. 



Table 14B. Protein Sequence Properties NOV14a 


i PSort analysis: . , 

i 


0.5016 probability located in mitochondria! matrix space; 0.4500 probability 
located in cytoplasm; 0.2212 probability located in mitochondrial inner membrane; 
0.2212 probability located in mitochondrial intermembrane space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 14a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 14C. 



Table 14C Geneseq Results for NOV14a 


| 

| Geneseq 
{ Identifier 

i 


Protein/Organism/Length [Patent #, Date] 


NOV 14a 
Residues/ 

Match 
Residues^ 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


! AAB08732 


Amino acid sequence of a human OLD-35 
polypeptide - Homo sapiens, 705 aa. 
[WO200046391-A2, 10-AUG-2000] 


18..722 
1..705 


704/705 
(99%) 

705/705 
(99%) 


0.0 


AAB92684 


Human protein sequence SEQ ID 
NO:l 1065 - Homo sapiens, 504 aa. 
[EP1074617-A2, 07-FEB-2001] 


1..467 
1..467 


466/467 
(99%) 

467/467 
(99%) 


0.0 


ABB58546 


Drosophila melanogaster polypeptide SEQ 
ID NO 2430 - Drosophila melanogaster, 
748 aa. [WO200171042-A2, 27-SEP-200I] 


36..717 
29..693 


363/688 
(52%) 

485/688 
(69%) 


0.0 


ABG17275 


Novel human diagnostic protein #17266 - 
Homo sapiens, 899 aa. [WO200I75067-A2, 
ll-OCT-2001] 


190..583 
519..894 


335/394 
(85%) 

349/394 
(88%) 


0.0 


ABG08546 


Novel human diagnostic protein #8537 - 
Homo sapiens, 899 aa. [WO200175067-A2, 
ll-OCT-2001] 


190..583 
519..894 


335/394 
(85%) 

349/394 
(88%) 


0.0 



In a BLAST search of public sequence datbases, the NOV 14a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 14D. 
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Table 14D. Public BLAST? Results for NOV14a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV 14a 
Residues/ 

iviatcn 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


? Q9DC52 


1200003F12RIK PROTEIN - Mus muscuhiQ 
(Mouse), 540 aa. 


1 ^"2/1 
1 ..Dj4 

1..534 


4 j4 (89%) 
506/534 (94%) 


0.0 


! Q96T05 


CDNA FLJ 14531 FIS, CLONE 
NT2RM2000371, WEAKLY SIMILAR TO 
POLYRIBONUCLEOTIDE 
NUCLEOTIDYLTRANSFERASE (EC 
2.7.7.8) - Homo sapiens (Human), 504 aa. 


1..467 
1..467 


466/467 (99%) 
467/467 (99%) 


0.0 


Q95RX7 


LD03255P - Drosophila melanogaster (Fruit 
fly), 720 aa. 


56..717 
1..665 


373/665 (56%) 
494/665 (74%) 


0.0 


j Q9V9X7 


CGI 1337 PROTEIN - Drosophila 
melanogaster (Fruit fly), 748 aa. 


36..717 
29..693 


363/688 (52%) 
485/688 (69%) 


0.0 


| Q9S7G6 


POLYNUCLEOTIDE PHOSPHORYLASE 
- Arabidopsis thaliana (Mouse-ear cress), 
991 aa. 


50..715 
58..713 


278/668(41%) 
417/668 (61%) 


e-148 



PFam analysis predicts that the NOV 14a protein contains the domains shown i 
the Table 14E. 



Table 14E. Domain Analysis of NOV14a 


Pfam Domain 


NOV 14a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


RNase_PH 


48..251 


68/255 (27%) 
147/255 (58%) 


l.le-37 


RNasePH 


358..581 


75/261 (29%) 
175/261 (67%) 


4e-55 


KH-domain 


609..651 


14/49(29%) 
31/49(63%) 


0.35 



Example 15. 



The NOV15 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 15A. 
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Table ISA. NOV15 Sequence Analysis 




SEQ ID NO: 37 


2030 bp 


NOV15a, 
CGI 00872-01 DNA Sequence 

i 

? 

i 

i 


CCATGTCCTTCCAGAGTATCATCCACCTGTCCCTGGACAGCCCTGTCCATGCCGTTTG 

TGTGTTGGGCACAGAAATCTGCTTGGATCTCACCAGGTGTGCCCCCCAGAAGTGCCAG 

TGCTTCACCATCCATGGCTCTGGGAGGGTCTTGATCGATGTGGCCAACACGGTGATTT 

CTGAGAAGGAGGACGCCACCATCTGGTGGCCCCTGTCTGATCCCACGTACGCCACAGT 

GAAGATGACATCGCCCAGCCCTTCCGTGGATGCGGATAAGGTAAGCCTCACATACTAT 

GGGCCCAACGAGGATGCCCCCGTGGGCACAGCTGTGCTGTACCTCACTGGCATTGGTG 

AGTGTTG CTCCAACTGGGAGCCTCCCAACCG CTGGG ACCCCAG ATCCATTTTTTCTCA 

GAAAAAATGGATCTGGGGTCCCAG CGGTTGGGGTG CCATCCTG CTTGTGAATTGCAAC 

CCTGCTGATGTGGGCC^GCAACTTGAGGACAAGAAAACCAAGAAAGTGATCTTTTCAG 

AGGAAATAACGAATCTGTCCCAGATGACTCTGAATGTCCAAGGCCCCAGCTGTATCTT 

AAAGAAATATCGGCTAGTCCTCCATACCTCCAAGGAAGAGTCGAAGAAGGCGAGAGTC 

TACTGGCCCCAAAGTGAAGACAACTCCAGTACCTTTGAGTTGGTGCTGGGGCCCGACC 

AGCACGCCTATACCTTGGCCCTCCTCGGGAACCACTTGAAGGAGACTTTCTACGTTGA 

AGCTATAGCATTCCCATCTGCCGAATTCTCAGGCCTCATCTCCTACTCTGTGTCCCTG 

GTGGAGGAGTCTTTGCAGTCAATTCCAGAGACTGTGCTGTACAAAGACACGGTGGTGT 

TCCGGGTGGCTCCCTGTGTCTTCATTCCCTGTACCCAGGTGCCTCTGGAGGTTTACCT 

GTGCTATCTGAGCTGGGTGTTGAGGGTTGAGCAGGAGTTGGCAGGAAGGAAGGGGGTT 

CTTACCGTACCTTTTCCGGCAAACAATCTGTCTTCCTCTCCATTCCAGGATGAGATGG 

CCTTCTGCTACACCCAGGCTCCCCACAAGACAACGTCCTTGATCCTCGACACACCTCA 

GGCCGCCGATCTCGATGAGTTCCCCATGAAGTACTCACTGGTGCCTGGTATTGGCTAC 

ATG ATCCAGG ACACTGAGGACCATAAAGTGG CCAG CATGG ATTCCATTGGGAACCTG A 

TGGTGTCCCCACCTGTCAAGGTCCAAGGGAAAGAGTACCCGCTGGGCAGAGTCCTCAT 

TGGCAGCAGCTTTTACCCCTCTAG CG CAGAGGGCCGGGCCATGAGTAAGACCCTCCGA 

GACTTCCTCTATGCCCAGCAGGTCCAAGCG CCGGTGGAG CTCTACTCAGATTGGCTAA 

TGACTGGCCACGTGGATGAGTTCATGTG CTTC ATCCCCACAGATGACAAG CAGGGCTT 

C CTGCTGCTCCTGGCCAGCCCCAGTG CCTG CT ATAAACTGTTCCG AG AG AAACAGAAG 

GAAGGCTATGGCGACGCTCTTCTGTTTGATGAGCTTAGAGCAGATCAGCTCCTGTCTA 

ATGGTAAGGGAACTCCCTTTCGCCCCCCCCCCCACCCACCCACCCACCCACAGAAGTG 

CATTCACCTGAACCGTGACATCCTGAAGACGGAGCTGGGCCTGGTGGAACAGGACATC 

ATCGAGATTCCCCAG CTGTTCTGCTTGG AG AAGCTG ACTAACATCCCCTCTGACCAG C 

AGCCCAAGTTGCGGATGATTGTGATGGGCAAGAACCTGGGGATCCCCAAGCCTTTTGG 

GCCCCAAATCAAGGGGACCTGCTGCCTGGAAGAAAAGATTTGCTGCTTGCTGGAGCCC 

C TGGG CTT CAAG TG C ACCTT CAT CAATG ACTTTG ACTGTT ACCTGACAG AGGTCGG AG 

ACATCTGTGCCTGTGCt^CATCCGCCGGGTGCCCTTTGCCnTCAAATGGTGGAAGAT 

GGTACCTTAGACCCAGGCCCTGGAGCTGCCAGCTCTGCCCCAGCGTGGATGGCCCACT 




ORF Start: ATG at 3 


ORF Stop: TAG at 1980 




SEQ ID NO: 38 


659 aa 1 MW at 73697.2kD 


NOV 15a, 
CG 100872-01 Protein Sequence 


MSFQSIIHLSLDSPVHAVCVLGTEICLDLTR^ 

E KED AT I WWPLSD PTYATVKMTS P S PSVDADKVS LT YYGPN EDAP VGTAVL YLTG I G E 
CCSNWEPPNRWDPRS I FSQKKWI WGPSGWGAILLVNCNPADVGQQLEDKKTKKVI PSE 
EITNLSQMTLNVQGPSCILKKYRLVLFH'SKEESKKARVyWPQSEDNSSTFELVLGPDQ 
HAYTLALLGNHLKETFYVEAIAFPS AE FSGL I S YS VSLVEES LQS I PETVLYKDTWF 
RVAPCVF I PCTQVPLEVYLC YLS WVLRVEQELAGRKGVLTVPFPANNLS SS PFQDEMA 
FCYTQAPHKTTSLILDTPQAADLDE FPM KYSLVPG I GYMI QDTEDHKVASMDS I GNLM 
VSPPVKVQGKEYPLGRVLIGSSFYPSSAEGRAMSKTLRDFLYAQQVQAPVELYSDWLM 
TGHVDEFMCFI PTDDKQGFLLLLAS PSACYKLFREKQKEGYGDALLFDELRADQLLSN 
GKGTPFRPPPHPPTHPQKCIHLNRDI LKTELGLVEQDI I EI PQLFCLEKLTNI PSDQQ 
PKU^IVMGKNLGIPKPFGPQIKGTCCLEEKICCLLEPLGFKCTFINDFDCYLTEVGD 
I CACANI RRVP FAFKWWKMVP 



Further analysis of the NOV1 5a protein yielded the following properties shown in 
Table 15B. 
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Table 15B. Protein Sequence Properties NOVlSa 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3600 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0598 probability located in microbody (peroxisome) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 15a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 5C. 



Table 15C. Geneseq Results for NOVlSa 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV 15a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


j AAE06034 


Mouse oocyte protein 5 (MOP5) - Mus 
musculus. 664 aa. [WO2001 53 3 3 9- A2, 
26-JUL-2001] 


20.. 659 
I. .664 


386/672 (57%) 
492/672 (72%) 


0.0 


AAB41584 


Human ORFX ORF1348 polypeptide 
sequence SEQ ID NO:2696 - Homo 
sapiens, 663 aa. [WO200058473-A2, 05- 
OCT-2000] 


1..659 
1..663 


289/666 (43%) 
411/666 (61%) 


e-155 


ABB57299 


Mouse ischaemic condition related 
protein sequence SEQ ID NO: 839 - Mus 
musculus, 673 aa. [WO2001881 88-A2, 
22-NOV-2001] 


7..659 
15..673 


268/667 (40%) 
379/667 (56%) 


e-132 


ABG26518 


Novel human diagnostic protein #26509 
- Homo sapiens, 761 aa. 
[WO200175067-A2, ll-OCT-2001] 


31. .632 
36..642 


263/620 (42%) 
361/620 (57%) 


e-122 


ABG26518 


Novel human diagnostic protein #26509 
- Homo sapiens, 761 aa. 
[WO200175067-A2, ll-OCT-2001] 


31. .632 
36..642 


263/620 (42%) 
361/620 (57%) 


e-122 



5 In a BLAST search of public sequence datbases, the NOV 1 5a protein was found to 

have homology to the proteins shown in the BLASTP data in Table 15D. 
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Table 15D. Public BLASTP Results for NOVlSa 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV15a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


Q9UM07 


Protein-arginine deiminase type V (EC 
3.5.3.15) (Peptidylarginine deiminase V) 
(HL-60 PAD) - Homo sapiens (Human), 663 
aa. 


1..659 
1..663 


289/666 
(43%) 

411/666 
(61%) 


e-154 


088807 


Protein-arginine deiminase type IV (EC 
3.5.3.15) (Peptidylarginine deiminase IV) 
(PAD-R4) (Peptidylarginine deiminase type 
alpha) - Rattus norvegicus (Rat), 666 aa. 


1..659 
1..666 


297/675 
(44%) 

404/675 
(59%) 


e-148 


P70708 


Protein-arginine deiminase type III (EC 
3.5.3.15) (Peptidylarginine deiminase HI) - 
Rattus norvegicus (Rat), 664 aa. 


1..659 
1..664 


290/677 
(42%) 

408/677 
(59%) 


e-145 


Q9ZI84 


Protein-arginine deiminase type HI (EC 
3.5.3.15) (Peptidylarginine deiminase III) - 
Mus musculus (Mouse), 664 aa. 


1..659 
1..664 


288/677 
(42%) 

409/677 
(59%) 


e-144 


057400 


PEPT1 DYLARG FNINE DEIMINASE - 
Gallus gallus (Chicken), 672 aa. 


1..659 
1..672 


286/682 
(41%) 

399/682 
(57%) 


e-143 



PFam analysis predicts that the NOV 1 5a protein contains the domains shown in 
the Table 15E. 



Table 15E. Domain Analysis of NOVlSa 


Pfam Domain 


NOV 15a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


PAD 


1..659 


311/702(44%) 
446/702 (64%) 


4.7e-222 



Example 16. 



The NOV 16 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 16A. 
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Table 16A. NOV16 Sequence Analysis 




SEQ ID NO: 39 


2018 bp 


NOV 16a, 
CGI 00980-0 1 DNA Sequence 

I 
\ 

\ 

i 

j 

i 
1 


GATGG CCCCAAAGAGAGTTGTGCAG CTGTCCCTGAAGATGCCTACCCATGCCGTGTGT 

GTGGTGGGAGTCGAGGCACATGTGGACATTCACAGTGATGTGCCCAAGGGTGCCAACA 

GCTTCAGGGTCTCTGGAAG CTCCGGGGTGGAGGTCTTCATGGT CTACAACCGCACACG 

TGTGAAAGAGCCCATAGGCAAGGCCCGTTGGCCGCTAGACACTGATGCAGACATGGTC 

GTATCTGTGGGCACAGCCAGTAAGGAATTAAAGGACTTCAAGGTAAGAGTCTCCTACT 

TTGGGGAGCAGGAAGACC7U^GCTCTGGGCCGCAGCGTGCTTTACCTCACTGGCGTCCT 

AGAACCCCTTCCTGGAGCCTCCTGGGTCCCACTGATCCCAAGGC^TCTTATTTTGCCA 

CAG AAAACCTGGCGCTGGGGCCCTGAGGG CTATGGGG CTATCTTGCTGGTG AACTGTG 

ACCGGG ACAATCACAGGTCCGCAG AGCCTGACCTCACCCAC AG CTGG CTGATGTCGCT 

GGCTGACCTGCAGGACATGTCCCCAATGCTGCTGAGCTGCAATGGCCCCGACAAGCTC 

TTCGACAGCCACAAGCTTGTCTTGAACGTGCCCTTTTCTGATTCCAAAAGAGTGAGGG 

TCTTCTGTG CCAGGGGTCCTGAGGATGTGTGTGAGG CCTATAGGCATGTGCTGGGCCA 

AAATAAGGTGTCCTATGAGGTACCCCGCTTGC^TGGGGATGAGGAGCGCrTCTTCGTG 

GAAGGCCTGTCCTTCCCTGATGCCGGCTTCACAGGACTCATCTCCTTCCATGTCACTC 

TGCTGGACGACTCCAACGAGGATTTCTCGGCATCCCCTATCTTCACTGAC^CTGTGGT 

GTTCCGAGTGGC^CCCTGGATCATGACGCCCAGCACTCTGCCACCCCTAGAGGTGTAT 

GTGTGCCGGGTGAGGAACAACACGTGTTTTGTGGATGCGGTGGCAGAGCTGGCCA.GGA 

AGG CCGGCTGCAAGCTG ACCATCTGCCCACAGGCCGAGAACCG CAACGACCGCTGGAT 

CCAGGATGAGATGGAGCTGGGCTACGTTCAGGCGCCGCACAAGACCCTCCCGGTGGTC 

TTTGACTCCCCAAGGAATGGGGAACTGCAGGATTTCCCTTACAAAAGAATCCTGGGTC 

CAGATTTTGGTTACGTGACTCGGGAACCACGCGACAGGTCTGTGAGTGGCCTGGACTC 

CTTTGGGAACCTGGAGGTCAGCCCTCCAGTGGTGGCCAATGGGAAAGAGTACCCCCTG 

GGGAGGATCCTCATTGGGGGCAACCTGCCTGGGTCAAGTGGCCGCAGGGTCACCCAGG 

TGGTGCGGGACTTCCTCCATGCCCAGAAGGTGCAGCCCCCCGTGGAGCTCTTTGTGGA 

CTGGTTGGCCGTGGGCCATGTGGATGAGTTTCTGAGCTTTGTCCCTGCCCCCGATGGG 

AAGGGCTTCCGGATGCTCCTGGCCAGCCCTGGGGCCTGCTTCAAGCTCTTCCAGGAAA 

AG CAGAAGTGTGG CCACGGG AGGGCCCTCCTGTTCCAGGGGGTTGTTGGTGATG AGC A 

GGTCAAGACCATCTCCATCAACC^GGTGCTCTCCAATAAAGACCTCATCAACTACAAT 

AAGTTTGTGCAGAGCTGCATTGACTGGAACCGTGAGGTGCTGAAGCGGGAGCTGGGCC 

TGGCA.GAGTGTGACATCATTGACATCCCACAGCTCTTCAAGACCGAGAGGAAAAAAGC 

AACGGCCITLTTCCCTGACTTGGTGAACATGCTGGTGCTGGGGAAGCACCTGGGCATC 

CCCAAGCCCTTTGGGCCCATCATCAATGGCTGCTGCTGCCTGGAGGAGAAGGTGCGGT 

CCCTGCTGGAGCCGCTGGGCCTCCACTGCACCTTCATTGATGACTTCACTCCATACCA 

C^TGCTGCATGGGGAGGTGCACTGTGGCTVCCAATGTGTC 

AAGTGGTGGAACATGGTGCCCTGAGACAGCTCCCACCCACCATCCT 




ORF Start: ATG at 2 


ORF Stop: TGA at 1994 




SEQ ID NO: 40 


664 aa 


MW at 74356.9kD 


NOV 16a, 
CGI 00980-01 Protein Sequence 


MAPKRWQLSLKMPTHAVCWGVEAHVDIHSDVPKGANSFR^ 

VKE PIGKARWPLDTDADMWSVGTASKELKD FKVRVS YFGEQEDQALGRS VLYLTGVL 
EPLPGASWVPLIPRHLILPQKTWRWGPEGYGAILLVNCDRDNHRSAEPDLTHSWLMSL 
ADLQDMSPMLLSOTGPDKLFDSHKLVIjNVPFSDSKRWV^ 

NKVSYEVPRLHGDEERFFVEGLS FPDAGFTGLI SFHVTLLDD SNEDFS AS PI FTDTW 
FRVAPW I MT PSTLP PLEVYVCRVRlsJNTCFVD AVAELiARKAG C KLT I C P QAENRNDRW I 
QDE^ELGYVQAPHKTLPVVFDSPRNGELQDFPYKRIIiGPDFGYVTREPRDRSVSGLDS 
FGNLEVSPPWANGKEYPLGRILIGGNLPGSSGRRVTQVVRDFI4HAQKVQPPVELFVD 
WLAVGHVDEFLSFVPAPDGKGFRMLLASPGACFKLFQEKQKC^HGRALLFQGVVGDEQ 
VKTI S INQVLSNKDL INYNKFVQSC IDWNREVLKRELGIiAE CD I IDI PQLFKTERKKA 
TAFFPDLVNMLVLGKHLGIPKPFGPIINGCCC^EEKVRSLLEPI^LHCTFIDDFTPYH 
MLHGEVHCGTNVCRKPFSFKWWNMVP 



Further analysis of the NOV 16a protein yielded the following properties shown in 
Table 16B. 
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Table 16B. Protein Sequence Properties NOV16a 


Psort analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.2424 
probability located in lysosome (lumen); 0.1410 probability located in microbody 
(peroxisome); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 16a protein against the Geneseq database,- a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 16C. 



Table 16C. Geneseq Results for NOV16a 


j Geneseq 
| Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV 16a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


1 ABG26518 

i 

• 


Novel human diagnostic protein 
#26509 - Homo sapiens, 761 aa. 
[ WO200 1 75067- A2, 1 1 -OCT-200 1 ] 


32..645 
37..650 


591/614(96%) 
595/614(96%) 


0.0 


i ABG26518 

i 


Novel human diagnostic protein 
#26509 - Homo sapiens, 761 aa. 
[WO200I75067-A2, 1 1 -OCT-200 1 J 


32..645 
37..650 


591/614(96%) 
595/614(96%) 


0.0 


AAB41584 


Human ORFX ORF1348 polypeptide 
sequence SEQ ID NO:2696 - Homo 
sapiens, 663 aa. [WO200058473-A2, 
05-OCT-2000] 


1..664 
I. .663 


377/671 (56%) 
468/671 (69%) 


0.0 


ABB57299 


Mouse ischaemic condition related 
protein sequence SEQ ID NO:839 - 

Mus musculus, 673 aa. 
[WO200188188-A2, 22-NOV-2001] 


1..664 
9..673 


352/670 (52%) 
457/670 (67%) 


0.0 


AAE06034 


Mouse oocyte protein 5 (MOP5) - Mus 
musculus, 664 aa. [WO200153339- 
A2, 26-JUL-2001] 


20..664 
1..664 


280/675(41%) 
402/675 (59%) 


e-136 



In a BLAST search of public sequence datbases, the NOV 16a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 16D. 
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Table 16D. Public BLASTP Results for NOV16a 


Protein 
Accession 
Number 

; 

1 


Protein/Organism/Length 


lsin v i fa 

IN W V lOd 

Residues/ 

Match 
Residues 


identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


j Q9ULW8 

i 


PrOtein-aroininp HfMminac** h/np III 

(EC 3.5.3.15) (Peptidylarginine 
deiminase III) - Homo sapiens 
(Human), 664 aa. 


\ ..OOh 

I. .664 


C-Jl 1£{LA /*7O0/"\ 

j j l/oo4 (/yVo) 
565/664 (84%) 


0.0 


J Q9Z184 


r iuiciu-<ii^iihiic ueiminase type in 
(EC 3.5.3.15) (Peptidylarginine 
deiminase III) - Mus musculus 
(Mouse), 664 aa. 


I ..004 

1..664 


AO A IC£ A /T^ft / \ 

484/664 (72%) 
552/664 (82%) 


0.0 


P70708 


Protein-arginine deiminase type III 

(EC 3.5.3.15) (Peptidylarginine 
deiminase III) - Rattus norvegicus 
(Rat), 664 aa. 


1..664 
1..664 


481/664 (72%\ 
546/664 (81%) 


o n 


\ 002849 

i 

i 

i 

i 


Protein-arginine deiminase type III 

(EC 3.5.3.15) (Peptidylarginine 
deiminase III) - Ovis aries (Sheep), 
664 aa. 


1..664 
1..664 


485/664 (73%) 
539/664 (81%) 


0.0 


, Q9ULC6 


Protein-arginine deiminase type I 
(EC 3.5.3.15) (Peptidylarginine 
deiminase I) - Homo sapiens 
(Human), 663 aa. 


1..664 
1..663 


487/669 (72%) 
544/669 (80%) 


0.0 



PFam analysis predicts that the NOV16a protein contains the domains shown in 
the Table 16E. 



Table 16E. Domain Analysis of NOV16a 


Pfam Domain 


NOV 16a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


PAD 


1..664 


465/691 (67%) 
623/691 (90%) 


0 



Example 17. 



The NOV 17 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 17A. 
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Table 17A. NOV17 Sequence Analysis 




SEQIDNO:41 


1027 bp 


NOV 1 7a, 
! CGI 72805-01 DNASeauence 

; 
1 

j 

! 
i 


GTCGACGCGGCCGCGGGCGGCGGAGATGGCGGCGGGGATAGTGGCTTCTCGCAGACTC 
CGCGACCTACTGACCCGGCGACTGACAGGCTCCAACTACCCGGGACTCAGTATTAGCC 
TTCGCCTCACTGGCTCCTCTGCACAAGAGGCGGCTTCCGGAGTAGCCCTCGGTGAAGC 
CCCAGACCACAGCTATGAGTCCCTTCGTGTGACGTCTGCGCAGAAACATGTTCTGCAT 
GTCCAGCTCAACCGGCCCAACAAG AGGAATG CCATGAACAAGGTCTT CTGGAG AGAG A 
TGGTAGAGTGCTTCAACAAGATTTCGAGAGACGCTGACTGTCGGGCGGTGGTGATCTC 
TGGTGCAGGAAAAATGTTCACTGCAGGTATTGACCTGATGGACATGGCTTCGGACATC 
CTGCAGCCCAAAGGAGATGATGTGGCCCGGATCAGCTGGTACCTCCGTGACATCATCA 
CTCGATACCAGGAGACCTTCAACGTCATCG AGAGGTG CCC CAAG C CCGTGATTGCTGC 
CGTCCATGGGGGCTGCATTGGCGGAGGTGTGGACCTTGTCACCGCCTGTGACATCCGG 
TACTGTGCCCAGGATGCTTTCTTCCAGGTGAAGGAGGTGGACGTGGGTTTGGCTGCCG 
ATGTAGGAACACTGOiGCGCCTGCCCAAGGTCATCGGGAACCAGAGCCTGGTCAACGA 
GCTGGCCTTCACCGCCCGCAAGATGATGGCTGACGAGGCCCTGGGCAGTGGGCTGGTC 
AGCCGGGTGTTCCCAGACAAAGAGGTCATGCTGGATGCTGCCTTAGCGCTGGCGGCCG 
AGATTTCCAGCAAGAGCCCCGTGGCGGTGCAGAGCACCAAGGTCAACGTGCTGTATTC 
CCGCGACCATTCGGTGGCCGAGAGCCTCAACTACGTGGCGTCCTGGAACATGAGCATG 
CTGCAGACCCAAGACCTCGTGAAGTCGGTCCAGGCCACGACTGAGAACAAGGAACTGA 
AAACCGTCACCTTCTCCAAGCTCTGAGAGCCCTCGCGTCCC 




ORF Start: ATG at 26 ORF Stop: TG A at 1 0 1 0 




SEQ ID NO: 42 


328 aa 


MW at 35757.7kD 


NOV 1 7a, 
CGI 72805-01 Protein Sequence 

i 


MAAGIVASRRLRDLLTRRLTGSNYPGLSISLRLTGSSAQEAASGVALGEAPDHSYESL 
RVTSAQKHVLHVQLNRPNKRNAMNKV^ 

GIDLMDMASDILQPKGDDVARISVm*RDIITRYQETFNVIERCPKPVIAAVHGGCIGG 
GVDLVTACD I RY CAQDAFFQVKEVDVGLAADVGTLQRLP KVI GNQSL VNE LAFTARKM 
MADEALGSGLVSRVFPDKEVMIJDAAIiALAAEISSKSPVAVQSTFCVNLLYSRDHSVAE 
LNYVASWNMSMLQTQDLVKSVQATTENKELKTVTFSKL 



Further analysis of the NOV 17a protein yielded the following properties shown in 
Table 17B. 



Table 17B. Protein Sequence Properties NO VI 7a 


PSort analysis: 


0.9200 probability located in mitochondrial matrix space; 0.8000 probability 
located in microbody (peroxisome); 0.6000 probability located in mitochondrial 
inner membrane; 0.6000 probability located in mitochondrial intermembrane space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 17a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 17C. 
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Table 17C Geneseq Results for NOV17a 


Geneseq 
Identifier 

! 


Protein/Organism/Length [Patent #, 
Date] 


INV-JV I la 

Residues/ 

Match 
Residues 


laeniiiies/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


! AAB43664 


Human cancer associated protein 
sequence SEQ ID NO:l 109 - Homo 
sapiens, 300 aa. [WO200055350-A1 , 
21-SEP-2000] 


2.216 
5..27S 


265/275 (96%) 
268/275 (97%) 


e-147 


I AAM24501 

! 

| 


Colon tumour related predicted amino 

acid sequence for CT632 - Homo 
sapiens, 167 aa. [WO2001497 1 6-A2, 
12-JUL-2001] 


46..212 
I ..167 


166/167 (99%) 
166/167 (99%) 


le-93 


j AAB11904 

i 


Human colon tumour protein CT632, 
SEQ ID NO:204 - Homo sapiens, 167 
aa. [WO200037643-A2, 29-JUN-2000] 


46..212 
1..167 


166/167 (99%) 
166/167 (99%) 


le-93 


j ABB61607 


Drosophila melanogaster polypeptide 

SEQ ID NO 11613- Drosophila 
melanogaster, 289 aa. [ WO200 1 7 1 042- 
A2,27-SEP-2001] 


51. .328 
13..289 


145/281 (51%) 
199/281 (70%) 


le-73 


= AAG00231 


Human secreted protein, SEQ ID NO: 
4312 - Homo sapiens, 1 18 aa. 
[EP1 03340 1-A2, 06-SEP-2000] 


I..1I8 
1..118 


118/118(100%) 
118/118(100%) 


2e-61 



In a BLAST search of public sequence datbases, the NOV 17a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 17D. 
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Table 17D. Public BLASTP Results for NOV17a 


Protein 
Accession 
Number 


Protein/Organ ism/Length 


NOV 17a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q8WVX0 


HYPOTHETICAL 35.8 KDA 
PROTEIN - Homo sapiens (Human), 
328 aa. 


I..328 
.L.328 


328/328(100%) 
328/328(100%) 


0.0 


Q96EZ9 


UNKNOWN (PROTEIN FOR 
MGC: 1 956 1 ) - Homo sapiens 
(Human), jzo aa. 


L.328 
L.328 


327/328 (99%) 
327/328 (99%) 


0.0 


Q13011 


Delta3 ,5-delta2,4-dienoyI-CoA 
isomerase, mitochondrial precursor 
(EC 5.3.3.-) - Homo sapiens (Human), 
328 aa. 


L.328 
L.328 


318/328 (96%) 
320/328 (96%) 


e-178 


Q62651 


DeIta3,5-deita2,4-dienoyl-CoA 
isomerase, mitochondrial precursor 
(EC 5.3.3.-) - Rattus norvegicus (Rat), 
327 aa. 


L.328 
L.327 


248/328 (75%) 
289/328 (87%) 


e-143 


A57626 


peroxisomal enoyl hydratase-like 
protein - rat, 327 aa. 


L.328 
L.327 


247/328(75%) 
288/328 (87%) 


e-!42 



PFam analysis predicts that the NOV 17a protein contains the domains shown in 
the Table 17E. 



Table 17E. Domain Analysis of NO V17a 


Pfam Domain 


NOV 17a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


ECH 


68..248 


66/185(36%) 
132/185(71%) 


le-41 



Example 18. 



The NOV 18 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 1 8A. 
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Table 18A. NOV18 Sequence Analysis 




SEQIDNO:43 1067 bp 


NOV 18a, 
CG56763-0I DNA Sequence 


CCCTTTCCTCTTGCTCTTTGATGTTTTGTAGGCCTGCAOrTr'r r a anrn r'hnii.nnr'^ 

GAGTGGGGAGAATGTCACCAGGGTCGGCACCTTCATCCTGGTGGGCTTCCCCACGGCC 

CCAGGGCTGCAGTACCTGCTCTTCCTCCTCTTCCTGCTCACCTACCTCTTTGTCCTGG 

TGGAGAACCTGGCCTiTCATCCTCACCGTCTGGAGCAGCACCTCCCTCCACAGGCCCAT 

GTACTACTTTCTGAGCTCCATGTCTTTCCTAGAGATCTGGTACGTGTCTGACATCACC 

CCCAAGATGCTGGAGGGCTTCCTCCTCCAGCAGAAACGCATCTCTTTCGTCGGGTGCA 

TGACGCAGCTCTACTTCTTCAGCTCCCTGGTGTGCACCGAGTGTGTGCTTCTGGCCTC 

CATGGCCTACGACCGCTACGTGGCCATCTGCCACCCGCTGCGCTACCACGTCCTTGTG 

ACCCCGGGGCTGTGCCTCCAGCTGGTGGGCTTCTCCTTTGTGAGTGGCTTCACCATCT 

CCATGATCAAGGTCTGTTTTATCTCCAGCGTCACGTTCTGTGGCTCCAACGTCTTGAA 

CCACTTCTTCTGTGACATTTCCCCCATCCTC^GCTGGCCTGCACGGACTTCTCCACT 

GCAGAGCTGGTGGATTTCATTCTGGCCTTCATCATCCIX3GTGTTTCCACTCCTGGCCA 

CCATGCTGTCATATGCGCACATCACCCTGGCTGTCCTGCGCATCCCCTCGGCCACCGG 

CTGCTGGAGAGCCTTCTTCACCTGCGCCTCTCACCTCACCGTGGTCACCGTCTTCTAT 

ACAGCCTTGCTTTTCATGTATGTCCGGCCCCAGGCCATTGATTCCCGGAGCTCCAACA 

AGCTCATCTCTGTTTTGTACACAGTTATCACCCCCIATCTTGAACCCCTTGATATACTG 

CCTGAGGAATAAGGAATTTAAGAATGCCTTGAAAAAAGCCTTCGGCTTGACGAGCTGC 

GCCGTAGAGGGGAGGCTTTCTAGTCTTCTGGAACTTCATCTCCAAATACACAGCCAGC 
CTCTCTGAGGAGGCCATTTGACT 




ORF Start: ATG at 21 


ORF Stop: TGA at 1050 




SEQ ID NO: 44 


343 aa I MW at 38584.5kD 


NOV 1 8a, 
CG56763-01 Protein Sequence 


MFCRPAAPKHRGMSGENVTRVGTFILVGFPTAPGLQYLLFLLFLLTYLFVLVENLAII 
LTVWSSTSLHRPMYYFLSSMSFLEIWYVSDITPKMLEGFLLQQKRISFVGCMTQLYFF 
S S LVCTECVLLASMAYDRYVAI CH PLRYHVLVTPGLCLQLVG FS FVS G FT I SM I KVCF 
ISSVTFCGSNVLNHFFCDISPILKLACTDFSTAELVDFILAFIILVFPLLATMLSYAH 
I TLAVLRI PSATGCWRAFFTC^SHLTVSTTVFYTALLFMYVRPQAI DSRSSNKLI SVLY 
TVITPILNPLIYCLRNKEFKNALKKAFGLTSCAVEGRLSSLLELHLQIHSQPL 



Further analysis of the NOV 18a protein yielded the following properties shown 
Table 18B. 



Table 18B. Protein Sequence Properties NOV18a 


PSort analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0. 1 000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 64 and 65 



A search of the NOV1 8a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 18C. 
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Table ISC. Geneseq Results for NOV18a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV 18a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG71967 


Human olfactory receptor polypeptide, 

SEQ ID NO: 1648 - Homo sapiens, 
319 aa. (WO200127158-A2, 19-APR- 
2001] 


I3..332 
1.319 


304/320 (95%) 
308/320 (96%) 


e-173 


AAG71962 


Human olfactory receptor polypeptide, 

SEQ ID NO: 1643 - Homo sapiens, 
314 aa. [WO200127158-A2, 19-APR- 
2001] 


13..319 
1..307 


289/307 (94%) 
299/307 (97%) 


e-167 


ABG 16907 


Novel human diagnostic protein 
#16898 - Homo sapiens, 291 aa. 
[WO200175067-A2, ll-OCT-2001] 


5..295 
20..289 


267/291 (91%) 
267/291 (91%) 


e-150 


4 ABG 16907 


Novel human diagnostic protein 
#16898 - Homo sapiens, 291 aa. 
[WO200175067-A2, ll-OCT-2001] 


5..295 
20..289 


267/291 (91%) 
267/291 (91%) 


e-150 


AAG71675 


Human olfactory receptor polypeptide, 

SEQ ID NO: 1356 - Homo sapiens, 
327aa.[WO200127158-A2, 19-APR- 
2001] 


15..313 
6..308 


177/303 (58%) 
224/303 (73%) 


e-100 



In a BLAST search of public sequence datbases, the NOV 1 8a protein-was found to 
have homology to the proteins shown in the BLASTP data in Table 18D. 
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Table 18D. Public BLASTP Results for NOV18a 


Protein 
Accession 
Number 


Protpin/Oroanicm/T onrrfki 
r i viciii/ wl gdUISui/ LfCngln 


NOV18a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q8VGU5 


OLFACTORY RECEPTOR MORI 03- 
2 - Mus musculus (Mouse), 3 12 aa. 


13.319 
1..307 


270/307 (87%) 
287/307 (92%) 


e-156 


Q8VGU4 


OLFACTORY RECEPTOR MORI 03- 
3 - Mus musculus (Mouse), 3 12 aa. 


13 31Q 
I. .307 


OA i lien /©co/ \ 
284/307 (92%) 


e-153 


Q9EPG2 


M5I OLFACTORY RECEPTOR - 
Mus musculus (Mouse), 3 14 aa. 


17..322 
5.3 10 


211/306(68%) 
250/306(80%) i 


e-124 


Q8VGU1 


OLFACTORY RECEPTOR MOR103- 
4 - Mus musculus (Mouse), 3 14 aa. 


17.322 
5.310 


210/306(68%) 
249/306(80%) 


e-124 


Q9EPV0 


M50 OLFACTORY RECEPTOR 
(OLFACTORY RECEPTOR M50) 
(OLFACTORY RECEPTOR 
MORI 03- 16) - Mus musculus (Mouse), 
316 aa. 


16.315 
2.301 


209/300 (69%) 
246/300 (81%) 


e-121 



PFam analysis predicts that the NOV 18a protein contains the domains shown i: 
the Table 18E. 



Table 18E. Domain Analysis of NOV18a 


Pfam Domain 


NOV 18a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 

i- 


7tm_l 


53. .302 


50/269(19%) 
181/269(67%) 


6.5e-27 


Colicin_V 


147..310 


29/197(15%) 
100/197(51%) 


0.43 



Example 19. 



The NOV19 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 1 9 A. 
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Table 19A. NOV19 Sequence Analysis 




SEQ ID NO: 45 


1071 bp 


NOV19a, 
CG56777-01 DNA Sequence 


GAAAAGGAGGAAAGACAGGTTAGCTGGGGATGATGACGGATCTGAAGCAAAGCCATTC 


AGTGAGGCTGAATGATGGACCCTTCATGCCAGTGCTGGGA 

GATCATACTCCCAAAAG CCAGGCTG CCGAGGCCACCAAAGTGGCTATTG ACGTAGGCT 
TCCGCCATATTGATTCAGC7^TACTTATACCAAAATGAGGAGGAGGTTGGACAGGCCAT 
TTGGGAGAAGATCGCTGATGGTACCGTCAAGAGAGAGGAAATATTCTACACCATCAAG 
CITTGGGCTACTTTCTTTCGGGCAGAATTGGTTCACCCGGCCCTAGAAAGGTCACTGA 
AGAAACTTGGACCGGACTATGTAGATCTCTTCATTATTCATGTACCATTTGCTATGAA 
GCCTGGGAAAGAATTACTGCCAAAGGATGCCAGTGGAGAGATTATTTrAGAAACTGTG 
GAGCTTTGTGACACTTGGGAGGCCCTGGAGAAGTGCAAAGAAGCAGGTTTAACCAGGT 
CCATTGGGGTGTCCAATTTCAATCACAAGCTGCrGGAACTCATCCTCAACAAGCCAGG 
GCTCAAGTACAAGCCCACCTGCAACCAGGTGGAATGTCACCCTTACCTCAACCAGAGC 
AAACTCCTGGAGTTCTGCAAGTCCAAGGACATTGTTCTAGTTGCCTACAGTGCCCTGG 
GATCCCAAAGAGACCC^CAGTGGGTGGATCCCGACTGCCCACATCTCTTGGAGGAGCC 
GATCTTG AAAT CCATTG CCAAGAAACACAGTGGAAGCCCAGGCCAGGTCGCCCTGCGC 
TACCAGCTG CAGCGGGGAGTGGTGGTG CTGGCCAAG AG CTTCTCTCAGGAG AG AATCA 
AAGAGAACTTCCAGATTTTTGACTTTGAGTTGACTCCAGAGGACATGAAAGCCATTGA 
TGGCCrrCAACAGAAATCTCCGATATGACAAGTTACAATTTGCTGCTAATCACCCTTAT 
TTTCCATTTTCTGAAGAATATTGACCATGAGCTATTGAACATTACCACCACAATTTCC 
TCCTTTTAGGTGACAGAGAGAGGATTT 




ORF Start: ATG at 30 


ORF Stop: TGA at 1008 




SEQ ID NO: 46 


326 aa 


MWat37132.2kD 


NOV 19a, 
CG56777-01 Protein Sequence 


MMTDLKQSHSVRLNDG P FM P VIiGFGT YAPDHTP KS QAAE ATKVA IDVG FRH I DSAYLY 
QNEEEVGQAIWEKIADGTVKREEIFYTIKIiWATFFRAELVHPALERSLKKLGPDYVDL 
FI IHVPFAMKPGKELLPKDASGEI ILETVELCDTWEALEKCKEAGLTRSIGVSNFNHK 
LLEL I LNKPGLKYKPTCNQVECHPYLNQS KLLE FCKS KD I VLVAYS ALG S QRDPQWVD 
PDCPHLLEE P I L KS I AKKHS GS PGQVALRYQLQRG VWLAKS F SQERI KENFQI FDFE 
LTPEDMKAIDGLNRNLRYDKLQFAANHPYFPFSEEY 




SEQ ID NO: 47 


1108 bp 


NOV 19b, 


GAAAAGGAGGAAAGACAGGTTAGCTGGGGATGATGACGGATCTGAAGCAAAGCCATTC 


CG56777-02 DNA Sequence 


AGTGAGGCTGAATGATGGACCCTTCATGCCAGTGCTGGGATTTGGCACTTATGCTCCT 
GAT CAT ACTCCCAAAAG CCAG G CTG C CG AGG CCAC CAAAGTGG CTATTG ACGTAG G CT 
TCC^CC^TATTGATTC^GCATACrTATACC^AAATGAGGAGGAGGTTGGACA 
TTGGGAGAAGATCGCTGATGGTACCGTCAAGAGAGAGGAAATATTCTACACCATCAAG 
CTTTGGGCTACTTTCTTTCGGGCAGAATTGGTTCACCCGGC 

AGAAACTTGGACCAGACTATGTAGATCTCTTCATTATTCATGTACCATTTGCTATGAA 

GGTCAGGTTCTTCCTGAGAGAGGC^GATTTGAGTGAGGTGCATAGGATTGCCAAAACT 

TTCCCTGGGAAAGAATTACTGCCAAAGGATGCCAGTGGAGAGATTATTTTAGAAACTG 

TGGAGCTTTGTGACACTTGGGAGGCCCTGGAGAAGTGCAAAGAAGCAGGTTTAACCAG 

GT CCATTGGGGTGTCC^^TTTCAATCACAAGCTG CTG G AACTCAT CCTC AACAAG C CA 

GGGCTCAAGTACAAGCCCACCTGCAACC^GGTGGAATGTCACCCTTACCT 

GCAAACTCCTGGAGTTCinrCAAGTCCAAGGACATTGTTCTAGTTGCCT 

GGGATCCCAAAGAGACCCACAGTGGGTGGATCCCGACTGCCCACATCTCTTGGAGGAG 

CCGATCTTGAAATCCATTGCC^GAAACTVCAGTCGAAGCCCAGGCCAGGTCGCCCTGC 

GCTACCAGCTGCAGCGG^GAGTGGTGGTGCTGGCCAAGAGCTTCT 

GRAAGAG AACTTCCAG ATTTTTGACTTTGAGTTGACT C CAGAGGACATGAAAGCCATT 

G ATGG CCTCAACAG AAATCTC CGATATGAT AAGTTACJU^TTTG CTX3CTAATCAC CCTT 

ATTTTC CATrrrCTGAAGAATATTGAT CACCCTTATTTT CCIATTTTCTGAAG AATATT 

GATCAC 




ORF Start: ATG at 30 


ORF Stop: TGA at 1068 




SEQ ID NO: 48 


346 aa 


MWat39673.1kD 


NOV 19b, 
CG56777-02 Protein Sequence 


^TDIJCQSHSVRLNDGPFMPVIXSFGTYAPDHTPKS QAAEATKVAIDVG FHH IDS AYLY 
QNEEEVGQAI WEKI ADGTVKRE E I FYTI KLWATFFRAELVHPALERSLKKLGPDYVDL 
FI IHVPFAMKVRFFIJlEADLSEVHRIAKTFPGKELIiPKDASGEIILETVELOT 
EK(IKEAGLTRSIGVSNFNHKLLELILNKPGLKYKOT 

DIVLVAYSALGSQRDPQWVDPDCPHLLEEPILKSIAKKHSRSPGQVALRYQLQRGVW 
LAKSFSQFillK^^QIFDFELTPEDMKAIDGIJiRNIJlYDKLQFAANHPYFPFSEEY 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 19B. 
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Table 19B. Comparison of NOV19a against NOV19b. 


Protein Sequence 


NOV 19a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV 19b 


I. .326 
I. .346 


323/346 (93%) 
323/346.(93%) 



Further analysis of the NOV 19a protein yielded the following properties shown in 
Table 19C 



Table 19C. Protein Sequence Properties NOV19a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.4275 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in Iysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 19a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 19D. 
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Table 19D. Geneseq Results for NOV19a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV 19a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAM79455 i 


Human protein SEQ ID NO 3101- 
Homo sapiens, 325 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


4.32o 
4.325 


269/323 (83%) 


C 1 J 1 


AAM78471 


Human protein SEQ ID NO 1 1 j-> - 
Homo sapiens, 323 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


4..3 zo 
2.323 


269/323 (83%) 


P .ni 


AAW14799 


TypeS 17-beta-hydroxysteroid 
aenyarogenase - nomo sapiens, 
323 aa.[W0971U62-Al,27- 
MAR- 1997] 


4..326 
2.323 


220/323 (68%) 
269/323 (83%) 


e-131 


AAB43444 


Human cancer associated protein 
sequence SEQ ID NO:889 - Homo 
sapiens, 336 aa. [WO200055350- 
Al,21-SEP-2000] 


1.326 
10.336 


216/327(66%) 
268/327(81%) 


e-128 


AAB76865 


Human lung tumour protein related 
protein sequence SEQ ID NO: 783 - 
Homo sapiens, 364 aa. 
[ WO200 1 00828-A2, 04-J AN- 
2001] 


33.326 
71.364 


200/294 (68%) 
248/294 (84%) 


e-120 



In a BLAST search of public sequence datbases, the NOV19a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 19E. 



155 



WO 03/010327 



PC17US02/14199 



Table 19E. Public BLASTP Results for NOV19a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV 19a 
Residues/ 
Match Residues 


IHpntitipc/ 

JtU&llLIlICo/ 

Similarities for 
the Matched 
Portion 


Expect 
Value 


P05980 


Prostaglandin-F synthase 1 (EC 
1. LI. 1 88) (PGF synthase 1 ) (PGF 1 ) 
(Prostaglandin^ 1 1 reductase I) 
(PGFSI) - Bos taurus (Bovine), 323 aa. 


7..326 
4..323 


229/320 (71%) 
270/320 (83%) 


e-134 


P52897 


Prostaglandin-F synthase 2 (EC 
1.1.1.188) (PGF synthase 2) (PGF 2) 
(Prostaglandin^ 1 1 reductase 2) 
(PGFSII) - Bos taurus (Bovine), 323 aa. 


7..326 
4..323 


228/320 (71%) 
268/320 (83%) 


e-134 


P52898 


Dihydrodiol dehydrogenase 3 (EC 1 .-.-.-) 
(Prostaglandin F synthase) - Bos taurus 
(Bovine), 323 aa. 


11. .326 
8..323 


226/316(71%) 
264/316(83%) 


e-133 


Q9NS54 


3ALPHA-HYDROXYSTEROID 
DEHYDROGENASE VARIANT (EC 
1.1.1213)- Homo sapiens (Human), 323 
aa. 


11. .326 
8..323 


216/316(68%) 
265/316(83%) 


e-131 


P80508 


Prostaglandins 9-reductase (EC 
1 . 1 . 1 . 1 89) (20-alpha-hydroxysteroid 
dehydrogenase) (EC 1 . 1 . 1 . 149) (20- 
alpha-HSD) - Oryctolagus cuniculus 
(Rabbit), 323 aa. 


11. .326 
8..323 


219/316(69%) 
259/316(81%) 


e-131 



PFam analysis predicts that the NOV 19a protein contains the domains shown in 
the Table 19F. 



Table 19F. Domain Analysis of NOV19a 


Pfam Domain 


NOV 19a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


aldoJcet_red 


13..306 


164/368 (45%) 
261/368(71%) 


1.5e-144 



Example 20. 



The NOV20 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 20A. 
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Table 20A. NOV20 Sequence Analysis 




SEQ ID NO: 49 


979 bp 


NOV20a, 
CG56941-01 DNA Sequence 


TGGAAAACTTGTCACAGTTTAGGACAACTCTCTTTGTAAAATTAGTGGTTCTTGGGAG 
TCAAAATGCTTCCTGGTGCCGGGAGTGAGCGATGAGCCGGCTTCTGTCCCTGGCGGCC 
AGAGTCGCCTTGGCCGCCTTGCCCTGCCGCCGCTCTCGCGGGTTCGGGATGTTCTATG 
CCGTGAGGAGGGGCGGCAAGTCCGCGGTCTTTCTGACCGGGAATGAGTGCAAAGCACA 
GGTGGACCGGTTTCCTGCTGCCAGATTTAAGAAGTTTGCCACAGAGGATGAGGCCTGG 
GACTTTGTCAGGAAATCTGCAAGCCCGGAAGTTTCAGAAGGGCAGGAAAATCAACATG 
GACAAGAATCGGAGACGAAAGCCAGCAAGCGACTCCGTGAGCCACTGGATGGAGATGG 
AG ATGAAAGCGCAG AG CCATATGCAAGCACAATGAAG CCGAGCGTGAAG CCGG CGCCT 
CCAGTTAGCAGAGACACGTTTTCCTACATGGGAGACTTCGTCGTCGTCTACGCTGATG 
GCTGCTGCTCCAGTAATGGGCGTAGAAAGCCACGAGCAGGAATCCGCGTTTACTGGGG 
GCCGGGCTATCCTTTAACTGTAGGCATTAGACTTCCTGGGTGGCAGACAAACCAAAGA 
GCGG AAATTCATGCAGCCTG CAAGG CCATTG AACAAGCACAGACTCAG AAAATCAATA 
AACTGGTTCTGTATACAGACAGTATGTTTACGATAAATGGTATAACTAACTGGGTTCA 
AGGTTGGAAGAAAAATGGGTGGAAGACAAGTGCAGGGAAAGAGGTGATCAACAAAGAG 
GACTTCGTGGCACTGGAGAGGCTGACCCAGGGGATGGACATTCAGTGGGCGAGTATCT 
TAAATGTTCATGTCCC^TGGCTATTGTGACCAAATAAGCTAAGTTTACTAGAGATTAA 
TGGTTTATGTAAGTGGTTTAATGACATAACAATTTCCATGAAGTTAATATC 




ORF Start: ATG at 90 


ORF Stop: TGA at 897 




SEQ ID NO: 50 j 269 aa 


M W at 29980.9kD 


NOV20a, 
CG56941-01 Protein Sequence 


MSRLLSLAARVAIAALPCRRSRGFGMFYAVRRGGKSAVFLTGNECKAQVDRFPAARFK 
KFATEDEAWDFVRKSASPEVSEGQENQHGQESETKASKRLREPIiDGDGDESAEFYAST 
MKPSVKPAPPVSRDTFSYMGDFWVYADGCCSSNGRRKPRAGIRVyWGPGYPLTVGIR 
LPGWQTNQRAE I HAACKAI E QAQTQKI NKL VLYTDSM FT INGI TNWVQG WKKNGWKTS 
AGKEVINKEDFVALERLTQGMDIQWAS I LNVHVPWLL 



Further analysis of the NOV20a protein yielded the following properties shown in 
Table 20B. 



Table 20B. Protein Sequence Properties NOV20a 


PSort analysis: 


0.3700 probability located in outside; 0.2339 probability located in microbody 
(peroxisome); 0.1080 probability located in nucleus; 0.1000 probability located in 
endoplasmic reticulum (membrane) 


SignalP analysis: 


Cleavage site between residues 24 and 25 



A search of the NOV20a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 20C. 
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Table 20C. Geneseq Results for NOV20a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV20a 
Residues/ 
Match Re^idue^ 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY70235 


Human RNA-associated protein- 1 6 
(RNAAP-16) - Homo sapiens, 286 
aa. [ WO2000 11171 -A2, 02-MAR- 
2000] 


I..266 
1..262 


236/267 (88%) 
244/267 (90%) 


e-134 


AAB97508 


Human type II RNase H protein - 
Homo sapiens, 286 aa. 
[WO200123613-A1, 05-APR-2001] 


1..266 
1..262 


235/267 (88%) 
242/267 (90%) 


e-134 


AAY25094 


Human type 2 RNase H protein - 
Homo sapiens, 286 aa. [W09928447- 
Al, 10-JLTN-1999] 


1..266 
L.262 


235/267 (88%) 
242/267 (90%) 


e-134 


AAB97511 


E coli type 11 RNase H protein - 
Escherichia coli, 216 aa. 
[ WO200 123613-A1, 05-APR-200 1 ] 


23..266 
4..206 


157/244 (64%) 
178/244 (72%) 


2e-85 


AAY25098 


Mouse RNase H homoiogue protein 
fragment - Mus sp, 216 aa. 
[W09928447-A1, 10-JUN-1999] 


23..266 
4..206 


157/244 (64%) 
178/244 (72%) 


2e-85 



In a BLAST search of public sequence datbases, the NOV20a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 20D. 



Table 20D. Public BLASTP Results for NOV20a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV20a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


060930 


Ribonuclease HI (EC 3.1.26.4) 
(RNase HI) (Ribonuclease H type II) 
- Homo sapiens (Human), 286 aa. 


1..266 
1..262 


237/267 (88%) 
244/267 (90%) 


e-134 


Q8VCR6 


RIBONUCLEASE HI - Mus 
musculus (Mouse), 285 aa. 


1..266 
1.261 


183/267(68%) 
213/267(79%) 


e-101 


070338 


Ribonuclease HI (EC 3.1.26.4) 
(RNase HI) - Mus musculus 
(Mouse), 285 aa. 


1..266 
1..261 


183/267(68%) 
213/267(79%) 


e-101 


Q91953 


MRN A, COMPLETE CDS, CLONE 
CLFEST65 - Gallus gallus (Chicken), 
293 aa. 


M.. 266 
6..269 


141/270 (52%) 
178/270(65%) 


2e-71 


Q21024 


F59A6.6 PROTEIN - Caenorhabditis 
elegans, 369 aa. 


27..260 
67..342 


76/276 (27%) 
122/276(43%) 


2e-25 
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PFam analysis predicts that the NOV20a protein contains the domains shown in 
the Table 20E. 



Table 20E. Domain Analysis of NOV20a 


Pfam Domain 


NOV20a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


RnaseH 


I36..257 


50/147 (34%) 
93/147 (63%) 


l.le-27 



Example 21. 



The NOV21 clone was analyzed, and the nucleotide and encoded polypeptide 
5 sequences are shown in Table 2 1 A. 
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Table 21A. NOV21 Sequence Analysis 




SEQ ID NO: 51 


2536 bp 


NOV21a, 
CG57 109-01 DNA Sequence 

! 

<- 

j 

| 

i 

\ 

i 

j 

i 


GGGAOVCTGACATGGACTGAAGGAGTAGAAAAGT^GCCTTGGGCTCTCCCAGATGGAA 

GAATGACCGTGTGAGGAAACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCT 

GATTTCTTCAGGGAAGGGGATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGA 

AGAGCATTCAGGTGGCTGTAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACT 

GGCCCAGCACAGCCGTGCCCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCT 

CTGAAAGGAGACCACCGCTGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTG 

CAGGATGCAAGGCAGCCATGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGA 

TGACAGAGCGAGGACCCAGAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGT 

AGCAAGCCCCCCAGGGAAGCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATC 

TTGGGGTGGAGATTGAAAAGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAG 

AGAGAGGGAGCTTCAGCAGAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAG 

CTGGATATGGGGAAGGGCCCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCT 

GCAGGAGGTCTCCCGAGGCAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAG 

CCACAGGAGCAGCCCCAGGAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATG 

G ACAAGAAAG AGGACAG AGGCCCAG AGG ATCAAG AAAG CCATG CTCAGGGAGCAGCCA 

AGGCCAAGAAGGACCTTGTGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGGGAGGT 

GAAGAAGGACACCAGGCCCATGAG CAGG AGCAAACATGGTGG CTGG CTCCTGAGAGAG 

CACCAGGCGGGCTTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGA 

AGGAGAAAAAGCCATGTATGTCTGGAGGCAGAAGGATGACTCTCAGAGATGACCAACC 

TG CAAAGCTAGAAAAGGAGCCCAAG ACGAGGCCAG AAGAGAACAAGCCAGAG CGG CCC 

AGCGGTCGGAAGCCACGGCCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATG 

AGACTGGCCGGGTCATTGGGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCG 

CX5AGACCAGGCAGGCCTATGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAG 

GAGGACATGGTGGACAGTGAGATCTTGATCATCCAGAGCCTCTCTCACCCCAACATCG 

TGAAATTGCATGAAGTCTACGAAACAGACATGGAAATCTACCTGATCCTGGAGTACGT 

GCAGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTGAAGTTCCCGGAGCCCGAT 

GCTGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCCACATGCACGACAAGAGCA 

TTGTCCACCGGGACCTCAAGCCGGAAAACCTTTTGGTTCAGCX5AAATGAGGACAAATC 

TACTACCTTGAAATTGGCTGATTTTGGACTTGCAAAGCATGTGGTGAGACCTATATTT 

ACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAAATTCTTTCTGAGAAAGGTTATG 

GACTGGAGGTGGACATGTGGGCGGCTGGCGTGATCCTCTATATCCTGCTGTGTGGCTT 

TCCCCCATTCCGCAGCCCTGAGAGGGACCAGGACGAGCTCTTTAACATCATCCAGCTG 

GGCCACTTTGAGTTCCTCCCCCCTTAGTGGGACAATATCTCTGATGCTGCTAAAGATC 

TGGTG AGCCGGTTGCTGGTGGTAGACCCCAAAAAGCG CTACACAGCT CATCAGGTTCT 

TCAGCACCCCTGGATCGAAACAGCTGGCAAGACCAATACAGTGAAACGACAGAAGCAG 

GTGTCCCCCAGCAGCGAGGGTC^CTTCCGGAGCCAGCACAAGAGGGTTGTGGAGCAGG 

TATCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTCTGCTCAAGGACAGAGA 

AAAGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCnTCTTCACATAATTGGTGAA 

TCAGAGGGAGAGACACTGAGTATA1TTTAAAGCATATTAAAAAAATTAAGTCAATGTT 

AAATGT CACAACAT ATTTTTAGATTTGT AT ATTTAAAG C CTTT AAT ACATTTTTGGGG 

GGTAAGCATTGTCATCAGTGAGGAATTTTGGTAATAATGA^ 

TGTAACCAAGTTTATTCTGTACTACAGGAGTGGTGCTTACCAGGGTCTAAACTCCCCC 
TGTG AG ATTAATAAGG TG C ATTGTGGTCTTT CT GTGTTAATAAAATGTG CTCTGAATA 
ACAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 




ORF Start: ATGat 150 


ORF Stop: TAG at 2094 




SEQ ID NO: 52 


648 aa | MW at 73813.6kD 1 


NOV21a, 
CG57 109-01 Protein Sequence 


MGKEPLTLKSIQVAVEELYPNKARALTLAQHSRAPSPRiRSRLFSKALKGDHRCGETE 
TPKSCSEVAGCKAAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEE 
RHARGEKHLGVE I EKTSGEI IRCEKCKRERELQQSLERERLSLGTSELDMGKGPMYDV 
EKXVRTRSCRRSPEANPASGEEGWKGDSHRSSPRNPTQELRRPSKSMDKKEDRGPEDQ 
ESHAG^AAKAKKDLVEVLPWEEGLREVKKDTRPM 

RGEEKEAEKEKKPCMSGG1UIMTLRDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGII 
AANVE KHYETGRVI GDGNFAWKECRHRETRQA YAM K I IDKSRLKGKEDMVDSEILII 
QSLSHPNIVKLHEVYETDMEI YLILEYVQGGDLFDAI I ESVKFPEPDAALMIMDLCKA 
LVHMHDKS I VHRDLKPENLLVQRl^DKSTTLKLADFGLAKHVVRPI FTVCGTPTYVAP 
E I LS EKGYGLEVDMWAAGVI LY I LLCGFPPFRS PERDQDELFNI IQLGHFEFLPPYWD 
NI SDAAKDLVSRLLWDPKKRYTAHQVLQHPWI ETAGKTNTVKRQKQVSPSSEGHFRS 
QHKRWEQVS 




SEQ ID NO: 53 


2808 bp 


NOV21b, 
CG57 109-02 DNA Sequence 

• 


TTTACAGAGTCAGGCCTCACCGTGAGAGGGCTCCTGTATTAGTCCCITTTCA 
TTATAGAGACATACCTG AGACTGGG CAATTTG CAG AGAAAGGTT/TTCTTGG ACTTACA 
GTTAGTTCCACGTCGCTGGGGAAGCCTCACAATCATGGCGGAAGGCAAGGAAGGG 
GTCCCATCTITACAT^XSATGG CAGCAGGCAAAG AG AGAATGAGGAAG ATGCAAAAG CGG 
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AAACCCCTGATGTAACCATCAGATCTTATGAGATTTATTCA.CTACCATGGAACAGACA 
GC31AGGCGTATGTGACCATTCTCTAGAATATTT^ 

CTGCAAGGCTCCTGG CTG CCTG CCAGCCGAG^AATCrGGAGAAACCATTCCTGGGG C 
CGCGTGGCCCCGTCGTGCCCTTGTTCTGCCCTCGGAATGGCCTTCACTCAGCACATCC 
TGAGAACAGCCCTCTGAAGCCCAGGGTCGTGACCGTAGTGAAGCTGGGTGGGCAGCGC 
CCCCGAAAGATCACTCTGCTCCTCAACAGGCGATCAGTGCAGACGTTCGAGCAGCTCT 
TAGCTGACATCTCAGAAGCCTTGGGCTCTCCCAGATGGAAGAATGACCGTGTGAGGAA 
ACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCTGATTTCTTCAGGGAAGGG 
GATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGAAGAGCATTCAGGTGGCTG 
TAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTGGCCCAGCACAGCCGTGC 
CCCTTCTCC^GGCTGAGGAGaVGGCTGTTTAGCAAGGCTCTGAAAGGAGACCACCGC 
TGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGCAGGATGCAAGGCAGCCA 
TGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGATGACAGAGCGAGGACCCA 
GAAGAAGTGGGGGAGC3GGGAAATGGGAGCCAGAACCCAGTAGCAAGCCCCCCAGGGAA 
GCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATCTTGGGGTGGAGATTGAAA 
AGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGAGAGAGGGAGCTCCAGCA 
GAG CCTGGAGCGTGAGAGG CTTTCTCTGGGGACCAGTGAGCTGG ATATGGGGAAGGGC 
CCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCTGCAGGAGGTCTCCCGAGG 
CAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAGCCACAGGAGCAGCCCCAG 
GAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATGGACAAGAAAGAGGACAGA 
GGCCCAGAGGATCAAGAAAGCCATGCTCAGGGAGCAGCCAAGGCCAAGAAGGAGCTTG 
TGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGGGAGGTGAAGAAGGACACCAGGCC 
CATGAGCAGGAGCAAACATGGTGGCTGG CTC CTGAGAG AGCACCAGGCGGGCTTTGAG 
AAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAAGGAGAAAAAGCCATGTA 
TGTCTGGAGGC^GAAGGATGACTCTCAGAGATGACCAACCTGCAAAGCTAGAAAAGGA 
GCCCAAGACGAGGCCAGAAGAG AACAAGCCAGAG CGG CCCAGCGGTCGGAAGCCACGG 
CCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATGAGACTGGCCGGGTCATTG 
GGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGCGAGACCAGGCAGGCCTA 
TGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGGAGGACATGGTGGACAGT 
GAGATCTTGATCATCCAGAGCCTCTCTCACCCCAACATCGTGAAATTGCATGAAGTCT 
ACGAAACAGACATGGAAATCTACCTGATCCTGGAGTACGTGCAGGGAGGAGACCTTTT 
TGACGCCATCATAGAAAGTGTGAAGTTCCCGGAGCCCGATGCTGCCCTCATGATCATG 
GACTTATGCAAAGCCCTCGTCCACATGCACGACAAGAGCATTGTCCACCGGGACCTCA 
AGCCGGAAAACCTTTTGGTTCAGCGAAATGAGGACAAATCTACTACCTTGAAATTGGC 
TGATTTTGGACTTGCAAAGCATGTGGTGAGACCTATATTTACTGTGTGTGGGACCCCA 
ACTTACGTAGCTCCCGAAATTCTTTCTGAGAAAGGTTATGGACTGGAGGTGGACATGT 

TGAGAGGGACCAGGACGAGCTCTTTAACATCATCCAGCTGGGCCACTTTGAGTTCCTC 
CCCCCTTACTGGGACAATATCTCTGATACAGCTGCTAAAGATCTGGTGAGCCGGTTGC 
TGGTGGTAGACCCCAAAAAGCGCTACACAGCTCATCAGGTTCTTCAGCACCCCTGGAT 
CGAAACAGCTGGCAAGACCAATACAGTGAAACGACAGAAGCAGGTGTCCCCCAGCAGC 
GAGGGTCACTTCCGGAGCCAGCACAAGAGGGTTGTGGAGCAGGTATCATAGTCACCAC 



C TTGGG AAT CTGTCCAG CCCCCAG TT CTG CTCAAGGACAG AGAAAAGG ATAGAAGTTT 
G AGAGAAAAACAATG AAAG AGG CTTCTTCAC ATAATTGGTGAAT CAGAGGG AGAGACA 
CTG AGTATATTTTAAAG CATATT A 



ORF Start: ATG at 151 



ORF Stop: TAG at 2659 



SEQ ID NO: 54 



836 aa 



MWat95152.6kD 



NOV21b, 
CG57 109-02 Protein Sequence 



MAEGKEGQVPSYMDGSRQRENEEDAKAETPDVTIRSYEIYSLPWNRQQGVCDHSLEYL 
SSRISERKLQGSWLPASRGNXiEKPFLGPRGPVVPLFCPRNGLHSAHPENSPLKPRVVT 
WKLGGQRPRKITLIiLNRRS VQTFEQLLAD I S EALGS PRWKNDRVRKLFNLKGREI RS 
VSDFFREGDAF IAMGKEPI/TLKS I QVAVEELYPNKARALTLAQHSRAPS PRLRSRLFS 
KALKGDHRCGETETPKS CS EVAGCKAAMRHQGKI PEELS LDDRARTQKKWGRGKWE PE 
PSSKPPREATLEERHARGEKHLGVEIEKTSGEIIRCEKCKRERELQQSIjERERLSLGT 
S ELDMG KG PMYDVE KLVRTRS CRRS P EANPAS GEEGWKGDSHRS S PRNPTQELRRPS K 
SMDKKEDRGPEDQESHAQGAAKAKKDLVEVLPWEEGLREVKKDTRPMSRSKHGGWIi 
REHQAGFEKLRRTRGEEKEAEKEKKPCMSGGRRMTLRDDQPAKLEKEPKTRPEENKPE 
RPSGRKPRPMGI I AANVEKHYETGRVI GDGNFAWKECRHRETRQAYAMKI IDKSRIiK 
GKEDMVDSEILI IQSLSHPNIVKLHEVYETDMEIYLIIiEYVQGGDLFDAI IESVKFPE 
PDAALMIMDLCKALVHMHDKS I VHRDLKPENLLVQRNEDKSTTLKIJ^DFGLAKHVVRP 
IFTVCGTPTYVAPEILSEKGYGLEVDMWAAGVILYILLCGFPPFRSPERDQDELFNII 
QI^HFEFLPPYWDNISDTAAKDLVSRLLVVDPKKRYTAHQVLQHPWIETAGKTNTVKR 
QKQVSPSSEGHFRSQHKRWEQVS 
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SEQIDNO: 55 


J 3016 bp 


NOV21c 
CG57109-03 DNA Sequence 

i 

! 


1T1 1 ACAGAGTCAGGCCTCACCGTGAGAGGGCTCCTGTATTAGTCCCTTTTCATGCTGC 

TTATAGAGACATACCTGAGACTGGGCAATTTGCAGAGAAAG^ 

GTTAGTTCCACGTGGCTGGGGAAGCCra 

GTCCCATCTTACATGGATGGCAGCAGGCAAAGAGAGAATGAGGAAGATGCAAAAGCGG 
AAAC CCCTGATGTAACCAT CAGATCTTATGAG ATTTATTCACTACCATGG AACAG ACA 
GCAAGGCGTATGTGACCATTCTCTAGAATATTTAAGCrCGAGAATCTCAGAGCGGAAG 
CTGCAAGGCTCCTGGCTGCCTGCCAGCCGAGGGAATCTGGAGAAACCATTCCTGGGGC 
CGCGTGGCCCCGTCGTGCCCTTGTTCTGCCCTCGGAATGGCCTTCACTCAGCACATCC 
TGAGAACAGCCCTCTGAAGCCCAGGGTCGTGACCGTAGTGAAGCTGGGTGGGCAGCGC 
CCCCGAAAGATCACTCTGCTCCTCAACAGGCGATCAGTGCAGACGTTCGAGCAGCTCT 
TAGCTGACATCTCAGAAGCCTTGGGCTCTCCCAGATGGAAGAATGACCGTGTGAGGAA 
ACTGTTTAACCTCAAGGGCZAGGGAAATCAGGAGCGTCTCTGATTTCTTCAGGGAAGGG 
G ATGCTTTCATAG CTATGGG CAAAGAACCACTGACACTG AAGAG CATTCAGGTGGC TG 

TAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTGGCCCAGCACAGCCGTGC 

CCCTTCTCCAAGGCTGAGGAGCACX3CTGTTTAGC^GGCTCTGAAAGGAGACCACCGC 

TGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGCAGGATGCAAGGCAGCTA 

TGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGATGACAGAGCGAGGACCCA 

GAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTAGCAAGCCCCCCAGGGAA 

GCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATCTTGGGGTGGAGATTGAAA 

AGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGAGAGAGGGAGCTTCAGCA 

GAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAGCTGGATATGGGGAAGGGC 

CCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCTGCAGGAGGTCTCCCGAGG 

CAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAGCCACAGGAGCAGCCCCAG 

GAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATGGACAAGAAAGAGGACAGA 

GGCCCAGAGGATCAAGAAAGCCATGCTCAGGGAGCAGCCAAGGCCAAGAAGGACCTTG 

TGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGGGAGGTGAAGAAGGACACCAGGCC 

CATGAGCAGGAGCAAACATGGTGGCTGGCTCCTGAGAGAGCACCAGGCGGGCTTTGAG 

AAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAAGGAGAAAAAGCCATGTA 

TGTCTGGAGGCAGAAGGATGACTCTCAGAGATGACCAACCTGCAAAGCTAGAAAAGGA 

GCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCCCAGCX3GTCGGAAGCCACGG 

CCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATGAGACTGGCCGGGTCATTG 

GGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGCGAGACCAGGCAGGCCTA 

TGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGGAGGACATGGTGGACAGT 

GAGATCTTGCATGAAGTCTACGAAACAGACATGGAAATCTACCTGATCCTGGAGTACG 

TGCAGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTGAAGTTCCCGGAGCCCGA 

TGCTGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCCACATGCACGACAAGAGC 

ATTGTC CACCGGG ACCTCAAG CCGG AAAACCTTTTGGTTCAGCGAAATGAGGACAAAT 

CTACTAC CTTGAAATTGGCTG ATTTTGGACTTG CAAAGCATGTGGTGAGACCT 

TACTGTGTGTGGGACCCGAACTTACGTAGCTCCCGAAATTCTTTCTGAGAA^ 

GGACTGGAGGTGGACATGTGGGCTGCTGGCGTGATCCTCTATATCCTGCTGTGTGGCT 

TTCCCCCATTCCGCAGCCCTGAGAGGGACCAGGACGAGCTCrTTAACATC^TCCAG^ 

GGGCCACTTTGAGTTCCTCCCCCCTTACTGGGAC^TATCTCTGATGCTGCT 

CTGGTGAGCCGGTTGCTGGTGGTAGACCCCAAAAAGCGCTACACAGCTCATCAGGTTC 

TTCAGCACCCCTGGATCGAAACAGCTGG CAAG ACCAATACAGTGAAACGACAGAAGCA 

GGTGTCCCCCAGCAGCGAGGGTCACTTCCGGAGCCAGCACAAGAGGGTTGTGGAGCAG 

GTATCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTCTGCTC^AGGACAGAG 

AAAAGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCTTCrTCACATAATTGGTGA 

ATC^GAGGGAGAGACACTGAGTATATTTTAAAGCATATTAAAAAAATTAAGTCAATG^ 

TAAATGTCACAACATATTTTTAGATTTGTATATTTAAAG^ 

GGGTAAGCATTGTCATCAGTGAGGAATTTTGGTAATAATGATGTGTTTTGCTTCCCCT 
TTGTAACCAAGTTTATTCTGTACTACAGGAGTGGTGCTTACCAGGGTCTAAACTCCCC 
CTGTG AGATTAATAAGGTGCACTGTGGTCTTTCTGTGTTAATAAAATGTG CTCTGAAT 




ORF Start: ATG at 52 


ORF Stop: TAG at 2617 




SEQIDNO: 56 


855 aa 1 MW at 97447.3kD 


NOV21c, 
CG571 09-03 Protein Sequence 

1 
1 


MLLIETYIiRIXSNI^RKVFLJDLQLV^ 

KAETPDVTIRS YEI YSLPWNRQQGVCDHSLEYLS SRISERKLQGSttLPASRGNLEKPF 
I^PRGPVVPLFCPRNGMSAHPENSPLKPRVVTVVK^^ 

QI^ADISEAIXSSPRWKNDRVRKLFNLKGREIRSVSDFFREGDAFIAMGKEPLTLKSIQ 
VAVEELYPNKARALTLAQHSRAPS PRLRSRLFSKALKGDHRCGETETPKS CS EVAGCK 
AA>HIHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEERHARGEKHLGVE 
IEKTSGEIIRCEKCKRERELQQSLERERLSIX3TSEU3MGKGPMYDVEKLVRTRSCRRS 
PEANPASGEEGWKGDSHRSSPRNPTQELRRPSKSMDKKEDRGPEDQESHAQGAAKAKK 
DLVEVLPVTEEGIJU2VKKDTRPMSRSKHGGWLLREHQAGFEK^ 

P04SGGRRMTLRDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGI IAANVEKHYETGR 
VTGDGNFAWKECRHRETRQAYAMKI IDKSRLKGKEDMVDSEILHEVYETDMEIYLIL 
EYVQGGDLFDAI I ESVKFPEPDAALMIMDLCKALVHMHDKSIVHRDLKPENLLVQRNE 
DKSTTLKLADFGLAKHWRP I FTVCGTPTYVAPEI LSEKGYGLEVDMWAAGVI LY I LL 
3GFPPFRSPERDQDELFKI IQLGHFEFXPPYMDNI SDAAKDLVSRLLVVDPKKRYTAH 
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QVLQHPWIETAGKTNTVKRQKQVSPS SEGHFRSQHKRWEQVS 




SEQIDNO:57 2433 bp 


NOV21d, 
CG57109-04 DNA Sequence 

I 
t 

j 

« 

l 

5 

i 

! 

1 

1 

j 

I 


TTTACAGAGTCAGGCCTCACCGTGAGAGGGCTCCITGT^ 

TTATAGAGACATACCTG AG ACTGGG CAATTTG CAGAGAAAGGTTTTCTTGGACTTACA 
GTTAGTTCCACGTGGCTGGGGAAGCCTCACAATCATGGCGGAAGGCAAGGAAGGGCAA 
GTCCCATCTTACATGGATGGCAGCAGGCAAAGAGAGAATGAGGAAGATGCAAAAGCGG 
AAACCCCTGATGTAACCATCAGATCTTATGAGATTTATTCACTACCATGGAACAGACA 
GCAAGGCGTATGTGACCATTCTCTAGAATATTTAAGCTCGAGAATCTCAGAGCGGAAG 
CTGCAAGGCTCCTGGCTGCCTGCCAGCCGAGGGAATCTGGAGAAACCATTCCTGGGGC 
CG CGTGGCCCCGTCGTG CCCTTGTTCTGCCCTCGGAATGGCCTTCACTCAGCACAT CC 
TGAGAACAGCCCTCTGAAGCCCAGGGTCGTGACCGTAGTGAAGCTGGGTGGGCAGCGC 
CCCCGAAAGATCACTCTGCTCCTCAACAGGCGATCAGTGCAGACGTTCGAGCAGCTCT 
TAGCTGACATCTCAGAAGCCTTGGGCTCTCCCAGATGGAAGAATGACCGTGTGAGGAA 
ACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCTGATTTCTTCAGGGAAGGG 
G ATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGAAGAG CATTCAGGTGG CTG 
TAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTGGCCCAGCACAGCCGTGC 
CCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCTCTGAAAGGAGACCACCGC 
TGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGCAGGATGCAAGGCAGCCA 
TGAGGCACCAGGGGAAGATCGCCGAGGAGCTTTCACTAGATGACAGAGCGAGGACCCA 
GAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTAGCAAGCCCCCCAGGGAA 
GCCACTCTGGAAG AG AGG CACG CAAGGGG AGAGAAGCATCTTGGGGTGG AGATTGAAA 
AGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGAGAGAGGGAGCTCCAGCA 
GAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAGCTGGATATGGGGAAGGGC 
CCAATGTATGATGTGGAGAAAAAGCCATGTATGTCTGGAGGCAGAAGGATGACTCTCA 
GAGATGACCAACCTGCAAAGCTAGAAAAGGAGCCCAAGACGAGGCCAGAAGAGAACAA 
GCCAGAGCGGCCCAGCGGTCGGAAGCCACGGCCCATGGGCATCATTGCCGCCAATGTG 
GAAAAG CATTATG AGACTGGCCGGGTCATTG GGGATGGG AACTTTGCTGTCGTGAAGG 
AGTGCAGACACCGCGAGACCAGGCAGGCCTATGCGATGAAGATCATTGACAAGTCCAG 
ACTCAAGGGCAAGGAGGACATGGTGGACAGTGAGATCTTGATCATCCAGAGCCTCTCT 
CACCCCAACATCGTGAAATTGCATGAAGTTTACGAAACAGACATGGAAATCTACCTGA 
TCCTGGAGTACGTGCAGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTGAAGTT 
CCCGGAGCCCGATGCTGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCCACATG 
CACGACAAGAGCATTGTCCACCGGGACCTCAAGCCGGAAAACCTTTTGGTTCAGCGAA 
ATGAGGACAAATCTACTACCTTGAAATTGGCTGATTTTGGACTTGCAAAGCATGTGGT 
GAGACCTATATTTACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAAATTCTTTCT 
GAGAAAGGTTATGGACTGG AGGTGGAC ATGTGGGCTG CTGG CGTGATCCTCTATAT CC 
TGCTGTGTGGCTTTCCCCCATTCCGCAGCCCTGAGAGGGACCAGGACGAGCTCTTTAA 
CATCATC CAG CTGGG CC ACTTTG AGTT CCTTCCCCCCTTACTGGGACAATATCTCTGAT 
ACAG CTG CTAAAGATCTGGTG AG CCGGTTGCTGGTGGTAGACCCCAAAAAGCGCTACA 
C^GCTC^TCAGGTTCTTCAGC^CCCCrrGGATCGAAACAGCTGGCAAGACCAATACAGT 
G AAACGACAG AAG CAGGTGTCCCCC AG CAG CGAGGGTCACTTC CG GAGCCAGCACAAG 
AGGGTTGTGGAGCAGGTATC^TAGTCACCACCTIX^^TCTGTCCAGCCCCCAGTTC 
TG CTCAAGGACAGAG AAAAGG ATAG AAGTTTG AG AGAAAAACAATGAAAGAGG C!TTCT 
TCAOVTAATTGGTGAATCAGAGGGAGAGACACrGAGTATATTTTAAAGCATATTA 




ORF Start: ATG at 52 


ORF Stop: TAG at 2284 




SEQ ID NO: 58 


744 aa 


MW at 84729.4kD 


NOV21d, 
CG57109-04 Protein Sequence 


MLLIETYIJUXSNLQRKVFLDLQLVPRGWGSLTIMA^ 

KAETPDVT I RS YE I YSLPWNRQQGVCDHS LE YLS SRI SERKLQGSWLPASRGNLEKPF 
LGPRG P WPL FCPRNGLHS AH PENS PLKPRVVTVVKLGGQRPRKITLLLNRRS VQTFE 
QLLADI SEALGS PRWKNDRVRKLFNLKGREI RSVSDFFREGDAFIAMGKEPLTLKSIQ 
VAVEELYPNKARALTLAQHSRAPSPRLRSRLFSKALKGDHRCGETETPKSCSEVAGCK 
AAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATIiEERHARGEKHLGVE 
IEKTSGETIRCEKCKRERELQQSLERERLSLGTSELDMGKGPMYDVEKKPCMSGGRRM 
TLRDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGIIAANVEKHYETGRVIGDGNFAV 
VKECRHRETRQAYAMKI IDKSRLKGKEDMVDSEILI IQSLSHPNIVKLHEVYETDMEI 
YLILEYVQGGDLFDAI I ESVKFPE PDAALMIMDLCKALVHMHDKS IVHRDLKPENLLV 
QRNED KSTTLKLAD FGLAKHVVR P I FTVCGT PTYVAPE I LS E KG YGL E VDMWAAGVI L 
YI IiLCG FPPFRS PERDQDEL FN I IQLGHFEFLPPYWDNI SDTAAKDLVSRLLWDPKK 
RYTAHQVLQHPWI ETAG KTNTVKRQKQVS PS S EGHFRS QHKRWEQVS 
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SEQIDNO:59 




2133 bp 


NOV21e, 
CG57109-05 DNA Sequence 


TTGACCXjTGTGAGGAAACTGTTTAACCrC^GGGC^ 

TTTCTTCAGGGAAGGGGATGCrTTCATAGCrCATGGGCAAAGAACCACT 

AG CATTCAGGTGGCTGTAGAAGAACTGTACCCCAAGAAAG CCCGGGCCCTGACACTGG 




GAAAGGAGACCACCGCTGTGGGGAGACCGAGACCCCCAAGAGCTGCMCGAAGTTGCA 
GGATGCAAGGCAGCCATGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGATG 
ACAGAGCGAGGACCCAGAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTAG 
CAAGCCCCCCAGGGAAGCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATCTT 
GGGGTGGAGATTGAAAAGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGAG 
AG AGGG AGCTCCAG CAGAGCCTGGAG CGTG AGAG G CTTTCTCTGG GGACCAGTG AGCT 
GGATATGGGGAAGGGCCCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCTGC 
AGGAGGTCTCCCGAGGCAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAGCC 
ACAGGAGCAGCCCCAGGAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATGGA 
CAAG AAAG AGG ACAGAGGCCCAGAGG AT CAAG AAAG CCATGCTCAGGG AG CGG CCAAG 
GCCAAGAAGGACCTTGTGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGAGAGGTGA 
AGAAGG ACACCAGG CCCATG AGCAGG AGCAAACATGGTGGCTG GCTCCTG AGAGAGCA 
CCAGGCGGGCTTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAAG 
GAGAAAAAGCCATGTATGTCTGGAGGCAGAAGGATGACTCTCAGAGACGACCAACCTG 
CAAAGCTAGAAAAGGAGCCCAAGACGAGGCCAGAAGAGAACAAG C CAG AGCGG CCCAG 
CGGTCGGAAGCCACGGCCCATGGGCATCATTGCCGCCAATGTGGAAAAGC^TTATGAG 
ACTGGCCGGGTCATTGGGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGCG 
AGACCAGGCAGG CCTATGCGATGAAG ATCATTGACAAGTCCAGACT CAAGGGCAAGG A 
GGACATGGTGGAC^GTGAGATCTTGATCATCCAGAGCCTCTCTCACCCCAACATCGTG 
AAATTGCATGAAGTCTACGAAACAGACATGGAAATCTACCTGATCCTGGAGTACGTGC 
AGGGAGGAGACCTTTTTG ACG C CATCATAG AAAGTGTG AAGTTCCCGG AGC CCGATGC 
TGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCCACATGCACGACAAGAGCATT 
GTCCACCGGG ACCTCAAG CCGG AAAACCTTTTGGTTCAG CG AAATGAGGACAAATCT A 
CTACCTTG AAATTGG CTG ATTT TGGACTTG C AAAG CATGTGGTGAGACCTAT ATTTAC 
TGTGTGTGGG AC CCCAACTTACGT AG CTCCCGAAATT CTTTCTGAGAAAG GTT ATGGA 




CCCCATTCCGCAGCCCTGAGAGGGACCAGGACGAGCTCTTTAACATCATCCAGCTGGG 
CCACTTTGAGTTCCTCCCCCCTTACTGGGACAATATCTCTGATGCTGCTAAAGATCTG 
GTGAGCCGGTTGCTGGTGGTAGACCCCAAAAAGCGCTACACAGCTCATCAGGTTCTTC 
AGCACCCCTGGATCGAAACAGCTGGCAAGACCAATACAGTGAAACGACAGAAGCAGGT 
GTCCCCCAGCAGCGAGGGTCACTTCCGGAGCCAGCACAAGAGGGTTGTGGAGCAGGTA 
TCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTCTGCTCAAGGACAGAGAAA 
AGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCTTCTTCACA 




ORF Start: ATG at 90 


ORF Stop: TAG at 2034 




SEQIDNO.60 


648 aa 


MWat738I3.6kD 


NOV21e, 
CG57109-05 Protein Sequence 


MGKEPLTIJCSIQVAVEELyPNKARALTI^QHSRAPSPRLRSRLFSKALKGDHRCGETE 
TPKSCSEVAGCKAAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEE 
RHARGEKHLGVE IE KTSGE I IRCEKCKRERELQQSLERERLSLGTSELDMGKGPMYDV 
EKLVRTRSCRRS PEANPASGEEGWKGDS HRS S P RNP TQELRRPSKSMDKKEDRGPEDQ 
ESHAQGAAKAKKDLVEVLPVTEEGLREVKKDTRPMSRSKHGGWLLRE 
RGEEKEAEKEKKPCMSGGRRm'IJUDDQPAKLEKEPKTRPEE^KPERPSGRKPRPMGI I 
AANVEKHYETGRVIGDGNFAWKECRHRETRQAYAMKI IDKSRLKGKEDMVDSEILII 
QSLSHPNIVKLHEVYETOMEiyiilLEYVQGGDLFDAI IESVKFPEPDAALMIMDLCKA 
LVHMHDKS IVHRDLKPENLLVQRNEDKSTTLKLADFGIAKHVVRPI FTVCGTPTYVAP 
EII^EKGY^GLEVDMWAAGVILYIIXCGFPPFl^PERDQDELFNIIQLGHFEFLPPyWD 
NISDAAKDLVSRLLVTOPKKRYTAHQVLQHPWIE™ 
QHKRWEQVS' 
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SEQ ID NO: 61 


2720 bp 


NOV21f, 
CG57109-06 DNA Sequence 

1 
l 

s 

1 

i 
i 

i 


GGACACTGACATGGACTCAAGGAGTAGAAAAGAAGCCTTGGGCrrCTCCCAGATGGAA^ 

AATGACCGTGTGAGGAAACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCTG 

ATTTCTTCAGGGAAGGGGATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTG 

GAGCATTCAGGTGGCTGTAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTG 

GCCCAGCACAGCCGTGCCCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCTC 

TGAAAGGAGACCACCGCTGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGC 

AGGATGCAAGGCAGCTATGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGAT 

GACAGAGCGAGGACCCAGAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTA 

GCAAGCCCCCCAGGGAAGCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATCT 

TGGGGTGGAGATTGAAAAGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGA 

GAGAGGGAGCTTCAGCAGAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAGC 

TGGATATGGGGAAGGGCCCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCTG 

CAGGAGGTCTCCCGAGGCAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAGC 

CAC^GGAGCAGCCCCAGGAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATGG 

ACAAGAAAGAGGACAGAGGCCCAGAGGATCAAGAAAGCCATGCTCAGGGAGCAGCCAA 


1 

I 

J 

i 
i 

i 
1 


AAGAAGGACACCAGGCCCATGAGCAGGAGCAAACATGGTGGCTGGCTCCTGAGAGAGC 
ACCAGGCGGGCTTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAA 
GG AG AAAAAG CCATGTATGTCTGGAGG CAGAAGGATGACTCTCAGAG ATGACCAACCT 
G CAAAG CTAGAAAAGGAGCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCC CA 
GCGGTCGGAAGCCACGGCCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATGA 
GACTGGCCGGGTCATTGGGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGC 
GAGACCAGGCAGGCCTATGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGG 
AGGACATGGTGGACAGTGAGATCTTGCATGAAGTCTACGAAACAGACATGGAAATCTA 
CCTGATCCTGGAGTACGTGCAGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTG 
AAGTTCCCGGAGCCCGATGCTGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCC 
ACATGCACGACAAGAGCATTGTCCACCGGGACCTCAAGCCGGAAAACCTTTTGGTTCA 
GCG AAATGAGGACAAATCTACTACCTTG AAATTGG CTGATTTTG GACTTGCAAAG CAT 
GTGGTGAGACCTATATTTACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAAATTC 
TTTCTGAGAAAGGTAAGTGTTACACATCGATCTGTGGGACTCTAGTTCCCTTACTAAC 
AAATGTTTCATTTGTCATATTTACTAGTTTTCAATATGGAATAAATCTCAGAGAACTG 
ACACTTAGGCTTGGATTTGGACTTCAATGAAAATATTTGAAGTAGGCTTGACCAAAGC 
ATGAGAGCTTTCTCCTCATTAGGGCTGCCCTTGTTACAGTCAATGGATCAGTGTGTGT 




TGATGGAGAATTGGGTGGGGAACTCAAGCAAGCAAGG 
CTAATAGGAGAGAATTGAAGCAGTCCTTATGGTACTTGGTTTAAAA^ 
ACCTTGCATTTAAAGGAAAAGGATCCCATTTTCCTCCATGAACTCTATGAATATTTAT 
TACCTACTTGTATATTATGCAAGGATCCT^TGAGCTGTTTTAGTGACAAACTTTCTAA 
AACATTT AAAAAGG AAATAATAG TTATGATATGG CTCTAAATATATG AATGACTATTT 
G ACTACTGTGG CACTCCAG GAGAAACCAATTTACC CAC CATTG GTAAG ATGGGAAGAC 
TCTCATTGGGTTGCAGGGTTGGTGACAGGGAGAAGG C 
AGCAG AAG CATCAGGATAATTAAGATG AG G AGATGG C C AGGG ATGGTG G CTCATG CCT 
GTAATCCCAGCTCTTTGGGAGGCTGAGGCAGGTGGATC^ 

AACCATCCTGGCCAACATGGTGAAACCCTGT CTGTACTAAAACTACAAAAA C 
TGGGCGTGGTGGCACATGCCTGTAATCTGAGCTACTCGGGAGGCTGAGGCAGGAGAAT 
TGCTTGAACCGGGGAGGTGGAGTTTGCAGTGAGCCAAGATCGTGCCATTGCACTCCAG 
CCTGGGCAACAAGAGTGAAACTTCATCTCAAAAAAAAAAAAAAAAAAAAAAC ! 




ORF Start: ATG at 149 


ORF Stop: TGAat 1826 




SEQ ID NO: 62 


559 aa 


MWat63436.9kD 


NOV21f, 
CG57 109-06 Protein Sequence 


MGKEPLTLKSIQVAVEELYPNKARALTLAQHSRAPSPRLRSRLFSKALKGDHRCGETE 
TPKSCSEVAGCKAAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEE 
RHARGE KHLGVE I E KTSG E 1 1 RCEKCKRERE LQQS LERERL SLGTS ELDMGKGPMYDV 
EKLVRTRSCRRSPEANPASGEEGWKGDSHRSSPRNPTQELRRPSKS^KKEDRGPEDQ 
ESHAQGAAKAKKDLVEVLPVTEEGLREVKKDTRPMSRS 

RGEEKEAEKEKKPCMSGGRR^^^RDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGII 
AANVEKHYETGRVI GDGNFAWKE CRHRETRQAYAMKI I DKSRLKGKEDMVDS E I LHE 
VYETDMEr^ILEYVC^GDLFDAIIESVKFPEPDAAIWIMDLCKALVHMHDKSIVHRD 
LKPENLLVQRNEDKSTTLKLADFGLAKHVVRPI FTVCGTPTYVAPEILSEKGKCYTS I 
CGTLVPLLTNVSFVI FTS FQYGINLRELTLRLGFGLQ 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 2 IB. 
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Table 21B. Comparison of NOV21a against NOV21b through NOV21f. 


rroiein sequence 


NOV2 la Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV21b 


1..648 
I88..836 


608/649 (93%) 
608/649 (93%) 


NOV2Ic 

i 


1..648 
221. .855 


595/648 (91%) 
595/648 (91%) 


NOV21d 


300..648 
39S..744 


349/350 (99%) 
349/350 (99%) 


NOV21e 


1..648 
1..648 


608/648 (93%) 
608/648 (93%) 


NOV21f 


1. .529 
1..516 


476/529 (89%) 
476/529 (89%) 



Further analysis of the NOV21a protein yielded the following properties shown in 
Table 21 C. 



Table 21C. Protein Sequence Properties NOV21a 


PSort analysis: 


0.7000 probability located in plasma membrane; 0.3000 probability located in 
microbody (peroxisome); 0.3000 probability located in nucleus; 0.2000 probability 
located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV21a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 2 1 D. 
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Table 21D. Geneseq Results for NOV21a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV2 la Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAY42696 


Rat serine-threonine protein 
isjiiabc r 1x01/ bccjucncc tvanus 
norvegicus, 733 aa. 
[WO9950395-A1, 07-OCT- 
1999] 


1..648 

Vo../Ji 


495/649 (76%) 
535/649 (82%) 


0.0 




iNovei protein Kinase, oiiv^ it-' 
NO: 148 - Mus musculus, 806 
aa. [WO200073469-A2, 07- 
DEC-2000] 


1 ..647 
172..805 


472/648 (72%) 
521/648 (79%) 


0.0 


AAY42697 


Mouse serine-threonine protein 

kinase PK80 seauence - Mm 
musculus, 732 aa. [WO9950395- 
A1,07-OCT-1999] 


1..647 

98 7^1 

70.. rj 1 


472/648 (72%) 


0.0 


AAB65621 


Novel protein kinase, SEQ ID 
NO: 147 - Homo sapiens, 686 
aa. [WO200073469-A2, 07- 
DEC-2000] 


276..648 
313..686 


370/374 (98%) 
371/374 (98%) 


0.0 


AAY54579 


A rat calcium/calmodulin 
dependent protein kinase 
designated CaMK-VI - Rattus 
sp, 422 aa. [EP978562-A1, 09- 
FEB-2000] 


348..641 
75..367 


154/294 (52%) 
212/294 (71%) 


3e-87 



In a BLAST search of public sequence datbases, the NOV21a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 2 IE. 
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Table 21E. Public BLASTP Results for NOV21a 


Protein 
Accession 
Number 


Protein/Organism/Length 


iNUvz ia 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9C098 


KIAA1765 PROTEIN - Homo sapiens 
(Human), 608 aa (fragment). 


41. .648 
1..608 


608/608(100%) 
608/608(100%) 


0.0 


01 5075 


Serine/threonine-protein kinase 
DCAMKL 1 (EC 2.7. 1 .-) (Doublecortin- 
like and CAM kinase-like 1) - Homo 
sapiens (Human), 740 aa. 


348..641 
382..674 


155/294(52%) 
212/294 (71%) 


2e-87 


Q9JLM6 


CPG 1 6 - Mus musciililQ Tl\/fnii<;p^ A*\^ an 


•245? £/t i 

75.367 


1 34/294 (52%) 
212/294(71%) 


7e-87 


008875 


Serine/threonine-protein kinase 
DCAMKL 1 (EC 2.7. 1 .-) (Doublecortin- 

like and CAM kinase-like 1) 
(Calcium/calmodulin-dependent protein 
kinase type Hike CPG 1 6) - Rattus 
norvegicus (Rat), 433 aa. 


348..641 
7S..367 * 


154/294 (52%) 
212/294 (71%) 


7e-87 


Q9JLM8 

! 

i i 


Serine/threonine-protein kinase 
DCAMKL 1 (EC 2.7. 1.-) (Doublecortin- 
like and CAM kinase-like 1) - Mus 
musculus (Mouse), 756 aa. 


348..641 
398..690 


154/294 (52%) 
212/294 (71%) 


7e-87 



PFam analysis predicts that the NOV21a protein contains the domains shown in 
the Table 2 IF. 



Table 21F. Domain Analysis of NOV21a 


Pfam Domain 


NOV21a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Pkinase 


356..613 


107/297 (36%) 
219/297 (74%) 


2.3e-101 



Example 22. 



The NOV22 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 22A. 
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Table 22A. NOV22 Sequence Analysis 



SEQ ID NO: 63 



3553 bp 



NOV22a, 
CG57366-01 DNA Sequence 



TACATTCTTCATCGTTTAGATCAAGAAGAAGCTTTGCGACAACACCTCACAAAAGAAA 
CTG^AGAGATGTTAAATCMCTGCACATTAAAAGCAGTGGATGCTTTCrrTTACCTAGA 



i 



ACGAGTTTTAGATGGAGTTGTAGAAAATTTTATTATGTTAAGAGAAATTCGTGACATC 
CCAGGAACTCTAAATGGTTTATATCTCTGGCTGTGCCAAAGACTTTTTGTAAGAAAAC 
AATTTGCAAAGGTTCAGCCTATTTTGAATGTGATTCTTGCAGCCTGCCGACCTTTGAC 
CATAACGGAATTATATCACGCAGTATGGACCAAAAACATGTCGTTAACTTTGGAAGAT 
TTT CAACGC AAGTTAGAT ATCCTCTC CAAACTTCTTGTTG ATGGACTAGG AAATACAA 
AAATACTGTTTCATTATAGTTTTGCCGAGTGGCTTCTGGATGTGAAACACTGTACTCA 
GAAGTATTTATGTAATGCAGCAGAAGGACACAGAATGTTGGCTATGAGTTATACCTGT 
CM.GCCAAGAATTTAACACCATTGGAAGCACAAGAATTTGCATTGCACTTAATTAACT 
CAAACTTACAATTAGAGACAGCGGAGTTAGCTCTGTGGATGATATGGAATGGTACACC 
TGTCAGAGATTCCCITT CTACTTTGATACCCAAGGAACAAGAAGTG CTACAGCTGTTG 
GTTAAAGCTGGGGCTCATGTCAACAGTGAAGACGATCGCACATCATGCATAGTTCGAC 
AAGCCTTAGAAAGAGAGGATTCCATTCGGACATTATTAGATAATGGAGCTTCAGTAAA 
TCAGTGTGATTCAAATGGGAGAACATTATTGGCTAATGCTGCATATAGTGGCAGTCTT 
GATGTAGTCAATTTACTTGTCTCTAGGGGAGCAGATTTAGAGATAGAAGATGCTCATG 
GACATACACCACTCACTCTAGCGGCTAGACA^ 

GATTGGGTGTGGAGCAAATATTAATCATACTGATCAAGATGGTTGGACAGCATTAAGA 
TCTGCTGCTTGGGGTGGCCATACTGAGGTAGTTTCTGCACrACTTTATGCTGGCGTAA 
AAGTGGATTGTGCAGATGCTGATAGCCGAACAG CTTTG AGAGCAG CAGCATGGGGAGG 
ACACGAGGATATTGTACTGAATTTGCTACAACATGGCGCTGAAGTGAACAAAGCTGAT 
AATGAAGGTAGAACTGCTTTGATAGCAGCAGCATACATGGGACATAGAGAGATTGTGG 
AACACCTACTGGACCATGGAGCAGAAGTAAATCATGAGGATGTTGATGGCAGGACTGC 
ACTCTCTGTAGCTGCACTTTGTGTGCCTGCAAGTAAAGGGCACGCATCAGTTGTTAGC 
CTTTTAATTGATCGAGGTGCTGAAGTAGATCATTGTGATAAAGATGGCATGACTCCAC 
TGCTGGTAG CTGCCTATGAAGGACATGTTGATGTGGTTGACTTG CTTCTAGAAGGGGG 
AGCAGATGTAGATCACACAGATAACAATGGCCGTACACCCCTCTTAGCAGCAGCGTCT 
ATGGGTCATGCATCAGTTGTAAATACACTTTTGTTTTGGGGTGCAGCTGTGGATAGTA 
TTGATAGTGAAGGTAGGACAGTCCTCAGTATAGCTTCAGCACAAGGAAATGTTGAGGT 
GGTACGTACTCTACTGGATAGAGGGTTAGATGAAAATCACAGAGATGATGCTGGATGG 
ACACCTTTGCACATGG CAGCTTTTGAAGGG CACAGATTG ATATGTG AAGCACTTATTG 
AACAAGGTG CTAGAAC AAATG AG ATTG ACAATG ATGGACG AATC C CT TTCATATTAG C 
TTCA.CAAGAGGGTCATTATGATTGTGTTCAAATATTACTGGAAAACAAATCCAACATT 
GATCAAAGAGGTTATGATGGAAGAAATGCACTGCGGGTTGCTGCATTAGAAGGGCACA 
GGGACATTGTTGAATTGCTTTTTAGCCATGGTGCTGATGTTAACTGCAAAGATGCTGA 
TGGTCGGCCTACACTTTATATCTTGGCCTTAGAAAATCAGCTTACAATGGCCGAATAT 
TTTTTAGAAAATGGTG CAAACGTAGAAG CAAGTGATG CTGAAGGAAGGACAGCACTTC 

TGACGTCAATGCTGCAGACAATGAAAAGCGCTCTGCTTTGCAGTCTGCAGCCT 

GGCCATGTAAAAGTGGTTCAGCTTCTGATTGAGCATGGTGCTGTAGTTGACCATACAT 

GTAACCAAGGTGCAACTGCACTCTGTATTGCAGCCCAGGAAGGGCACATTGATGTTGT 

T CAGGT C T T ATTAGAG CATGG TG CTG ATC CAAACC ATG CTGAT CAATTTGG ACG CACT 

G CTATG CGTGTTG CAG CCAAAAATGG AC ATT CTCAGATAATTAAATTATTAGAAAAAT 

ATGGTGCATCTAGITTGAATGGCTGTTCCCCATCTCCTGTTC^CACAATGGAGCAAAA 

ACCTCTACAGTCATTGTCTTCAAAAGTGCAGTCATTAACAATTAAATCAAATAGCTCT 

GGTAGTACTGGTGGAGGGGATATGCAGCCTTCGTTACGTGGTTTACCTAATGGGCCTA 

CTCATGCTTTTAGTTCTCCTTCAGAATCTCCAGATTCTACAGTTGACCGGCAGAAGTC 

ATCACTGTCAAATAATTCCCTGAAAAGCTCAAAAA^ 

TCTACAGCAACGG CTCAAACAGTG C CAATTGATAGCTTT CATAACTTGTCATTTACAG 
AACAAATTCAG CAGCATTCATTG CCACG CAG TAGAAGT CG ACAGT CAATTG TTTCCCC 
ATCTTCCACAACACAGTCCHTAGGACAGAGTCATAATTCACCAAGT^ 
TGGAGTCAAGTAAAG CCCAGTTTGAAGTCAACTAAAG CAAGTAAAGGGGGGAAATCAG 
AAAATTCTG CCAAGTCTGG ATCAG CTGGGAAAAAAGCGAAACAAAGTAATTCTTCACA 
GCCAAAGGTTTTAGAATATGAAATGACTCAGTTTCATAGAAGAGGACCTATAGCCaAA 
TCCGGGACTG CTGCACCACCTAAACAAATGCCAGCAGAATCTCAATG CAAAATTATGA 
TACCTTCAGCTCAGCIAGGAAATTGGTCGATCTC^CAGCAGTTTCT^ 
AAGTGGGGAACAGAAGAAGAGAAATGGAATAATGACAAATCCAAATTATCATCTTCAG 
AGCJ^CCAGGTTTTTCTTGGTAGGGTTTCAGTCCC^ 

ATCAGGAAGTGTTGGAGGGATACCCTTCCTCAGAGACIAGAATTAAGCCTTAAACAAGC 
TCTG AAG CTT CAGATTG AAGGTTCTG ACCCTAG CTTCAACT ATAAAAAG GAAACACCA 
TTATAAAAGGTAATATTTTGTCAACATAAAGAGTAAAAAACAAAAGTAGAGTTCCCTC 
TACTTGGAATG CTCT 
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ORF Start: ATG at 67 


ORF Stop: TAA at 3484 




SEQ ID NO: 64 


1139 aa 


MWat 123942.1kD 


NOV22a, 
CG57366-01 Protein Sequence 


MmQLHIKSSGCFLYLERVLDGVVENFIMLREIRDIPGTIaNGLYLWLCQRLFVRKQFA 
KVQPILNVI LAACRPLT ITELYHAVWTKNMSLTLEDFQRKLDIIiSKLLVDGLGNTKIL 
FHYSFAEWLLDVK^CTQK^LCNAAEGHRMI^SYTCQAKNLTPLEAQEFALHLINS^ 
QLETAELALWMIWNGTPVRDSLSTLIPKEQEVLQLLVKAGAHVNSEDDRTSCIVRQAL 
EREDSIRTLLDNGASVNQCDSNGRTLLANAAYSGSLDWNLLVSRGADLEIEDAHGHT 
PLTLAARQGHTKWNCL I G CG ANI NHTDQDGWTALRS AAWGGHTEVVS ALLYAG VKVD 
CADADS RTALRAAAWGGHED I VLNLLQHGAEVNKADNEGRTAL I AAAYMGHRE I VEHL 
LDHGAEVNHEDVDGRTALSVAALCVPASKGHASWSLLIDRGAEVDHCDKDGMTPLLV 
AAYEGHVDWDLLLEGGADVDHTDNNGRTPLLAAASMGHASWNTLLFWGAAVDSIDS 
EGRTVLS I AS AQGNVEVVRTLLDRGLDENHRDDAGWTPLHMAAFEGHRLI CEALI EQG 
ARTNE I DNDGR I PF I LAS QEGHYDCV Q I LLENKSN I DQRGYDGRN ALRVAAL EGHRD I 
VELLFSHGADVNCKDADGRPTLYILALENQLTMAEYFLENGANVEASDAEGRTALHVS 
CWQGHM EMVQ VL I AYHADVNAADNEKRS ALQS AAWQGHVKWQLL I EHGA\A/DHTCNQ 
GATALC I AAQEGH I DWQVLLEHGADPNHADQFGRTAMRVAAKNGHS QI I KLLEKYGA 
SSLNGCSPSPVHTMEQKPLQSLSSKVQSLTIKSNSSGSTGGGDMQPSIiRGLPNGPTHA 
FSSPSESPDSTVDRQKSSLSNNSLKSSKNSSLRTTSSTATAQTVPIDSFHNLSFTEQI 
QQH SLPRSRSRQSIVSPS STTQSLGQSHNS PS S EFEWSQVKPSLKSTKASKGGKS ENS 
AKSGSAGKKAKQSNSSQPKVLEYEMTQFDRRGPIAKSGTAAPPKQMPAESQCKIMIPS 
AQQE IGRSQQQFLI HQQSGEQKKRNGIMTNPNYHLQSNQVFLGRVSVPRTMQDRGHQE 
VLEGYPSSETELSLKQALKLQI EGSDPS FNYKKETPL 



Further analysis of the NOV22a protein yielded the following properties shown in 
Table 22B. 



r 



Table 22B. Protein Sequence Properties NOV22a 



PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV22a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 22C. 
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Table 22C. Geneseq Results for NOV22a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV22a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU20535 


Human secreted protein, Seq ID No 

527 - Homo sapiens, 447 aa. 
[WO200155326-A2, 02-AUG-2001] 


699.. 1139 
7..447 


439/441 (99%) 
440/441 (99%) 


0.0 


ABB64823 


Drosophila melanogaster polypeptide 
SEQ ID NO 21261 - Drosophila 
melanogaster, 2 1 1 9 aa. 
[ WO200 1 7 1 042-A2, 27-SEP-200 1 ] ' 


2.. 1079 
885..2074 


413/1213(34%) 
621/1213(51%) 


e-177 


AAU25462 


Human mddt protein from clone 
LI:334386.1:2000MAY01 - Homo 
sapiens, 256 aa. [WO2001 62922- A2, 
30-AUG-2001] 


466..702 
2..238 


231/237(97%) 
233/237 (97%) 


e-132 


AAU20651 


Human secreted protein, Seq ID No 

643 - Homo sapiens, 1 82 aa. 
[WO200155326-A2, 02-AUG-2001] 


661..818 
1..158 


156/158 (98%) 
156/158 (98%) 


7e-86 


ABB67412 


Drosophila melanogaster polypeptide 
SEQ ID NO 29028 - Drosophila 
melanogaster, I549aa. 
[ WO200 1 7 1 042- A2, 27-SEP-200 1 ] 


236..813 
2I5..785 


196/578(33%) 
308/578 (52%) 


2e-83 



In a BLAST search of public sequence datbases, the NOV22a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 22D. 
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Table 22D. Public BLASTP Results for NOV22a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV22a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


O01 \\ 17 


tiypotneticai protein K1AAI22j - 
Homo sapiens (Human), 768 aa 
(fragment). 


372..1 1 j9 
1..768 


768/768 
(100%) 
768/768 
(100%) 


0.0 




KIAA122^ PROTEIN - Homo 
sapiens (Human), 743 aa. 


397.. 1 139 
1..743 


743/743 
(100%) 
743/743 
(100%) 


0.0 


AAL89945 


SD03956P - Diwinnhila 
melanogaster (Fruit fly), 1282 aa. 


9 1070 

48..1237 


A 1 1/1 *) 1 *J 
** 1 0/ 1 Z 1 J 

(34%) 
621/1213 
(51%) 


e-l / / 


AAL39916 

! 

i 


SD01389P-Drosophila 
melanogaster (Fruit fly), 21 19 aa. 


2.. 1079 
885..2074 


413/1213 
(34%) 

621/1213 
(51%) 


e-l 77 


j Q9VAU5 


CG 1 00 1 1 PROTEIN - Drosophila 
melanogaster (Fruit fly), 21 19 aa. 


2.. 1079 
885..2074 


413/1213 
(34%) 

621/1213 
(51%) 


e-l 77 



PFam analysis predicts that the NOV22a protein contains the domains shown in 
the Table 22E. 
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Table 22E. Domain Analysis of NOV22a 


Pfom Domain 


NOV22a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


j Ank 

i 


254..286 


16/33(48%) 
26/33 (79%) 


2.4e-06 


| Ank 


287..3I9 


17/33 (52%) 
27/33 (82%) 


9.1e-09 


Ank 


320..352 


17/33(52%) 
23/33 (70%) 


7.1e-06 


Ank 


353.385 


16/33 (48%) 
23/33 (70%) 


2.6e-06 


Ank 


386..418 


17/33 (52%) 
26/33 (79%) 


4.5e-09 


Ank 


4I9..456 


17/38(45%) 
31/38(82%) 


2.9e-05 


Ank 


4S7..489 


20/33(61%) 
30/33 (91%) 


1.4e-10 


| Ank 

{ 


490..522 


17/33 (52%) 
22/33 (67%) 


6.2e-06 


Ank 


523.555 


16/33(48%) 
27/33 (82%) 


3.9e-06 


Ank 


556..588 


15/33(45%) 

24/33 (73%) 


9e-08 


Ank 


589.-62 1 


13/33 (39%) 
26/33 (79%) 


7.3e-06 


Ank 


622..654 


17/33 (52%) 
26/33 (79%) 


3e-08 


Ank 


655„687 


13/33 (39%) 
23/33 (70%) 


0.036 


Ank 


688..720 


16/33 (48%) 
27/33 (82%) 


8.4e-10 


Ank 


721. .753 


14/33 (42%) 
26/33 (79%) 


2e-05 


Ank 


754..786 


17/33 (52%) 
29/33 (88%) 


2.1e-10 



Example 23. 



The NOV23 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 23 A. 
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Table 23A. NOV23 Sequence Analysis 



SEQ ID NO: 65 



3321 bp 



NOV23a, 
CG57368-01 DNA Sequence 



TAACCCGGCGCCCGGCCCTAGCCAGCCCCGGGGCTCGGGGCTGGGGAGATCATGGCCC 
GCCTCTTCAG CCCCCGG CCGCCCCCCAGCGAAGACCTCTTCTACGAGACCTACTACAG 
CCTGAGCCAGCAGTACCCGCTGCTGCTGCTGCTGCTGGGGATCGTGCTCTGTGCGCTC 
GCGGCGCTGCTCG CAGTGGCCTGGGCCAG CGGCAGGGAGCTGACCTCAGACCCGAGCT 
TCCTGACCACTGTGCTGTGCGCGCTGGGCGGCrTCTCGCTGCTGCTGGGCCTCGCTTC 
CCGGG AGCAGCGACTG CAGCGCTGG ACGCGTCCCCTGTCCGGCTTGGT ATGGGTCG CG 
CTGCTAGCGCTAGGCCACGCCTTCCTGTTCACCGGGGGCGTGGTGAGCGCCTGGGACC 
AGGTGTCCTATTTTCTCTTCGTCATCTTCACGGCGTATGCCATGCTGCCCTTGGGCAT 
GCGGGACGCCGCCGTCGCGGGCCTCGCCTCCTCACTCTCGCATCTGCTGGTCCTCGGG 
CTGTATCTTGGGCCACAGCCX3G ACTCACGGCCTGCACTG CTGCCG CAGTTGGCAGCAA 
ACGCAGTGCTGTTCCTGTGCGGGAACGTGGCAGGAGTGTACCACAAGGCGCTGATGGA 
GCGCGCCCTGCGGGCCACGTTCCGGGAGGCACTCAGCTCCCTGCACTCACGCCGGCGG 
CTGGACACCGAGAAGAAGCACCAGGAACACCTTCTCTTGTCCATCCTTCCTGCCTACC 
TGGCCCGAGAGATGAAGGCAGAGATCATGGCACGGCTGCAGGCAGGACAGGGGTCACG 
GCCAGAGAGCACTAACAATTTCCACAGCCTCTATGTCAAGAGGCACCAGGGAGTCAGC 
GTGCTGTATGCTGACATCGTGGGCTTCACGCGGCTGGCCAGCGAGTGTTCCCCTAAGG 
AGCTGGTGCTCATGCTCAATGAGCTCTTTGGCAAGTTCGACCAGATTGCCAAGGAGCA 
TGAATGCATGCGGATCAAGATCCTGGGGGACTGTTACTACTGTGTCTCTGGGCTGCCA 
CTCTCACTGCCAGACCATGCCATCAACTGCGTGCGCATGGGCCTGGACATGTGCCGGG 
CCATCAGGAAACTGCGGGCAGCCACTGGCGTGGACATCAACATGCGTGTGGGCGTGCA 
CTCAGGCAGCGTACTGTGTGG AGTCAT CGGG CTGCAGAAGTGG CAGT ACGACGTTTGG 
TCACATGATGTCACACTGGCTAACCACATGGAGGCAGGCGGTGTACCAGGGCGAGTGC 
ACATCACAGGGGCTACCCTGGCCCTGCTGGCAGGGGCTTATGCTGTGGAGGACGCAGG 
CATGGAGCATCGGGACCCCTACCTTCGGGAGCTAGGGGAGCCTACCTATCTGGTCATC 
GATCCACGGGC AGAGG AGGAGG ATGAGAAGGGCACTGCAGGAGG CTTG CTGTCCTCGC 
TTGAGGG CCTC AAGATGCGTCC AT CACTGCTGATG ACCCGTTACCTGGAGTCCTGGGG 
CGCAGCCAAGCCTTTTGCCCACCTGAGCCACGGAGACAGCCCTGTGTCCACCTCCACC 
CCTCTCCCGTGGCTCTCACTGTGCAGGAGCCGTACCCCCCGGGGACTAGATGATGAAC 
TG G ACACCGGG G ATG C C AAGTT CTTC CAGGT CATTG AG CAG CT CAACT CG C AG AAACA 
GTGGAAGCAGTCGAAGGACTTCAACCCACTGACACTGTACTTCAGAGAGAAGGAGATG 
GAGAAAGAGTACCGACTCTCTGCAATCCCCGCCTTCAAATACTATGAAGCCTGCACCT 
TCCTGGTTTTTCTCTCCAACTTCATCATCCAGATGCTAGTGACAAACAGGCCCCCAGC 
TCTGGCCATCACGTATAGCATCACCTTCCTCCTCTTCCTCCTCATCCTTTTTGTCTGC 
TTCTCAGAGG ACCTG ATG AGGTGTGTCCTGAAAGGCCCCAAGATGCTG CACTGG CTGC 
CTGCACTGTCTGGCCTGGTGGCCACACGACCAGGACTCAGAATAGCCTTGGGCACCGC 
CACCATCCTCCTITGTCTTTGCCATGGCCATTACCAGCCTGTTCTTCTTCCCAACATCA 
TCAGACTGCCOTTTCCAAGCTCCCAATGTGTCCTCCATGATTTCCAACCTCTCCTGGG 
AGCTCCCTGGGTCTCTGCCTCrCATC^GTGTCCCATACTCCATGCACTGCTGCACGCT 

CTCCTGCTGTGGCTGGCGGCATCCTGCTCCCTCTTCCTGCACTCCCATGCCTGGCTGT 
CGGAATGCCTCATCGTCCGCCTCTATCTGGGCCCCTTGGACTCCAGGCCCGGAGTGCT 



TGAGGCAGGAGAGGGAGGAGACAGAGACGATGGAGAACCTGACTCGGCTGCTCTTGGA 
GAACGTGCTCCCTGCACACGTGGCCCCCCAGTTCATTGGCCAGAACCGGCGCAACGAG 
GATCrCTACCACCAGTCCTATGAATCCGTTTGTGTCCTCTTCGCCTCAGTCCCAGACT 
TCAAGG AGTT CTACTCTG AATCCAACATCAATCATGAGGG C CTAG AGTGT CTGAGG CT 
GCTC AATGAG ATAATTG CTG ATTTTG ATG AG CTG CT CTC CAAG CCCAAGTTCAGTGGG 
GTGG AG AAGATCAAGACCAT CGG CAG C AC CTACATGG CAG C CACAGG CTTAAATG CCA 
CCTCTGGACAGGATGCACAACAGGATGCTGAACGGAGCTG CAG CCACCTTGG CACTAT 
GGTGGAATTTGCCX3TGGCCCTGGGGTCTAAGCTGGACGTCATCAACAAGCATTCATTC 
AACAACTTCCGCCTGCGAGTGGGGTTGAACCATGGACCCGTAGTAGCTGGAGTTATTG 
GGGCCCAGAAGCCGCAATATGACATTTC^GGCAACACAGTGAACGTGGCCAGCCGCAT 
GGAGAGTACAGGAGTCCTTGGCAAAATCCIAAGTGACrGAGGAGACAGCATG 
C^GTCCCrGGGCTAC^CCTGCTACAGCCGGGGTGTCATCAAGGTGAAAGGCAAAGGGC 
AGCTCTGCACCTACTTCCTGAACACAGACTTGACACGAACTGGACCTCCTTCAG 
CCTAGG CTGAG ATTG CACT CG C CTTCTj 
TGGAGCCCATTGATG 
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ORF Start: ATG at 52 


ORF Stop: TGA at 3256 




SEQ ID NO: 66 


1068 aa 


MWat I18800.7kD 


NOV23a, 
CG57368-01 Protein Sequence 

j 

i 
j 

j 

i 

i 

j 


MARLFSPRPPPSEDLFYETYYSLSQQYPLLLLLI^IVLCAIJVALIAVAWASGRELTSD 
PS FLTTVLCALGGFS LLLGLASREQRLQRWTRPLSGLVWVALLALGHAFLFTGGVVS A 
WDQ VS YFL FVI FTAYAMLPLGMRDAAVAGLAS SLSHLLVLGLYLG PQPDSRP ALLPQL 
AANAVL FLCGNVAGVYHKALMERALRATFRE ALSSLHS RRRLDTEKKHQEHLLLS ILP 
AYLAREMKAE I MARLQAGQGSRP ESTNNFHSLYVKRHQGVSVL YAD I VG FTRLAS E CS 
PKELVLMLNELFGKFDQ I AKEHE CMR I KI LGDCYYCVS GL PL S L PDHAI NCVRMGLDM 
CRAIRKLRAATGVDINMRVGVHSGSVLCGVIGLQKWQYDWSHDVTIANHMEAGGVPG 
RVH I TG ATLALLAG AYAVEDAGM EHRD P YLRELG EPTYLVI DPRAEEED EKGTAGGLL 
SSLEGLKMRPSLLMTRYLESWGAAKPFAHLSHGDSPVSTSTPLPWLSLCRSRTPRGLD 
DELDTGDAKFFQVIEQLNSQKQWKQSKDFNPLTLYFREKEMEKEYRLSAIPAFKYYEA 
CTFLVFLSNFI IQMLVTNRPPAIiAITYSITFLLFLLILFVCFSEDLMRCVLKGPKMLH 
WLPALSGLVATRPGLRI ALGTAT ILLVFAMAITSLFFFPTSSDCPFQAPNVS SMI SNL 
SWELPGSLPLI SVPYSMHCCTLGFLSCSLFLHMS FELKLLLLLLWLAASCSLFLHSHA 
WLSECLIWLY1/3PLDSRPGVLKEPKLMGAISFFIFFFTLLVLARQNEYYCRLDFLWK 
KKLRQEREETETMENLTRLLLENVL PAHVAPQFI GQNRRNEDLYHQ SYECVCVL FAS V 
PDFKEFYSESNINHEGLECLRLI^EIIADFDELI^KPKFSGVEKIKTIGSTYMAATGL 
NATSGQDAQQD AERS CSHLGTMVEFAVALG S KLDVI NKHS FNNFRLRVGLNHG PWAG 
VI GAQKPQYDI WGNTVNVASRMESTGVLGKI QVT E ETAWALQSLGYTCY SRG VI KVKG 
KGQLCTYFLNTDLTRTG PP S ATLG 


j 


SEQ ID NO: 67 


995 bp 


[ NOV23b, 

j CG57368-02 DNA Sequence 

1 

1 

[ 

I 
\ 

i 


ATCATGGCCCGCCTCTTCAGCCCCCGGCCGCCCCCCAGCGAAGACCrCTTCTACGAGA 
CCTACTAC^GCCTGAGCCAGCAGTACCCGCrGCTGCTGCTGCTGCTGGGGATCGTGCT 
CTGTGCGCTCGCGGCGCTGCTCGCAGTGGCCTGGGCCAGCGGCAGGGAGCTGACCTCA 
GACCCGAGCTTCCTGACCACTGTGCTGTGCGCGCTGGGCGGCTTCTCGCrrGCTGCIGG 
GCCTCGCTTCCCGGGAGCAGCGACTGCAGCX3CTGGACGCGTCCCCTGTCCGGCTTGGT 
ATGGGTCGCGCTGCTAGCGCTAGGCCACGCCTTCCTGTTCACCGGGGGCGTGGTGAGC 
GCCTGGGACCAGGTGTCCTATTTTCTCTTCGTCATCTTCACGGCGTATGCCATGCTGC 
CCTTGGGCTiTGCGGGACGCCGCCGTCGCGGGCCTCGCCTCCTCACTCTCGCATCTGCT 
GGTCCTCGGGCTGTATCTTGGGCCACAGCCGGACTCACGGCCTGCACTGCTGCCGCAG 
TTGGCAGCAAACGCAGTGCTGTTCCTGTGCGGGAACGTGGCAGGAGTGTACCACAAGG 
CGCTGATGGAGCGCGCCCTGCGGGCCACGTTCCGGGAGGCACTCAGCTCCCTGCACTC 
ACGCCGGCGGCTGGACACCGAGAAGAAGCGCCAGGAACACCTTCTCTTGTCCATCCTT 
CCTGCCTACCTGGCCCGAGAGATGAAGGCAGAGATCATGGCACGGCTGCAGGCAGGAC 
AGGGGTCACGGCCAGAGAGCACTAACAATTTCCACAGCCTCTATGTCAAGAGTACAGG 
AGTCCTTGGCAAAATCCAAGTGACTGAGGAGACAGCATGGGCCCTACAGTCCCTGGGC 
TACACCTGCTACAGCCGGGGTGTCATCAAGGTGAAAGGCAAAGGGCAGCTCTGCACCT 
ACTTCCTGAACACAGACTTGACACGAACTGGACCTCCTTCAGCTACCCTAGGCTGAGA 
TTGCACTCG 




ORF Start: ATG at 4 


ORF Stop: TGA at 982 




SEQ ID NO: 68 


326 aa 


MW at 35652 AkD 


NOV23b, 
CG57368-02 Protein Sequence 


MARLFSPRPPPSEDLFYETYYSLSQQYPLLLLLLGIVLCALAALLAVAWASGRELTSD 

PS FLTTVLCALGGFSLLLGIiASREQRLQRV^RPLS GLVWVALLA^ 

WD QVS YFLFVI FTAYAML PLGMRD AAVAGLAS S LS HLLVLGLYLG PQPDSRP ALLPQL 

AANAVLFIjCGNVAGVYHKALMERALRATFREALSSLHSR I LP 

AYLAREMKAEIMARLQAGQGSRPESTNNFHSLYVKSTGVLGKIQVT^ 

TCYSRGVI KVKGKGQLCTYFLNTDLTRTGPPS ATLG 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 23B. 



Table 23B. Comparison of NOV23a against NOV23b. 


Protein Sequence 


NOV23a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV23b 


1..267 
1..267 


208/267 (77%) 
208/267 (77%) 
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Further analysis of the NOV23a protein yielded the following properties shown in 
Table 23C. 



Table 23C. Protein Sequence Properties NOV23a 


PSort analysis: 

i 


0.8000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.3000 
probability located in microbody (peroxisome) 


j SignalP analysis: 


Cleavage site between residues 51 and 52 



A search of the NOV23a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 23D. 



| Table 23D. Geneseq Results for NOV23a 

i 



J 

| Geneseq 
I Identifier 

1 


Protein/Organism/Length [Patent 
#, Date] 


NOV23a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


j AAU01924 

i 


Human adenylate cyclase 
polypeptide - Homo sapiens, 1077 
aa. [WO200125448-AI, I2-APR- 
2001] 


I.. 1068 
1..1077 


1066/1078 (98%) 
1066/1078 (98%) 


0.0 


AAB02008 


Type IV adenylyl cyclase - Homo 
sapiens, 1064 aa. [US6107076-A, 
22-AUG-2000] 


1..1066 
1..1064 


976/1073(90%) 
999/1073 (92%) 


0.0 


AAB02006 


Adenylyl cyclase type II-C2 C2 
alpha domain - Homo sapiens, 
1090 aa. [US6I07076-A,22- 
AUG-2000] 


12..1060 
28..1082 


605/1064 (56%) 
772/1064 (71%) 


0.0 


AAR94560 


Rat adenylyl cyclase - Rattus sp, 
1090 aa. [WO9608260-A1, 21- 
MAR-1996] 


12..1060 
28..1082 


605/1064 (56%) 
772/1064 (71%) 


0.0 


AAE02938 


Human adenylate cyclase 25678 - 
Homo sapiens, 1086 aa. 
[WO200144453-A1, 21-JUN- 
2001] 


10..1060 
22.. 1078 


605/1066 (56%) 
771/1066 (71%) 


0.0 



In a BLAST search of public sequence datbases, the NOV23a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 23E. 
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Table 23E. Public BLASTP Results for NOV23a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV23a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC37757 


^FHI IF1\IPF 1 PPHM P ATPXTT 
oc^uciN^c. Z rKwIVJ rA 1 fcIN 1 

WOO 125448 - Homo sapiens 
(Human), 1077 aa. 


J..IOOO 
1..1077 


1066/1078 
(98%) 

1 066/1078 
(98%) 


0.0 


no i wf"? 

\£y i wrj 


oIIYllLAK 1UAULNYLYL 
CYCLASE 4 (ADENYLYL 
CYCLASE TYPE 4) (EC 4.6. 1.1)- 
Mus musculus (Mouse), 1077 aa. 


1..1066 
1..1077 


988/1078 
(91%) 

1010/1078 
(93%) 


0,0 


P26770 


Adenylate cvclase tvne IV (PC 
4.6. 1.1) (ATP pyrophosphate-lyase) 
(Adenylyl cyclase) - Rattus 
norvegicus (Rat), 1064 aa. 


1 1066 

1 .. J uuu 

1..1064 


076/1 (Ml 
y /o/ iv/j 

(90%) 

999/1073 

(92%) 


n n 


P26769 


Adenylate cyclase, type II (EC 
4.6. 1.1) (ATP pyrophosphate-lyase) 
(Adenylyl cyclase) - Rattus 
norvegicus (Rat), 1 090 aa. 


12.. 1060 
28.. 1082 


605/1064 
(56%) 

772/1064 
(71%) 


0.0 


Q08462 


Adenylate cyclase, type. II (EC 
4.6. 1.1) (ATP pyrophosphate-lyase) 
(Adenylyl cyclase) - Homo sapiens 
(Human), 887 aa (fragment). 


188.. 1060 
1..879 


526/888(59%) 
658/888 (73%) 


0.0 



PFam analysis predicts that the NOV23a protein contains the domains shown in 
the Table 23F. 



Table 23F. Domain Analysis of NOV23a , 


Pfam Domain 


NOV23a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Sodcu 


376.383 


6/8 (75%) 
8/8(100%) 


0.73 


guanylate_cyc 


264..448 


71/226(31%) 
156/226 (69%) 


4.2e-66 


guanylate_cyc 


855..1055 


94/228(41%) 
177/228 (78%) 


2.7e-85 



Example 24. 



The NOV24 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 24A. 
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Table 24 A. NOV24 Sequence Analysis 




SEQ ID NO: 69 


997 bp 


NOV24a, 
CG59955-01 DNA Sequence 


TTCTAATCTGTGGTTTCTTCACATCAACTGAAACAATGCAGCAAAATAACAGTGTGAC 
TGAATTCATACTGTTAGGATTAACACAGGATCCCTTGAGG^GAAARTnr^'rrT'r'rn'rTi 

ATCTTCTTAATTTTCTATATGGGAACTGTGGTGGGGAATATGCTCATTATTGTGACCA 
TCAAGTCCAGCCGGACACTAGGAAGCCCCATGTACTTCTTTCTATTTTATTTGTCCTT 
TGCAGATTCTTGCTTTTCAACTTCCACAGCCCCTAGATTAATTGTGGATGCTCTCTCT 
GAAAAGAAAATTATAACCTACAATGAGTGCATGACACAAGTCTTTGCACTACATTTAT 
TTGG CTG CATGG AG ATCTTTGTCCTCATTCTCATGGCTGTTG ATCGCTATGTGG CCAT 
CTGTAAGCCCTTGCGTTACCCAACCATCATGAGCCAGCAGGTCTGCATCATCCTGATT 
GTTCTTGCCTGGATAGGGTCTTTAATACACTCTACAGCTCAGATTATCCTGGCCTTAA 
GATTGCCTTTCTGTGGACCCTATTTGATTGATCATTATTGCTGTGATTTGCAGCCCTT 
GTTGAAACTTGCCTGCATGGACACTTACATGATCAACCTGCTGTTGGTGTCTAACAGT 
GGGGCAATTTGCTCAAGTAGTTTCATGATTTTGATAATTTCATATATTGTCATCTTGC 
ATTCACTGAGAAACCACAGTGCCAAAGGGAAGAAAAAGGCTCTCTCCGCTTGCACGTC 
TCACATAATTGTAGTCATCTTATTCTTTGGCCCATGTATATTCATATATACACGCCCC 
CCGACCACTT^CCCCATGGACAAGATGGTGGCAGTATTTTATACTATTGGACCACCCT 
TTCTCAATCCACTCATCTACACACTGAGGAATGCAGAAGTGAJUVAATGCCATGAGAAA 
GTTATGGCATGGCAAAATTATTTCAGAAAACAAAGGATAAATTGAGGGCCTGACCTGA 
TTACTTTTTCA 




ORF Start: ATG at 36 


ORF Stop: TAA at 966 




SEQ ID NO: 70 


310 aa MW at 35002.7kD 


NOV24a, 
CG59955-01 Protein Sequence 


MQQNNSVTEFILLGLTQDPLRQKIVFVI FLI FYMGTWGNMLI IVTIKSSRTLGS PMY 
FFLFYLSFADSCFSTSTAPRLIVDALSEKKIITYNECMTQVFALHLFGCMEIFVLILM 
AVDRYVAICKPLRYPTIMSQQVCIILIVLAWIGSLIHSTAQIILALRLPFCGPYLIDH 
YCCDLQPLLKLACMDTYMINLLLVSNSGAICSSSFMILIISYIVILHSLRNHSAKGKK 
KALS ACTSH 1 1 WI LFFG PC I F I YTRPPTTF PMDKMVAVF YT IG P PFLNPLI YTLRNA 
E VKNAMRKLWHGKI I S ENKG 



Further analysis of the NOV24a protein yielded the following properties shown in 
Table 24B. 



Table 24B. Protein Sequence Properties NOV24a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 03000 probability located in endoplasmic reticulum (membrane); 0.0300 
probability located in mitochondrial inner membrane 


SignalP analysis: 


Cleavage site between residues 40 and 41 



A search of the NOV24a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 24C. 
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Table 24C. Geneseq Results for NOV24a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#,Date] 


NOV24a Residues/ 
Match Residues 


Identities/ 
Similarities 
tor the 
Matched 
Region 


Expect 
Value 


AAG72I21 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1802 - 
Homo sapiens, 299 aa. 
[WO200127158-A2, I9-APR- 
2001] 


1..298 
2..299 


296/298 
(99%) 

296/298 
(99%) 


e-I70 


AAU24690 


Human olfactory receptor 
AOLFR1 89 - Homo sapiens, 298 
aa. [WO200168805-A2, 20-SEP- 
2001] 


1..286 
1.286 


284/286 
(99%) 

284/286 
(99%) 


e-163 


AAE09969 


G-protein coupled-receptor 
(GPCR) NOV5 protein - 
Unidentified, 310 aa. 

lAVrf^Oflfi 1 AO 1 ^ CCD 
[WUiUU lOO/40-Az, iJ-obr- 

2001] 


1..309 
1..309 


210/309 
(67%) 
262/309 

\03 VO) 


e-125 


AAG72412 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2093 - 

Homo sapiens, 3 1 0 aa. 
[WO200127158-A2, 19-APR- 
2001] 


1..309 
1..309 


210/309 
(67%) 

262/309 
(83%) 


e-125 


AAG72314 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1995 - 
Homo sapiens, 3 10 aa. 
[WO200127158-A2, 19-APR- 
2001] 


1..309 
I ..309 


210/309 
(67%) 

262/309 
(83%) 


e-125 



In a BLAST search of public sequence datbases, the NOV24a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 24D. 
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Table 24D. Public BLASTP Results for NOV24a 


Protein 
Accession 
Number 


rrotein/Urganism/Length 


NOV24a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q8VGF4 


OLFACTORY RECEPTOR 
iviUK^JU-j - Mus musculus 
(Mouse), 307 aa. 


1..307 
1..307 


272/307 (88%) 
294/307 (95%) 


e-160 


Q8VGF5 


OLFACTORY RECEPTOR 
MOR230-2 - Mus, musculus 
(Mouse), 307 aa. 


1 .305 
1..305 


261/305 (85%) 
zojouo \yz /o) 


e-155 


Q8VGF6 


OLFACTORY RECEPTOR 
MOR230-1 - Mus musculus 
(Mouse), 307 aa. 


1.307 
1..307 


259/307 (84%) 
286/307 (92%) 


e-154 


Q8VFF8 


OLFACTORY RECEPTOR 
MOR230-6 - Mus musculus 
(Mouse), 303 aa. 


4..304 
3..303 


238/301 (79%) 
273/301 (90%) 


e-143 


Q8VFF9 


OLFACTORY RECEPTOR 
MOR230-5 - Mus musculus 
(Mouse), 303 aa. 


4..304 
3..303 


237/301 (78%) 
275/301 (90%) 


e-I41 



PFam analysis predicts that the NOV24a protein contains the domains shown 
the Table 24E. 



in 



Table 24E. Domain Analysis of NOV24a 


Pfam Domain 


NOV24a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l 


39.. 199 


36/181 (20%) 
120/181 (66%) 


4.4e-27 



The NOV25 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 25 A. 
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Table 25A. NOV25 Sequence Analysis 




SEQ ID NO: 71 


847 bp 


NOV25a, 
CG8921 1-01 DNA Sequence 


AACAATGC^TGCAGGATGGCATACCATGGCAACAGGGGCACTTCTCACCCAGCCACA 
TTTTTTCTCATTGGAATCCCAGGTCTGGAAGACGTCCATATGAGAATCTCCCTGCZAT^ 
TCTG CTCTGTTTACCTTTTGG CTTTG CTGGG AAATGCTACCATTCTGTTAGTCATCAA 
GGCAGAAC^GACCCTCCGGGAGCCCATGTTCTACTTTCTGGCCATCCTTTCCACAATT 
G ATTTGG CCCTTTCT ACAACC T LTl'GTG C CT CGTACG C TGGGT AT CTTCTGG TTTG ATG 
TTCATGAGATTAACTTTGGAGCTTGTGTGG CCCAG ATGTTTCTG ATCCATG CCTTCAC 
TGGCATGGAGGCTGAGGTCCTGGTGGCCATGGCCTTTGACCATTACGTGGCCATCTGC 
AATCCACTTCACTACACAAACATCTTGACATGCCGGGTGCTGGTGGGCATCACTATGT 
GGATTTTAATTCGTCCAGTTCTGTTTACACTCCCGATAATCTATCTCATCTACCGTTT 
AC C ATTTTGTCAGGCTCATATAATAGCCCATTCCTACTATG AGCACATGGG CATTGCA 
AAATTGTCCTGTGGAAACATCCGTGTCAATGCTATCTATGGGCTCTTTGTGGTCTCCC 
T CTTT CT C CTG AACCTGGTCCTT ATTGTTATCT CATATGTTTACATTCTCTGTG CTGT 
CTTCTGCCTCCCATCACATGATGCTCGG CTAAAAG CCCTAAGCACATGTGG CTCTCAT 
GTTG<5GGTCATCTGTGTTTTCrATATCCCGTCGGTCTTCTCTTTCCTTACTC^TTGAT 
TTG G AC ACAAC ATT CCACATT AC ATT CACATT CTT 




ORF Start: ATG at 5 


ORF Stop: TGA at 809 




SEQ ID NO: 72 


268 aa 


MWat 30031.5kD 


NOV25a, 
CG8921 1-01 Protein Sequence 


MQCRMAYHGNRGTS HP AT F FL I G I PGLEDVHMRI S LH FCS VYLLALLGNAT ILLVI KA 
EQTLRE PM FYFLAI LST I DLALS TTS VPRTLG I FWFDVHE I NFG ACVAQM FL I HAFTG 
MEAEVLVAMAFDHYVAI CNPLHYTN I LTCRVLVG I TMCIL I RPVLFTLP 1 1 YLIYRLP 
FCQAHIIAHSYYEHMGIAKLSCGNIRVNAIYGLFWSLFLliNLVLIVISYVYILCAVF 
CLPSHDARLKALSTCGSHVGVI CVFYIP SVFS FLTH 



Further analysis of the NOV25a protein yielded the following properties shown in 
Table 25B. 



Table 25B. Protein Sequence Properties NOV25a 


PSort analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi body; 
0. 1 000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 59 and 60 



A search of the NOV25a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 25C. 
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Table 25C. Geneseq Results for NOV25a 


Geneseq 
Identifier 


Protein/Organism/Length 
[raieni #, L/atej 


NOV25a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG71560 


Human olfactory receptor 
polypeptide, SEQ ID NO: 124 1 

- Homo sapiens, 321 aa. 
[wvjzuuiz/ 1 jo-Az, 19-APR- 
2001] 


S..268 
L.264 


253/264 (95%) 
255/264 (95%) 


e-146 


ABB44529 


Human GPCR5b polypeptide 
SEQ ID NO 18 -Homo 
sapiens, 3 11 aa. 
twuzUUl /4y04-A2, 1 1-OCT- 
2001] 


5..26S 
1..263 


225/264 (85%) 
239/264 (90%) 


e-I27 


ABB44530 


Human GPCR5c polypeptide 
SEQ ID NO 20 - Homo 
sapiens, 311 aa. 
(WO200174904-A2, 1 1-OCT- 
2001] 


5..268 
1..263 


225/264 (85%) 
238/264(89%) 


e-127 


AAU71562 

i 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1243 

- Homo sapiens, 3 1 1 aa. 
[WO200127158-A2, 19-APR- 
2001] 


5..268 
1..263 


225/264(85%) 
237/264(89%) 


e-127 


AAU24572 


Human olfactory receptor 
AOLFR62 - Homo sapiens, 
311 aa. [WO200168805-A2, 
20-SEP-2001] 


5..26S 
1.263 


225/264 (85%) 
237/264 (89%) 


e-127 



In a BLAST search of public sequence datbases, the NOV25a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 25D. 
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Table 25D. Public BLASTP Results for NOV25a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV25a Residues/ 
Match Residues 


Identities/ 
Similarities for 
tne iviaicnea 
Portion 


Expect 
V&lue 


Q8VGV8 


OLFACTORY RECEPTOR 
MOR32-3 - Mus muscuius 
(Mouse), 317 aa. 


15..268 
1 0..263 


148/254(58%) 


9e-86 


Q8VGX4 


OLFACTORY RECEPTOR 

IVlL/ivJZ~Z - MUS muscuiub 

(Mouse), 312 aa. 


I0..268 
5. .263 


151/259(58%) : 
194/259(74%) 


6e-85 


Q8VG26 


OLFACTORY RECEPTOR 
MOR32-5 - Mus muscuius 
(Mouse), 313 aa. 


8..268 
3..263 


146/261 (55%) 
196/261 (74%) 


4e-84 


Q8VF06 


OLFACTORY RECEPTOR 
MOR32-1 1 - Mus muscuius 
(Mouse), 3 12 aa. 


15..268 
10..263 


144/254(56%) 
191/254(74%) 


2e-83 


Q8VG28 

! 

4 


OLFACTORY RECEPTOR 
MOR32-4 - Mus muscuius 
(Mouse), 312 aa. 


9..268 
4..263 


148/260 (56%) 
188/260 (71%) 


2e-80 



PFam analysis predicts that the NOV25a protein contains the domains shown in 
the Table 25E. 



Table 25E. Domain Analysis of NOV25a 


Pfam Domain 


NOV25a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tmJ 


48..157 


30/112(27%) 
76/112(68%) 


2.2e-09 



Example 26. 



The NOV26 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 26A. 
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Table 26A. NOV26 Sequence Analysis 




SEQ ID NO: 73 


614 bp 


NOV26a, 
CG90530-02 DNA Senuence 


CTTAGTGTGGAGCAGTGAACTGTGTGTGGTTCCTTCTACTTGGGGATCATGCAGAGAG 
CTTCACGTCTGAAGAGAGAGCTGCACATGTTAGCCACAGAGCCACCCCCAGGCATCAC 
ATGTTGGCAAGATAAAGACCAAATGGATGACCTGCGAGCTCAAATATTAGGTGGAGCC 
AACACACCTTATGAGAAAGGTGTTTTTAAGCTAGAAGTTATCATTCCTGAGAGGTACC 
CATTTGAACCTCCTCAGATCCGATTTCTC^CTCCAATTTATCATCCAAACATTGATTC 
TGCTGGAAGGATTTGTCTGGATGTTCTCAAATTGCCACCAAAATCCTCAGAATTTAAA 
TATAATAAGCCAGCCTTCCTCAAGAATGCCAGACAGTGGACAGAGAAGCATGCAAGAC 
AGAAACAAAAGGCTGATGAGGAAGAGATGCTTGATAATCTACCAGAGGCTGGTGACTC 
CAGAGTACACAACTCAACACAGAAAAGGAAGGCCAGTCAGCTAGTAGGCATAGAAAAG 
AAATTTCATCCTGATGTTTAGGGGACTTGTCCTGGTTCATCTTAGTTAATGTGTTCTT 
TGCCAAGGTATCTAAGTTGCCTACCTTGAATTTT 




ORF Start: ATG at 49 


ORF Stop: TAG at 541 




SEQ ID NO: 74 


164 aa 


MWatl8958.6kD 


NOV26a, 
CG90530-02 Protein Sequence 


MQRASRL KRELHMLATE P PPGITCWQDKDQMDDLRAQILGGANTPYEKGVFKLEVI I P 
ERYPPEPPQIRFLTPIYHPNIDSAGRICLDVLKLPPKSSEFKYNKPAFLKNARQWTEK 
HARQKQKADEEEMLDNLPEAGDSRVHNSTQKRKASQLVGIEKKFHPDV 



Further analysis of the NOV26a protein yielded the following properties shown i 
Table 26B. 



Table 26B. Protein Sequence Properties NOV26a 


PSort analysis: 


0.4500 probability located in cytoplasm; 03000 probability located in microbody 
(peroxisome): 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


. . No Known Signal Sequence Predicted 



A search of the NOV26a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 26C. 
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Table 26C Geneseq Results for NOV26a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV26a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAB51233 

i 


Human ubiquitin-conjugating 
enzyme 2 protein sequence SEQ 
ID NO: 7 - Homo sapiens, 197 aa. 
[CN1268564-A, 04-OCT-2000] 


I ..164 
1.197 


164/197 
164/197 (83%) 


iC-07 


AAM4145 
5 


Human polypeptide SEQ ID NO 
6386 - Homo sapiens, 207 aa. 
[WO200153312-AI, 26-JUL- 
2001] 


I. .164 

II. .207 


164/197 Ott 0 /^ 

i v/*t/ 17/ yoj /O j 

164/197 (83%) 




AAM3966 
9 


Human polypeptide SEQ ID NO 
2814 - Homo sapiens, 197 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..164 
1..I97 


164/197 (83%) 
164/197(83%) 


2e-89 


AAU23245 


Novel human enzyme 
polypeptide #33 1 - Homo 
sapiens, 207 aa. [WO200 155301- 
A2, 02-AUG-2001] 


I. .I64 

II. .207 


164/197 (83%) 
164/197 (83%) 


2e-89 


AAY23157 

i 


Human ubiquitin-Iike 
conjugating protein (UBCLE) - 

Homo sapiens, 197 aa. 
[ W0993 1 252-A 1 , 24- J UN- 1 999] 


1.164 
1..197 


164/197 (83%) 
164/197 (83%) 


2e-89 



In a BLAST search of public sequence datbases, the NOV26a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 26D. 



185 



WO 03/010327 



PCT/US02/14199 



Table 26D. Public BLASTP Results for NO V26a 


Protein 
Accession 

Nnmhf»r 


Protein/Organism/Length 


NOV26a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q9NPD8 


UBIQUITIN-CONJUGATING 
ENZYME ISOLOG (HSPC150) 
(UBIQUITIN-CONJUGATING 

FIS, CLONE KAT08890) (HSPC150 
PROTEIN SIMILAR TO 
UBIQUITIN-CONJUGATING 
ENZYME) - Homo sapiens 
(Human), 197 aa. 


1..164 
1..197 


164/197 (83%) 
164/197 (83%) 


4e-89 


QycQ37 


2700084L22RIK PROTEIN - Mus 
musculus (Mouse), 204 aa. 


1..164 
1..196 


136/197 (69%) 
145/197 (73%) 


7e-70 


Q9XHP3 


UBIQUITIN-CONJUGATING 
ENZYME E2 - Catharanthus roseus 
(Rosy periwinkle) (Madagascar 
periwinkle), 153 aa. 


6..91 
9..94 


49/86 (56%) 
60/86 (68%) 


2e-23 


Q8W0I1 


UBIQUITIN-CONJUGATING 
ENZYME E2 - Oryza sativa (Rice), 
153 aa. 


6..91 
9..94 


46/86 (53%) 
58/86 (66%) 


7e-21 


Q94A97 

1 


AT1G78870/F9K20_8 - Arabidopsis 
thaliana (Mouse-ear cress), 153 aa. 


6..91 
9..94 


46/86 (53%) 
57/86 (65%) 


7e-21 



PFam analysis predicts that the NOV26a protein contains the domains shown in 
the Table 26E. 



Table 26E. Domain Analysis of NOV26a 


Pfam Domain 


NOV26a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


UQ_con 


1..116 


51/165 (31%) 
91/165(55%) 


2.9e-23 



Example 27. 



The NOV27 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 27A. 
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Table 27A. NOV27 Sequence Analysis 




SEQ ID MO: 75 


1 973 bp 


CG93076-01 DNA Sequence 


GGATGAACCAGACTTTGAATAGCAGTGGGACCGTGGAGTCAGCCCTAAACTATTCCAG 
AGGG AGCACAGTG CACACGG CCTACCTGGTGCTGAGCTCC CTGG CCATGTTCACCTGC 
CTGTGCGGGATGGC7VGGCAACAGCATGGTGATCTGGCTGCTGGGCTTTCGAATGCACA 
GGAACCCCTTCTGCATCTATATCCTC^CC^ 

CAGCATGGCHTCCACGCTCAGCCTGGAAACCCAGCCCCTGGTCAATACCACTGACAAG 
GTCCACGAGCTGATGAAGAGACTGATGTACTTTGCCTACACAGTGGGCCTGAGCCTGC 




TCACCGGCCCAGGCACCTGTCAGCCTGGGTGTGTGGCCTGCrGTGGACACTCTGTCTC 

CTG ATGAACGGGTTGACCTCTTCCTT CTGCAG CAAGTTCTTG AAATTCAATGAAGATC 

GGTGCTTCAGGGTGGACATGGTCCAGGCCGCCCTCATCATGGGGGTCTTAACCCCAGT 

GATGACTCTGTCCAGCCTGACCCTCTTTGTCTGGGTGCGGAGGGGCTCCCAGCAGTGG 

CGGCGGCAGCCCACACGGCTGTTCGTGGTGGTCCTGGCCTCTGTCCTGGTGTTCCTCA 

TCTGTTCCCTGCCTCTGAGCATCTACrGGTTTGTGCTCTACTGGTTGAGCCT 

CGAGATGCAGGTCCTGTGCITCAGCTTGTCACGCCrCTCCTCGTCCGTAAGCAGCAGC 

GCCAACCCCGTCATCTACTTCCTGGTGGGCAGCCGGAGGAGCCACAGGCTGCCCACCA 

GGTCCCTGGGGACTGTGCTCCAACAGGCGCTTCGCGAGGAGCCCGAGCTGGAAGGTGG 

GGAGACGCCCACCGTGGGCACCAATGAGATGGGGGCTTGAGAGCC 




ORF Start: ATG at 3 


ORF Stop: TGA at 966 




SEQ ID NO: 76 


321 aa 


MWat 36087.IkD 


NOV27a, 
CG93O76-01 Protein Sequence 


MNQTLNSSGTVES ALNYSRGSTYHTAYLVLS SIAMFTCLCGMAGNSMVI WLLGFRM^ 

NPFCIYILNIAAADLLFLFSMASTLSLETQPLVNTTDKVHEIiMKRIJ^y 

TAI STQRCIjSVLFPI WFKCHRPRHLSAWVCGLI#WTLCLLMNGI*TS S FCSKFLKFNEDR 

CFRVDMVQAALIMG^^TPVMTLSSLTLFVWVRRGSQQWRRQPTRLFVVVIiASVLVFIjI 

CSLPLSIYWFVLYWLSLPPEMQVLCFSLSRLSSSVSSSANPVIYFLVGSRRSHRLPTR 

SLGTVLQQALREEPELEGGETPTVGTNEMGA 



Further analysis of the NOV27a protein yielded the following properties shown in 
Table 27B. 



Table 27B. Protein Sequence Properties NOV27a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.3000 
probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 45 and 46 



A search of the NOV27a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 27C 



187 



WO 03/010327 



PCT/US02/14199 



Table 27C. Geneseq Results for NOV27a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, Date] 


NOV27a 
Residues/ 
Match Residues 


Identities/ 
Similarities 
• for the 
Matched 
Region 


Expect 
Value 


ABB04658 


Human G protein-coupled receptor TGR7 
SEQ ID NO: 1 - Homo sapiens, 32 1 aa. 
[WO200183748-A1, 08-NOV-2001] 


K.32I 
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 


AAU 19292 


Human G protein-coupled receptor 
nGPCR~74 - Homo sapiens, 321 aa. 
[WO200166750-A2, 13-SEP-200I] 


1..321 
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 


AAG64124 


Human G protein-coupled receptor 
GPRv51 - Homo sapiens, 321 aa. 
[WO200148188-A1, 05-JUL-2001] 


1..321 
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 


AAU04366 


Human G -protein coupled receptor, 
hRUP12 - Homo sapiens, 321 aa. 
[WO200136471-A2, 25-MAY-2001] 


1..321 
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 


AAE06768 


Human G-protein coupled receptor- 18 
(GCREC-18) protein - Homo sapiens, 
321 aa. [WO200157085-A2, 09-AUG- 
2001] 


I. .321 
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 



In a BLAST search of public sequence datbases, the NOV27a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 27D. 
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Table 27D. Public BLASTP Results for NOV27a 


Protein 
Accession 
Mumber 


Protein/Organism/Length 


NOV27a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q91ZB8 


G PROTEIN-COUPLED 
RECEPTOR - Mus musculus 
(Mouse), 321 aa. 


1..318 
1.319 


189/321 (58%) 1 
237/321 (72%) 


3e-98 


Q96LA9 


G PROTEIN-COUPLED 
RECEPTOR - Homo sapiens 
(Human), 322 aa. 


1..310 
1..305 


134/317 (42%) 
1 oy/j 1 / {py /o) 


7e-54 


AAL86883 


G PROTEIN-COUPLED 
RECEPTOR SNSR6 - Homo 
sapiens (Human), 322 aa. 


1..310 
1..305 


133/317(41%) 
188/317(58%) 


2e-53 


AAL86882 


G PROTEIN-COUPLED 
RECEPTOR SNSR5 - Homo 
sapiens (Human), 322 aa. 


1..310 
1..305 


134/317(42%) 
188/317(59%) 


3e-53 


AAL86880 


G PROTEIN-COUPLED 
RECEPTOR SNSR3 - Homo 
sapiens (Human), 322 aa. 


1..309 
1..304 


129/314(41%) 
188/314 (59%) 


2e-52 



PFam analysis predicts that the NOV27a protein contains the domains shown in 
the Table 27E. 



Table 27E. Domain Analysis of NOV27a 


Pfam Domain 


NOV27a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l 


44..78 


17/37(46%) 
33/37(89%) 


1.4e-09 


7tm_l 


104.276 


39/205 (19%) 
119/205(58%) 


7.3e-09 



Example 28. 



The NOV28 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 28A. 
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Table 28A. NOV28 Sequence Analysis 




SEQ ID NO: 77 


2856 bp 


NOV28a, 
CG94235-01 DNA Sequence 


GGGCGGCGCGGGGTCTGCGCTGGGGCCATGGCTCCGCCGCGCCGCTTCGTCCTGGAGC 
TTCCCGACTGCACCCTGGCTCACTTCGCCCTAGGCGCCGACGCCCCCGGCGACGCAGA 
CGCCCCCGACCCCCGCCTGGCGGCGCTGCTGGGGCCCCCGGAGCGCAGCTACTCGCTG 
TGCGTGCCCGTGACCCCGGACGCCGGCTGCGGGGCCCGGGTCCGGGCGGCGCGGCTGC 
ACCAGCGCCTGCTGCACCAGCTGCGCCGCGGCCCCTTCCAGCGGTGCCAGCTGCTCAG 
GCTGCTCTGCTACTGCCCGGGCGGCCAGGCCGGCGGCGCACAGCAAGGCTTCCTGCTG 
CGCGACCCCCTGGATGACCCTGACACCCGGCAAGCGCTGCTCGAGCTGCTGGGCGCCT 
GTCAGGAGGCAC CACGCCCGCACTTGGG CGAGTTCGAGGCCGACCCGCGCGGCCAGCT 
GTGGCAGCGCCTCTGGGAGGTGCAAGACGGCAGGCGGCTGCAGGTGGGCTGCGCACAG 
GTCGTGCCCGTCCCGGAGCCCCCGCTGCACCCGGTGGTGCCAGACTTGCCCAGTTCCG 
TGGTCTTCCCGGACCGGGAAGCCGCCCGGGCCGTTTTGGAGGAGTGTACCTCCTTTAT 
TCCrGAAGCCCGGGCAGTGCTTGACCTGGTCGACCAGTGCCCAAAACAGATCCAGAAA 
GGAAAGTTCCAGGTTGTTGCCATCGAAGGACTGGATGCCACGGGTGGTAAAACCACGG 
TGACCCAGTCAGTGGCAGATTCACTTAAGGCTGTCCTCTTAAAGTCACCACCCTCTTG 
CATTGGCCAGTGGAGGAAGATCTTTGATGATGAACCAACTATCATTAGAAGAGCTTTT 
T ACT CTTTGG G CAATT ATATTGTG GCCTC CG AAAT AG CT AAAG AATCTG CCAAAT CT C 
CTGTGATTGTAGACAGGCACAGCACGGCCACCTATGCCATAGCCACTGAGGTGAGTGG 
GGGTCTCCAGCACCTGCCCCCAGCCCATCACCCTGTGTACCAGTGGCCAGAGGACCTG 
CTCAAACCTGACCTTATCCTG CTG CTCACTGTG AGTCCTG AGG AGAGGTTGCAGAGG C 
TG CAGGGC CGGG GCATGGAGAAG AC CAG G G AAG AAG CAG AACTTG AGG C CAACAGTG T 
GTTTCGTCAAAAGGTAGAAATGTCCTACC^GCGGATGGAGAATCCTGGCTGCCATGTG 
GTTGATGCCAGCCCCTCCAGAGAAAAGGTCCTGCAGACGGTATTAAGCCTAATCCAGA 
ATAGTTTTAGTGAACCGTAGTTACTCTGGCCAGGTGCCACGTCTAACTAGATTAGATG 
TTGTTTGAAACATCTACATCCACCATTTGTTATGCAGTGTTCCCAAATTTCTGTTCTA 
CAAGCATGTTGTGTGGCAGAAAACTGGAGAC<^GGCATCTTAATTTTACTTCAGCCAT 
CGTACCCTCTTCTGACrC^TGGACCCGTC^TCACAAAGGTCCCTCTCATCATGTTCC^ 
GTGAGAGGCCAG CGATTGCTTTCTTCCTGG CATAGTAAACATTTTCTTGGAACATATG 
TTTCACTTAAT(^CTACCAAATATCTGGAAGACCTGTCTTACTCAGACAGCACCAGGT 
GTAC^GAAGCAGCAGAC^GATCTTCCAGATCAGCAGGGAGACCCCGGAGCCTCTGCT 
TCTCCTACACTGGCATGCTGATGAGATCGTGACATGCCC^CATTGGCTTCTTCCACAT 
CTGGTTGCACTCGTCATGATGGGCTCGCTGCATCTCCCTCAGTCCCAAATTCTAGAGC 
CAAGTGTTCCTGCAGAGGCTGTCTATGTGTCCrGGCTGCCCAAGGACACTCCTGCAGA 
GCCATTTTTGGGTAAGGAACACTTACAAAGAAGGCATTGATCTTGTGTCTGAGGCTCA 
GAGCCCTTTTGATAGGCTTCTGAGTCATATATA 

ACTG CAAATAT ACCAACCTTTCTCTGAATTATATTTTG CTTATTT ATATT^ 
TTTTTCT AAAGTATGG CTCTG AATAG AATG CACATTTTCCATTGAACTGGATG CATTT 
CATTTAG CCAAT CCAGTAATTTATTTATATTAATCTATACATAATATGT^ 
ATAGGAGCTATGATTC^TTAATTAAAAGTGGAGTCAAAACGCTAAATGCAATGTTTGT 
TGTGTATTTTCATTACACAAACTTAATTTGTCTTGT^ 

GAGTGGGATTTCrTGGTAAATTATCTTGCACTTGAATGTCTCATGATTACATATGAAA 
TOSCTTTGACATATCTrTAGACAGAAAAAAGTAGCrrGAGTGAGGGGGAAATTATAGAG 
CTGTGTGACTTTAGGGAGTAGGTTGAACCAGGTG AT T AC CTAAAATT CCTTCCAGTTC 
AAAGGGAGATAAATCTGTAAATTATTTTATCCTATCTACCATTTCTTAAGAAGACATT 
ACTCCAAAATAATT AAATTT AAGGCTTT AT CAGGTCTG CATATAG AATCTTAAATTCT 
AATAAAGTTTCATGTTAATGTCATAGGATTTTTAAAAGAGCTATAGGTAATTTCTATA 
T AATATGTGTAT ATT AAAATGTAATTGATTT CAGTTGAAAGT ATTTTAAAG CTGATAA 
ATAGCATTAGGGTTCTTTGCAATGTGGTATCT7U3 

TAAACATTTTG AAAAG CTTATACTGG CAGCCTAG AAAAACAAACAATTAATGTATCTT 
TATGTCCCTGGCACATGAATAAACTTTGCTGTGGTTTACTAATCTAAAAAAAAAAAAA 
AAAGGG CGGCCGCT 




ORF Start: ATG at 28 


ORF Stop: TAG at 1294 




SEQ ID NO: 78 


422 aa 


MW at46476.6kD 


NOV28a, 
CG94235-01 Protein Sequence 


MAPPRRFVLELPDCTIJUiFALGADAPGDADAPDPRIJ^^LGPPERSYSLCVPVTPDAG 
CG ARVRAARLHQRLLHQLRRG P FQRCQL LRLLCY C PG GQAGG AQQG FLLRDPLDDPDT 
RQALLEI^GACQEAPRPHIjGEFEADPRGQLWQRLWEVQDGRRIjQVGCAQVVPVPEPPL 
HPWPDLPSSVVFPDREAARAVLEECTS F I PEARAVLDLVDQCPKQI QKGKFQWAIE 
GI^ATGGKTTVTQSVADSIJCAVLLKSPPSCIGQWRKI FDDEPTI IRRAFYSLGNYIVA 
SEIAKESAJCSPVIVDRHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLL 
WSPEERLQRIjC^RGMEKTREEAELEANSVFRQKVEWSY 
VLQTVLSLIQNSFSEP 
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SEQIDNO:79 | 2331 bp 


NOV28b, 
CG94235-02 DNA Sequence 

i 

i 

I 

i 

j 

i 


CCCTWCTC^OT 

AGCCCGGGCAGTGCTTGACCTGGTCGACCAGTGCCCAAAACAGATCCAGAAAGGAAAG 
TTCCAGGTTGTTGCCATCGAAGGACTGGATGCCACGGrGTAAAACCACGGTGACCCAGT 
CAGCGGCAGATTCACTTAAGGCTGTCCTCTTAAAGTCACCACCCTCTTGCATTGGCCA 
GTGGAGGAAGATCTTTGATGATGAACCAACTATCATTAGAAGAGCTTTTTACTCTTTG 
GGCAATTATATTGTGGCCTCCGAAATAGCTAAAGAATCTGCCAAATCTCCTGTGATTG 
TAGACAGGTACTGGCACAGCACGGCCACCTATGCCATAGCCACTGAGGTGAGTGGGGG 
TCTCCAGCACCTGCCCCCAGCCCATCACCCTGTGTACCAGTGGCCAGAGGACCTGCTC 
AAACCTGACCTTATCCTGCTGCTCACTGTGAGTCCTGAGGAGAGGTTGCAGAGGCTGC 
AGGGCCGGGGCATGGAGAAGACCAGGGAAGAAGCAGAACTTGAGGCCAACAGTGTGTT 
TCGTCAAAAGGTAGAAATGTCCTACCAGCGGATGGAGAATCCTGGCTGCCATGTGGTT 
GATGCCAGCCCCTCCAGAGAAAAGGTCCTGCAGACGGTATTAAGCCTAATCCAGAATA 
GTTTTAGTGAACCGTAGTTACTCTGGCCAGGTGCCACGTCTAACTAGATTAGATGTTG 
TTTGAAACATCTACATCCACCATTTGTTATGCAGTGTTCCCAAATTTCTGTTCTACA^ 
GCATGTTGTGTGGCAGAAAACTGGAGACC^GGCATCrTAAGTTTACTTCAGCCATCGT 
ACCCTCTTCTGACTGATGGACCCGTCATCACAAAGGTCCCTCTCATCATGTTCCAGTG 
AGAGGCCAGCGATTGCTTTCTTCCTGGCATAGTAAACATTTTCrTGGAACATATGTCT 
CACTTAATCACTACCAAATATCTGGAAGACCTGTCTTACTCAGACAGCACCAGGTGTA 
CAGAAGCAGCAGACAAGATCTTCCAGATCAGCAGGGAGACCCCGGAGCCTCTGCTTCT 
CCTACACTGGCATGCTGATGAGATCGTGACATC^ 

GTTGCACTCGTCATGATGGGCTCGCTGCATCTCCCTCAGTCCCAAATTCTAGAGCCAA 

GTGTTCCTGCAGAGGCTGTCTATGTGTCCTGGCTGCCCAAGGACACTCCTGCAGAGCC 

ATTTTTGGGTAAGGAACACTTACAAAGAAGGCATTGATCTTGTGTCTGAGGCTCAGAG 

CCCTTTTGATAGGCTTCTGAGTCATATATAAAGACATTCAAGCCAAGATGCTCCT^CT 

GCAAATATACCAACCTTCTCTGAATTATATTTTGCTTATTTATATTTCTTTTCTTTTT 

TTCTAAAGTATGGCTCTGAATAGAATGCACATTTTCCATTGAACTGGATG 

TTAGCCJLATCCAGTAATTTATTTATATTAATCTATACATAATATGTTTCCTCAGCATA 

GGAGCTATGATTCATTAATTAAAAGTGGAGTCAAAACGCTAAATGCAATGTTTGTTGT 

GTATTTTCATTACACAAACTTAATTTGTCTTGTTAAATAAGTACAGTGGATCTTGGAG 

TGGGATTTCTTGGTAAATTATCTTGCAOTTGAATGTCTCATGATTACAT ATGZUi a Trr 

CTTTGACATATCTTTAGACAGAAAAAAGTAGCTGAGTGAGGGGGAAATTATAGAGCTG 

TGTGAC^TTAGGGAGTAGGTTfl2iar , r , af^r2Tr , a'rT'ar'r''T>7VT\-n » rMvi«iimn«» . 

* * *™«unuift\3ui j. vj«-rtv_\_>i\jVj j, l ACCTAAAATTCCTTCCAGTTCAAA 

GG C AG ATAAAT CTGTAAATT ATTTTAT CCT ATCTACCAT TT CTT AAG AAG ACATTA CT 

CCAAAATAATTAAATTTAAGGCTTTATCAGGTCTGCATATAGAATCTTAAATTCTAAT 

AAAGTTTCATGTTAATGTCATAGGATTTTTAAAAGAGCTATAGGTAATTTCTGTATAA 

TATGTGTATATTAAAATGTAATTGATTTCAGTTGAAAGTATTTTAAAGCTGATAAATA 

GCATTAGGGTTCTTTGCAATGTGGTATCTAGCTGTATTATTGGTTTTATTTACTTTAA 

ACATTTTGAAAAGCTTATA CT CtdCT^a r^Tar 1 a 2v?i * i\ a * fummnnnmrtm* ,,, , 

* * i ± xx x nVtf x ouLfiun- 1 AVaAAAAACAAACAATTAATGTATCTl 1 1 AT 

GTCCCTGGCACATGAATAAACTTTGCTGTGGTTTACTAAAAAAAAAAAAAAAAAAAAA 
GGGCGG CCGCT 




ORF Start: ATG at 1 6 J ORF Stop: TAG at 769 




SEQIDNO:80 j 251 aa MW at 27980.8kD 


NOV28b, i 
CG94235-02 Protein Sequence : 

; 

i 


vlAPPRRFVLELPDCTI^FALGAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQVVA 
I EGLDATGKTTVTQSAADSLKAVLLKSPPSCIGQWRKI FDDEPTI IRKAFYSLGNYI V 
\SE I AKES AKS PVI VDRYWHSTATYAI ATEVSGGLQHLPPAHHP VYQWPEDIiLKPDL I 
jLLTVS P EERLQRIjQGRGMEKTREEAELEANSVFRQKVEMS YQRMENPG CHWDAS PS 
IEKVLQTVLSLIQNS FSEP 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 28B. 



Table 28B. Comparison of NOV28a against NOV28b. 


Protein Sequence 


NOV28a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV28b 


192..422 ; 
20..251 


217/233 (93%) 
217/233 (93%) 
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Further analysis of the NOV28a protein yielded the following properties shown in 
Table 28C. 



Table 28C Protein Sequence Properties NOV28a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1939 probability located in lysosome (lumen); 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV28a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 28D. 



Table 28D. Geneseq Results for NOV28a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV28a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB57297 


Mouse ischaemic condition related 
protein sequence SEQ ID NO:834 - Mus 
musculus, 431 aa. [WO200188188-A2, 
22-NOV-2001] 


199..384 
157..344 


131/193 (67%) 
150/193 (76%) 


3e-64 


AAU34536 


E. coli cellular proliferation protein #117 
- Escherichia coli, 213 aa. 
[WO200170955-A2, 27-SEP-2001] 


229..421 
6..210 


51/212(24%) 
88/212(41%) 


0.054 


AAB72201 


E. coli thymidylate kinase amino acid 
sequence - Escherichia coli, 213 aa. 
[WO200111025-A2, 15-FEB-2001] 


229..421 
6..210 


51/212(24%) 
88/212(41%) 


0.054 


AAU34536 


E. coli cellular proliferation protein #117 
- Escherichia coli, 213 aa. 
[WO200170955-A2, 27-SEP-2001] 


229..421 
6..210 


51/212(24%) 
88/212(41%) 


0.054 


AAY28786 


E.coli thymidylate kinase- 1 - Escherichia 
coli, 213 aa. [WO9941404-A2, 19-AUG- 
1999] 


229..421 
6..210 


51/212(24%) 
88/212(41%) 


0.054 



In a BLAST search of public sequence datbases, the NOV28a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 28E. 
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Table 28E. Public BLASTP Results for NOV28a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV28a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q9DCj4 


I200004E04R1K PROTEIN - 
Mus musculus (Mouse), 395 
aa. 


42..418 
1 6.393 


292/379 (77%) 
328/379 (86%) 


e-loo 


V^OZ J 1 0 


TUVN/linVI ATE !/■ TXT A CC 

1 ri Y MIL/ Y LA i t IvlNAofc 

HOMOLOGUE - Mus 
musculus (Mouse), 43 1 aa. 


157..344 


Ul/l 7 J [Of /O) 

150/193 (76%) 




Q96AL8 


HYPOTHETICAL 6.7 KDA 
PROTEIN - Homo sapiens 
(Human), 58 aa (fragment). 


365„422 
1..58 


58/58 (100%) 
58/58 (100%) 


3e-25 


060970 


TKRP1 - Leishmania major, 
274 aa. 


199..418 
44..254 


73/220 (33%) 
108/220 (48%) 


9e-22 


Q9CKE9 


Thymidylate kinase (EC 
2.7.4.9) (dTMP kinase) - 
Pasteurella multocida, 209 aa. 


224..416 
4..203 


59/210(28%) 
87/210(41%) 


4e-04 



PFam analysis predicts that the NOV28a protein contains the domains shown in 
the Table 28F. 



Table 28F. Domain Analysis of NOV28a 


Pfam Domain 


NOV28a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Thymidylatejkin 


231..411 


53/208 (25%) 
131/208 (63%) 


4.9e-08 



Example 29. 



The NOV29 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 29A. 
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Table 29A. NOV29 Sequence Analysis 




SEQ ID NO: 81 


867 bp 


NOV29a, 
CG94692-01 DN A Sequence 


ATGCCGGTGGAGGAATTTGTGGCTGGCTGGATCTCTGGCGCTCTGGGCTTGGTCCTGG 
GACACCCGTTTGACACTGTAAAGGTGAGGCTGCAGACCCAGACCACCTACCGGGGCAT 
CGTTGATTGCATGGTCAAGATTTACCGCCATGAGTCCCTCCTGGGCTTCTTCAAGGGA 
ATG AGCTTCCCCATTGCCAG CAT AG CTGTGGTCAACTCTGTCCTGTTTGGGGTCTATA 
GC^C^CCCTGCTGGTGCTCACGGCCACCTCCCACCAGGAGCGGCGGGCCCAGCCGCC 
CAGCTACATGCACATCTTCCTAGCGGGCTGCACCGGGGGGTTCCTGCAGGCCTACTGT 
CTGGCTCCTTTTGACCTCATCAAAGTCCGGCTACAAAACCAGACAGAGCCAAGGGCCC 
AGCCAGGGAGCCCCCCACCCCGGTACCAGGGGCCCGTGCACTGTGCAGCCTCCATCTT 
CCGGGAGGAGGGGCCCCGGGGGCTGTTCCGAGGAGCCTGGGCCCTGACGCTGAGGGAC 
ACCCCCACGGTGGGGATCTACTTCATCACCTATGAAGGGCTCTGTCGCCAGTACACAC 
CAGAAGGCCAGAATCCCAG CTCAGCCACGGTGCTGGTGGCAGGGGGCTTTGCAGG CAT 
TGCTTCCTGGGTGGCAGCCACGCCCTTAGACGTGATCAAGTCCCGGATGCAGATGGAT 
GGACTGAGACGCAGAGTGTACCAGGGGATGCTGGACTGCATGGTGAGCAGCATCCGGC 




CGTC^TGCTGTCACCTTCCTCAGCTACGAATATCTCCTCCGCTGGTGGGGATGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 865 




SEQ ID NO: 82 


288 aa 


MWat31873.6kD 


NOV29a, 
CG94692-01 Protein Sequence 


MPVEEFVAGWI SGALGLVLGHPFDTVKVRLQTQTTYRG I VDCMVKI YRHES LLGFFKG 
MS FPI AS I AWNSVLFGVYSNTLLVLTATSHQERRAQPPS YMHI FLAGCTGGFLQAYC 
LAPFDLIKVRLQNQTEPRAQPGSPPPRYQGPVHCAASIFREEGPRGLFRGAWALTLRD 
TPTVGI YFI TYEGLCRQYTPEGQNPSSATVLVAGGFAG I ASWVAATPLDVI KSRMQMD 
GLRRRVYQGMLDCMVSS I RQEGLGVFFRGVTINSARAFPVNAVTFLSYEYIiLRWWG 




SEQ ID NO: 83 


867 bp 


"NOV29b 
CG94692-02 DNA Sequence 


ATG CCGGTGGAGG AATTTGTGGCTGGCTGGATCTCTGG CGCTCTGGG CTTGGTCCTGG 
GACACCCGTTTGACACTGTAAAGGTGAGGCTGCAGACCCAGACCACCTACCGGGGCAT 
CGTTGATTGCATGGTCAAGATTTACCGCCATGAGTCCCTCCTGGGCTTCTTCAAGGGA 
ATGAGCTTCCCCATTGCC^GCATAGCTGTGGTCAACTCTGTCCTGTTTGGGGTCTATA 
GC^VACACCCTGCTGGTGCTCACGGCCACCTCCCACCAGGAGCGGCGGGCCCAGCCGCC 
CAGCTAC^TGCACATCTTCCTAGCGGGCTGCACCGGGGGGTTCCTGCAGGCCTACTGT 
CTGGCTCCTTTTGACCTCATCAAAGTCCGGCTACAAAACCAGACAGAGCCAAGGGCCC 
AGCCAGGGAGCCCCCCACCCCGGTACCAGGGGCCCGTGCACrGTGCAGCCTCCATCTT 
CCGGGAGGAGGGGCCCCGGGGGCTGTTCCGAGGAGCCTGGGCCCTGACGCTGAGGGAC 
ACCCCCIACGGTGGGGATCTACTTCAT<ZACCTATGAAGGGCrCTGTCX5CCAGTACACAC 
CAGAAGGCCAGAATCCCAGCTCAGCCACGGTGCTGGTGGCAGGGGGCTTTGCAGGCAT 
TGCTTCOTGGGTGGCAGCGACGCCCTTAGACGTGATCAAGTCCCGGATGCAGATGGAT 
GGACTGAGACGCAGAGTGTACCAGGGGATGCTGGACTGCATGGTGAGCAGCATCCGGC 
AGGAAGGACTGGGAGTCTTCTTCCGGGGGGTCACC^TCAACAGTGCCCGCGCCrTTCC 
CGTC^TGCTGTCACCTTCCrC^GCrACGAATATCTCCTCCGCTGGTGGGGATGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 865 




SEQ ID NO: 84 


288 aa 


MWat31873.6kD 


NOV29b, 
CG94692-02 Protein Sequence 


MPVEEFVAGWI SGALGLVliGHP FDTVKVRLQTQTTYRG I VDCMVKI YRHE SLLGFFKG 
MS FP I AS I AWNSVLFGVYSNTLLVLTATS HQERRAQP PSYMH I FLAGCTGGFLQAYC 
LAPFDLIKVRLQNQTEPRAQPGSPPPRYQGPVHCAAS I FREEGPRGLFRGAWALTLRD 
TPTVG I YF ITYEGLCRQYTPEGQNPSS ATVLVAGGFAG I ASWVAATPLDVT KSRMQMD 
GLRRRVYQGMLDCMVS S IRQEG LGVFFRGVT INS ARAF PVNAVTFLS YE YLLRWWG 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 29B. 



Table 29B. Comparison of NOV29a against NOV29b. 


Protein Sequence 


NOV29a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 
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NOV29b 


1.288 


288/288(100%) 




1.288 


288/288 (100%) 



Further analysis of the NOV29a protein yielded the following properties shown in 
Table 29C. 



Table 29C. Protein Sequence Properties NOV29a 


PSort analysis: 


0.7900 probability located in plasma membrane; 0.6400 probability located in 
microbody (peroxisome); 0.3000 probability located in Golgi body; 0.2000 
probability located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV29a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 29D. 



Table 29D. Geneseq Results for NOV29a 


Geneseq 
Identifier 


. Protein/Organism/Length 
[Patent #, Date] 


NOV29a Residues/ 
Match Residues 


Identities/ 

Stmilaritip*; fnr th#» 
oiiiii icu iiico lui me 

Matched Region 


Expect 
Value 


AAE10330 


Human transporter and ion 
channel-7 (TRICH-7) protein - 

Homo sapiens, 288 aa. 
[WO200162923-A2, 30-AUG- 
2001] 


1-288 
1..288 


288/288(100%) 
288/288(100%) 


e-170 


ABG27643 


Novel human diagnostic 
protein #27634 - Homo 
sapiens, 346 aa. 
[WO2001 75067- A2, 11-OCT- 
2001] 


1..288 
59..346 


286/288 (99%) 
286/288 (99%) 


e-168 


j ABG27643 

1 
{ 
i 


Novel human diagnostic 
protein #27634 - Homo 
sapiens, 346 aa. 
[WO200175067-A2, 11-OCT- 
2001] 


1..288 
59..346 


286/288 (99%) 
286/288 (99%) 


e-168 


AAE16774 


Human transporter and ion 

channel- 11 (TRICH-11) 
protein - Homo sapiens, 1 8 1 
aa. [WO200192304-A2, 06- 
DEC-200I] 


113..288 
6..181 


176/176(100%) 
176/176(100%) 


e-101 


AAM39422 


Human polypeptide SEQ ID 
NO 2567 - Homo sapiens, 3 1 1 
aa. [WO200153312-Al,26- 
JUL-2001] 


3..285 
6..300 


144/295(48%) 
183/295 (61%) 


4e-73 
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In a BLAST search of public sequence datbases, the NOV29a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 29E. 



Table 29E. Public BLASTP Results for NOV29a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV29a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


08VBZ7 


HYPOTHFTICAI X) A 

1111 1 riLiJ Iv^Ab JZ..H 

KDA PROTEIN - Mus 
muscuius (Mouse), 294 aa 
(fragment). 


1 787 

7..293 


267/287 (92%) 


e> 1/17 

e- 1 h / 


CAC24997 


SEQUENCE 41 FROM 
PATENT WOO 100806 
PRECURSOR - Homo 
sapiens (Human), 308 aa. 


5. .285 
2..301 


128/301 (42%\ 
187/301 (61%) 


le-66 


Q922X4 


SIMILAR TO CG4995 
GENE PRODUCT - Mus 
muscuius (Mouse), 306 aa. 


5..286 
4..275 


117/288 (40%) 
166/288 (57%) 


Ie-53 


Q9VKZ5 


CG4995 PROTEIN 
(GH13054P)-Drosophila 
melanogaster (Fruit fly), 
399 aa. 


3..284 
41. .309 


112/287(39%) 
155/287 (53%) 


7e-51 


Q9AX03 


PUTATIVE 
CARNITINE/ACYLCARN 
ITINE TRANSLOCASE - 
Oryza sativa'(Rice), 322 aa. 


4..286 
14..307 


119/300 (39%) 
166/300 (54%) 


3e-50 



PFam analysis predicts that the NOV29a protein contains the domains shown in 
the Table 29F. 



Table 29F. Domain Analysis of NOV29a 


Pfem Domain 


NOV29a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mito_carr 


1..90 


31/125(25%) 
72/125 (58%) 


5.1e-22 


mito_carr 


98..198 


34/126(27%) 
84/126 (67%) 


7.1e-29 


mito_carr 


200..288 


33/125 (26%) 
75/125 (60%) 


7.1e-24 
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Example 30. 

The NOV30 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 3 OA. 



Table 30A. NOV30 Sequence Analysis 




SEQ ID NO: 85 


947 bp 


NOV30a, 
CG94724-01 DNA Sequence 


ACTGACCATGGCCGACCAGCCAAAGCCCATCAGCCTGCT CAAGAACCTG CTGG CCAGT 
GGCTTTGGTGGCATGTGCCTGGTGTTCATGGGGCACCCTCTGGACACTGTCAAGGTCC 
GACTGCAGACACAGCCACCGAGTTTGCCTAGACAGCCTCCCATGTACTCTGGGACCTT 
TGACTCTCTCCCAAAGACTCTTAGAAGGGACATCACAGGGCTATATAAGGGAATGGCT 
GCCCCTATCATCGGTGTCACCCCCATCTTTGCTGTGTGCTTCTTTGGGTTTGGTTTGA 
GGAAGAAACTAGGACAGAAACACCCAGAAGATGTGCTCAGCTATCCCCAGCTGTTTGC 
AGCTGGGATGTTATCTGGTACATTCACCACAGGCATCGCGACCCCTGGAGAACCCATC 
AAGTCCTTGTTACACTTTCAGCCTTCTTCAGGGGAAACCAAGTACACTGGAACCTTGG 
ACTGTGCAAAGAAGTTGTACCAGGAGTTTCAGATCCGAGGCATCTACAAAGGGACCGT 
GCrTACCCTTATGAGAGACGTTCCAGCTAGCGGAACGTATTTCATGACAAATGAATGG 
CTAAAAAATATCTTCACTCCGGAAGGAAAGAGGGTCTGTGAG CTCAGTGTG CCTAGAA 
TCCTGGTGGCTGGGTGCATTGCAGGGATCTTCAACTGGGCAATGGCAGTCCCGCAAGA 
TGTGCTCAAGTATCCCTTCCAGACGGCACCTCCTGGGAAATATCCTAACAGTTTTGGA 
GATGTGCTGAGGGAGCTGATCTGGGATGAAGGAATCACATCCTTGTCTAAAGGGTCTG 
ATGCAGTGATGACCCGAGCCTTCCCAGCCAATGCAGCCTGTTTCCTTGGCCTTGAAGT 
TGCCATGAAGTTTCTTAATTGGGCCACCCCCAACTTGTGAGGCAGAAGGCTGCTCAAG 
ACTTCTGG ATG CTGG AACT 




ORP Start: ATG at 8 


ORF Stop: TGA at 908 




SEQ ID NO: 86 


300 aa 


MW at 32855. lkD 


NOV30a, 
CG94724-01 Protein Sequence 


MADQPKPISIiKNIJ^GFGGMCLVFMGHPLDTVKVRLQTQPPSLPRQPPMySGTFDS 
LP KTLRRD I TGL YKGMAAP 1 1 G VTP I FAVCFFGFGLRKKLGQKHPEDVLSYPQLFAAG 
MLS GTFTTG I ATPGE PI KS LLHFQPS SG ETKYTGTLDCAKKLYQE FQ I RG I YKGTVLT 
LMRDVPASGTYFMTNEWLKNI FTP EG KRVCELSV PRI LVAG C I AG I FNWAMAVPQDVL 
KYPFQTAPPGKYPNSFGDVLRELIWDEGITSLSKGSDAVMTRAFPANAACFLGLEVAM 
KFLNWATPNL 



Further analysis of the NOV30a protein yielded the following properties shown in 
Table 30B. 



Table 30B. Protein Sequence Properties NOV30a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.1000 
probability located in mitochondrial inner membrane 


SignalP analysis: 


"No Known Signal Sequence Predicted 



A search of the NOV30a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 30C. 
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Table 30C. Geneseq Results for NOV30a 


Geneseq 
Identifier 

! 


Protein/Organism/Length 
[Patent #, Date] 


NOV30a Residues/ 
Match Residues 


lucniiiics/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


| AAM79434 

1 


Human protein SEQ ID NO 
3080 - Homo sapiens, 3 1 8 aa. 
[WO200157190-A2, 09-AUG- 
. 2001] 


1 ..300 
18..318 


zjo/JUI \oD /o) 

268/301 (88%) 


e-146 


AAM78450 


Human protein SEQ ID NO 
1 112 - Homo sapiens, 3 1 8 aa. 
[ WO200 1571 90-A2, 09-AUG- 
2001] 


1..300 

1 O..J 1 o 


258/301 (85%) 
zoo/jUI \oo/o) 


e-146 


AAYx5740 


Human secreted protein 
encoded from gene 30 - Homo 
sapiens, 228 aa. [W0993888I- 
AI,05-AUG-1999] 


74.300 
1..227 


196/227 (86%) 
202/227 (88%) 


e-110 


ABB59582 


Drosophila melanogaster 
polypeptide SEQ ID NO 5538 - 
Drosophila melanogaster, 306 
aa. [WO200171042-A2, 27- 
SEP-2001] 


11. .299 
I6..305 


137/291 (47%) 
190/291 (65%) 


le-73 


ABB59928 


Drosophila melanogaster 
polypeptide SEQ ID NO 6576 - 
Drosophila melanogaster, 299 
aa. [ WO200 1 7 1 042-A2, 27- 
SEP-2001] 


11. .293 
15..297 


121/285 (42%) 
172/285 (59%) 


7e-57 



In a BLAST search of public sequence datbases, the NOV30a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 30D. 
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Table 30D. Public BLASTP Results for NOV30a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV30a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 




Mitochondrial 
carnitine/acylcarnitine carrier 
protein (Carnitine/acylcarnitine 
translocase) (CAC) - Homo sapiens 
(Human), 301 aa. 


1..j00 
1..301 


258/jOI (85%) 
268/301 (88%) 


e-146 


Q9Z2Z6 


MCAC PROTEIN - Mus musculus 
(Mouse), 301 aa. 


1..300 
I. .301 


243/301 (80%)' 
257/301 (84%) 


e-138 


P97521 


Mitochondrial 
carnitine/acylcarnitine carrier 
protein (Carnitine/acylcarnitine 
translocase) (CAC) - Rattus 
norvegicus (Rat), 301 aa. 


1 .JuU 

1..301 


254/301 (83%) 


e-i j4 


Q9VQG4 


COLT PROTEIN (GM13207P) - 
Drosophila melanogaster (Fruit 
fly), 306 aa. 


11. .299 
16..305 


137/291 (47%) 
190/291 (65%) 


3e-73 


001396 

i 


COLT PROTEIN (CONGESTED- 
LIKE TRACHEA PROTEIN) - 
Drosophila melanogaster (Fruit 
fly), 306 aa. 


11. .299 
16..305 


136/291 (46%) 
188/291 (63%) 


4e-72 



PFam analysis predicts that the NOV30a protein contains the domains shown in 
the Table 30E. 



Table 30E. Domain Analysis of NOV30a 


Pfam Domain 


NOV30a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mito_carr 


9..104 


37/125 (30%) 
79/125 (63%) 


3.7e-23 


mito_carr 


108..202 


34/325(27%) 
76/125 (61%) 


2.4e-19 


mito_carr 


207.299 


28/125(22%) 
70/125 (56%) 


1.6e-ll 



Example 31. 



The NOV31 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 31 A. 
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Table 31 A. NOV31 Sequence Analysis 




SEQIDNO:87 | H71 bp 


NOV31a, 
CG94871-01 DN A Sequence 


GTTGGCTCCAGACAAATAAACAT^ARTrraTr^^arr^r!* ^A^Amn?VGGCTCAC 
TTTGTGCTCAACATTGCCTGAATAA.CTTATTGCAAGGAGAATATTTTAGCCCTGTGGA 
ATTATCCTCAATTG CACATCAGCTGGATG AGGAGGAGAAGATG AG AATGGCAGAAG G A 
GGAGTTACTAGTGAAGATTATCGCACGTTTTTACAGCAGCCTTCTGGAAATATGGATG 
ACAGTGGTTTTTTCTCTATTCAGGTTATAAGCAATGCCTTGAAAGTTTGGGGTTTAGA 
ACTAATCCTGTTCAACAGTCCAGAGTATCAGAGGCTCAGGATCGATCCTATAAATGAA 
AGATCATTTATATGCAATTATAAGGAACACTGGTTTACAGTTAGAAAATTAfSRa aa an 
^TGGTT^CTTGAATTCTCTCT 




GATCTGCCAGATTG CG AAGCTCACCAACTCOTGCA 

ATCGACCAAAACTTATTGGAGAAGAATTAGCACAACTAAAAGAGCAAAGAGTCCATAA 
AACAGACCTGGAACGAGTGTTAGAAGCAAATGATGGCTCAGGAATGTTAGACGAAGAT 
GAGGAGGATTTGCAGAGGGCTCTGGCACTAAGTCGCCAAGAAATTGACATGGAAGATG 
AGGAAGCAGATCTCCGCAGGGCTATTCAGCTAAGTATGCAAGGTAGTTCCAGAAACAT 
ATCTCAAGATATGACACAGACATCAGGTACAAATCTTACTTCAGAAGAGCTTCGGAAG 
AGACGAGAAGCCTACTTTGAAAAACAGCAGCAGCAGCAGGGGGACCTATCAGGACAGA 
GTTCACATCCATGTGAAAGGCCAGCCACCAGTTCAGGAGCACTTGGGAGTGATCTAGG 
TGATGCTATGAGTGAAGAAGACATGCTTCAGGCAGCTGTGACCATGTCTTTAGAAACT 
GTCAGAAATGATTTGAAAACAGAAGGAAAAAAATAATACCTTTAAAAAATAATTTAGA 


TATTCATAC1TTCCAACATTATCCTGTGTGATTACAGCATAGGGTCCACTTTGGTAAT 
GTGTCAAAGAG 




ORF Start: ATG at 22 


ORF Stop:TAA at 1078 




SEQ ID NO: 88 


352 aa MW at 40068.4kD 


NOV31a, 
CG94871-01 Protein Sequence 

1 


MESIFHEKQEGSLCAQHCl^LLOGEYFSPVELSSIAHQLDEEEKMRMAEGGVTSEDY 

RTFLQQPSGNMDDSGFFSIQVISNALKVWGLELILPNSPEYQRLRIDPINERSFICNY 

KEHWFTVRKIXSKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIl^KGDLPDC 

DQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDIjQRA 

LALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNIiTSEELRKRREAYFE 

K0X3O^GDLSGQSSHPCi^PATSSGAI^SDI^DAMSEEDMLQAAVTMSLErVRNDLKT 
EGKK 


I 


SEQ ID NO: 89 | 1219bp 


NOV31b, 
CG94871-02 DNA Sequence 


GTTGGCTCCAGACAAATAAACATGGAGTCCATrTTr rarr.ana n nr-*^ A^TCAC 
TTTGTGCTCAACATTGCCTGAATAACTTATTGCAAGGAGAATATTTTAGCCCTGTGGA 
AITATCCTCAATTGCACATCAGCTGGATGAGGAGGAGAGGATGAGAATGGCAGAAGGA 
GGAGTTACTAGTGAAGATTATCGCACGTTTTTACAGCAGCCTTCrrGGAAATATGGATG 
AC^GTGGTTTTTTCTCTATTCAGGTTATAAGCAATGCCTTGAAAGTTTGGGGTTTAGA 
ACTAATC CTGTTCAACAGTCCAGAGTATCAGAGG CT CAGGAT CG ATCCT ATAAATGAA 
AGATCATTTATATGCAATTATAAGGAACA.CITGGTTTACAGTTAGAAAATTAGGAAAAC 




GATCTGCCAGATTGCGAAGCTGACCAACTCCTGCAGATGATTAGGGTCCAAC^ 

ATCGACCAAAACTTATTGGAGAAGAATTAGCACAACTAAAAGAGCAAAGAGTCCATAA 

AAC CGAC CTGG AACG AATGTTAG AAG CAAATGATG GCTC AGG AATGTT AG ACG AAG AT 

GAGGAGGATTTGCAGAGGGCTCTGGCACTAAGTCGCCAAGAAATTGACATGGAAGATG 

AGGAAGCAGATCTCCGCAGGGCTATTCAGCTAAGTATGCAAGGTAGTTCCAGAAACAT 

ATCTCAAGATATGACAC^GAC^TCAGGTACAAATCTTACTTCAGAAGAGCTTCGGAAG 

AGACGAGAAGCCTACTTTGAAAAACAGCAGCAAAAGCAGCAACAGCAGCAGCAGCAGC 

AG CAG CAGCAGCAG CAGCAG CAGCAGCAGCGGGACCTAT CAGGACAGAGTTCACATCC 

ATGTGAAAGGCCAGCCACCAGTTCAGGAGCACTTGGGAGTGATCTAGGTGATGCTATG 

^GTGAAGAAGACATGCTTCAGGCAGCTGTGACCATGTCTTTAGAAACTGTCAGAAATG 

^TTTGAAAACAGAAGGAAAAAAATAATACCTTTAAAAAATAATTTAGATATTCATACT 

l^CCAACATTATCCTGTCTGATTACAGCATAGGGTCCACTTTGGTAATGTGTC 




ORF Start: ATG at 22 


ORF Stop:TAA at 1126 




SEQ ID NO: 90 


368 aa 1 MW at 42277.8kD 


NOV31b, ^ 

CG94871-02 Protein Sequence 1 

i 

r 
i 

¥ 


lESIFHEKQEGSLC^QHCliNNLLG^EYFSP 

ITFLQQPSGNMDDSG FFS I QVI SNAIiKVWGLELILFNS PEYQRLRIDP INERSFI CNY 
CEHWFTVRKIjGKQWFNI^SIjLTGPELISDTYIiALFIjAQLQQEG YS I FWKGDLPDCEA 
>QI^QMIRVQQMHRPKLIGEEIAQLKBQRTO^ 

*ALSRQEIDMEDEEADLRRAIQLSMQ^SSRNISQDMTO/TSGTNLTSEELRKRREAYFE 

^QQQKQCXJO^X20^X3QOX3CX3QQQRDLSGQSSHPCERPATSSGALGSDLGDAMSEED^Q 
AVTMSLETVRNDLKTEGKK 
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SEQIDNO:91 


1153 bp 


NOV31c, 
CG94871-04 DNA Sequence 


GTTGGCTCCAGACAAATAJUVCATGGAGTCC^TCTTCCACGAGAAACAAGAAGGCTCAr 
TTTGTGCTCAACATTGCCTGAATAACTTATTGCAAGGAGAATATTTTAGCCCTGTGGA 
ATTATCCTCAATTGCACATCAGCTGGATGAGGAGGAGAGGATGAGAATGGCAGAAGGA 
GG AGTT ACTAGTG AAGATTAT CG CACGTTTTT ACAGGTT AT AAG C AATGCCTTG AAAG 
TTTGGGGTTTAGAACTATT'CCTGTTCAACAGTCCAGAGTATCAGAGGCTCAGGATCGA 

TCP TATft 21 £l TH Ji 2i & fZ & r r<" , 21 , T w rT21 TH TP /"•A 7V TTI T A TV Clf* 7\ 7\ 1\ PTf /*" i"'i"i'T\ e~< tv i-imrnii tv 

<* ^-*-x«,A*4«*tivjMM/*.ij*vi uu i, l^vl/il i a 1 AAiia^ AACACTLjG 1. T 1 ACAGTTAGA 
AAATTAGGAAAACAGTGGTTTAACTTGAATTCTCTCTTGACGGGTCCAGAATTAATAT 
CAG AT AG ATATCT TG CACTTTT CTTGG CT C AATT ACAAC AGG AAGGTTATTCT AT ATT 
TGTCGTTAAGGGTGATCrGCCAGATTGCGAAGCTGACCAACTCCTGCAGATGATTAGG 
GTCCAACAGATGCATCGACCAAAACTTATTGGAGAAGAATTAGCACAACTAAAAGAGC 
AAAGAGTCCATAAAACAGACCTGGAACGAGTGTTAGAAGCAAATGATGGCTCAGGAAT 
GTTAGACGAAGATGAGGAGGATTTGCAGAGGGCTCTGGCACTAAGTCGCCATGAAATT 
G ACATGG AAG ATG AGGAAGCAG ATCTCCG CAGGGCTATTCAG CT AAGT ATGCAAGGTA 
GTTCCAGAAAC^TATCTCAAGATATGACACAGACATC 

AGAGCTTCGGAAGAGACGAGAAGCCTACTTTGAAAAACAGCAGCAAAAGCAGCAACAG 
CAGCAGCAG C AGCAG CAGCAGGGGGACCTATCAGGAC AG AGTTCACATCCATGTGAAA 
GGCCAGCCACCAGTT CAGGAGCACTTGGG AGTGATCTAGGTGATG CT ATGAGTGAAGA 
AGACATGCTTCGGGCAGCTGTGACCATGTCTTTAGAAACTGTCAGAAATGATTTGAAA 
A(^GAAGGAAAAAAATAATACCTTTAAAAAATAATTTAGATATTCATACTTTCCAACA 
TTATCCTGTGTGATTACAGOVTAGGGTCCACrTTGGTAATGTGTC^VAAGAG 




ORF Start: ATG at 22 


ORF Stop: TAA at 1060 




SEQ ID NO: 92 


346 aa 


MWat39764.1kD 


NOV31c, 
CG94871-04 Protein Sequence 


MES I FHEKQEGSLCAQHCLNNLLQGEYFS PVELSSIAHQLDEEERMRMAEGGVTSEDY 
RTFLQVISNALKVWGLELFLFNS PEYQRLRIDPI NERSFI CNYKEHWFTVRKLGKQWF 
I^LNSLLTGPELISDRYLALFLAQliQQEGYS I FWKGDLPDCEADQLLQM I R VQQMHRP 
KLIGEELAQIjKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRAIALSRHEIDMEDEEA 
DLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQQQQQQ 
GDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLRAAVTMSLETVRNDLKTEGKK 




SEQ ID NO: 93 | 1000 bp 


NOV31d, 
CG94871-05 DNA Sequence 


GTTGGCTCCAGACAAATAAACATGGAGTCCATCTTCCACGAGAAACAAGAAGGCTCAC 

TTTGTGCTCAACATTGCCTGAATAACTTATTGCAAGGAGAATATTTTAGCCCTGTGGA 

ATTATCCTCAATTGCACATCAGCTGGATGAGGAGGAGAGGATGAGAATGGCAGAAGGA 

GGAGTTACTAGTGAAGATTATCGCACGTTTTTACAGTGGTTTAACTTGAATTCTCTCT 

TGACGGGTCCAGAATTAATATCAGATACATATCTTGCACTTTTCTTGGCTCAATTACA 

ACAGGAAGGTTATTCTATATTTGTCGTTAAGGGTGATCTGCCAGATTGC^GAAGCTGAC 

CAACTCCTGCAGATGATTAGGGTCGAACAGATGC!ATCGACCAAAACTTAT^ 

AATTAG CACAACTAAAAGAG CAAAG AGT C C AT AAAACAG ACCTGGAACGAGTG TTAGA 

AG CAAATGATGGCTCAGGAATGTTAG ACG AAG ATG AGGAGGATTTG CAGAGGGCTCTG 

GCACTAAGTCGCCAAGAAATTGACATGGAAGATGAGGAAGCAGATCTCCGCGGGGCTA 

TTCAG CTAAG TATG CAAGGT AGTTC C AG AAACATAT CTCAAG ATATGACACAG ACAT C 

AGGTACAAAT CTTAC TT CAGAAG AG CTT CGG AAG AG ACG AGAAGC CTACTTTG AAAAA 

CAG CAG CAAAAGC AG C AACAG CAGC AG CAG CAG CAG CAGCAGGGGGAC CT AT CAGGAC 

AGAGTTCACATCCATGTGAAAGGCCAGCCACCAGTTCAGGAGCACTTGGGAGTGATCT 

AGGTGATGCrATGAGTGAAGAAGACATGCTTCGGGCAGCTGTGACCATGTCTT^ 

ACTGTCAG AAATG ATT TG AAAACAGAAGG AAAAAAATAAT AC CTTTAAAAAATAATTT 

AGATATTCATACTTTCCMCATTATCCTGTGTGATTACAGCATAGGGTCCACrTTT 

AATGTGTCAAAGAG 




ORF Start: ATG at 22 ORF Stop: TAA at 907 




SEQ ID NO: 94 


295 aa 


MW at 33420.8kD 


NOV31d, 
CG94871-05 Protein Sequence 


MESIFHEKQEGSLC^QHCXNNLLC^EYFSPVEL^ 

RTFI^WFNLNSLLTGPELISDTYLALFLAQLQQEGYS I FWKGDLPDCEADQLLQMIR 
VQ^MHRPKLIGEEIAQLKEQRVHKTDLERVLEANDGSGMLDEDEEX>LQRALAliSRQEI 
DMEDEEADLRGAIQI^MOGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQ 
QQQQQQX2GDLSGQSSHPCERPATSSGAIX3SDLGDAMSEEDMLRAAVTMSLETVRNDLK 
TEGKK 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 3 IB. 
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Table 31B. Comparison of NOV31a against NOV31b through NOV31d. 


Protein Sequence 


NOV3 la Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV31b 


1..352 
1..368 


343/368 (93%) 
345/368 (93%) 


NOV31c 


K.352 
I..346 


326/361(90%) 
328/361 (90%) 


1 NOV31d 

\ 


129..352 
63..295 


21.6/233 (92%) 
217/233(92%) 



Further analysis of the NOV31a protein yielded the following properties shown in 
Table 3 1C. 



Table 31C. Protein Sequence Properties NOV31a 


PSort analysis: 


0.3000 probability located in nucleus; 0.1000 probability located in mitochondrial 
matrix space; 0. 1000 probability located in lysosome (lumen); 0.0000 probability 
located in endoplasmic reticulum (membrane) 


j SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV3 1 a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 3 1 D. 
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Table 31D. Geneseq Results for NOV31a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV3 la Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAY33492 


Human Machado-Joseph 
disease associated protein - 

Hnmn c^nipiu- 1Af\ ao 

[W09945944-A1, 16-SEP- 
1999] 


1.326 
1..347 


321/347 (92%) 
323/347 (92%) 


0.0 


AAR96128 


Human Machado-Joseph 
disease-related protein - Homo 

b<ipiciii>, Zyi aa. iJrvoVyZZoy- 

A, 09-APR-1996] 


1..291 
1..291 


289/291 (99%) 
291/291 (99%) 


e-166 


ABG22866 


Novel human diagnostic 
protein #22857 - Homo 
sapiens, 454 aa. 
[WO200175067-A2, 11-OCT- 
2001] 


I29..291 
292..454 


163/163 (100%) 
163/163 (100%) 


2e-87 


ABG22866 

i 
I 


Novel human diagnostic 
protein #22857 - Homo 
sapiens, 454 aa. 
[WO200175067-A2, 1 1-OCT- 
2001] 


129..291 
292..454 


163/163(100%) 
163/163 (100%) 


2e-87 


! ABG 17452 

i 
1 


Novel human diagnostic 
protein #17443 - Homo 
sapiens, 731 aa. 
[WO200175067-A2, 11-OCT- 
2001] 


1 55.32 1 
16..191 


163/176 (92%) 
165/176 (93%) 


le-85 



In a BLAST search of public sequence datbases, the NOV3 la protein was found to 
have homology to the proteins shown in the BLASTP data in Table 3 IE. 
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Table 31E. Public BLASTP Results for NOV31a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV31a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q96TC4 


ATAXIN-3 - Homo sapiens 
(Human), j61 aa. 


1..352 
1..361 


351/361 (97%) 
352/361 (97%) 


0.0 


i 015284 


JOSEPHIN MJDI - Homo 
sapiens (Human), 361 aa. 


1..352 
I..36I 


350/361 (96%) 
351/361 (96%) 


0.0 


015286 


JOSEPHIN MJD1 - Homo 
sapiens (Human), 373 aa 
(fragment). 


2..352 
1..373 


347/373 (93%) 
349/373 (93%) 


0.0 


Q96TC3 


ATAXIN-3 - Homo sapiens 
(Human), 364 aa. 


1..326 
1..335 


323/335 (96%) 
324/335 (96%) 


0.0 


015285 


JOSEPHIN MJD1 - Homo 
sapiens (Human), 364 aa. 


I. .326 
1..335 


322/335 (96%) 
323/335 (96%) 


0.0 



PFam analysis predicts that the NOV3 la protein contains the domains shown in 
the Table 3 IF. 



I 1 — 

Table 31F. Domain Analysis of NOV31a 



Pfam Domain 


NOV3 la Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Josephin 


1-198 


110/209 (53%) 
198/209(95%) 


2.8e-139 


UIM 


223..240 


11/18(61%) 
17/18(94%) 


0.011 


UIM 


243..260 


8/18(44%) 
16/18(89%) 


0.0044 


UIM 


325..342 


9/18(50%) 
17/18(94%) 


0.02 



Example 32. 



The NOV32 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 32A. 
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Table 32A. NOV32 Sequence Analysis 



SEQID NO: 95 



6224 bp 



NOV32a, 
CG94946-01 DNA Sequence 



CCGGCGCGGCCCGCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 
CACCCGGGCCCGCTGCGGGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 
CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 
G CGAACGTGGTGCTCACCGGG ACGGTGG AGG AG ATCCTCAACGTGG ACCCGGTG CAGC 
ACACGTACrCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 
CCGGG AGAG CCTGCTGGACGGCGGCAACAAGGTGGTG ATCAG CGGCTTTGG AGACCCC 
CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
C^CGCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCAT 
GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACG CCTCCTGATGCGTGC CGGGG AATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 
CAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACC 
TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 
CGGCAGCJ^CCTGTGC^CGCrCGGCCX3ACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 
ACCTGCCGTGG CGCCC CCGAGGGG ACCGTCTG CGGCAG CGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCG CCTG CGCCCGCCAGGAGAATGTCTTCAAGAAGTT 
CGACGGCCCI^GTGACCCCTGTCAGGGCGCCCTCCCrcACCCGAGCCG(^GCTGCCGT 
GTGAACC CG CG CACGCGG CGCCCTGAGATGCT CCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 
CCGATCGGGGGCCGCCCGGGGTCTCCTCCTGCAGAAAGTGCGCTCCGGCCAGTGCCAG 
GGTCGAG ACCAGTGCCCGGAGCCCTGCCGGTT CAATGCCGTGTG CCTGTCCCGC CGTG 
GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 
TGCCCAGGACGGG CGCACGTATGACAGTGATTGCTGGCGG CAGCAGGCTG AGTG CCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
TCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGGAGCCCT 
GTG CG AGG CCG AGACCGGG CGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGGCCCAG 
CCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACG 
CCTGCAC^CACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGTGAGACCTGTGG 

TGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTG 
CCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC 
AGGGCCGTGCGAGCAGGCCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGACGGT 
GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGG 
ACGGGCCGTGTGTCTGTGACTTCS^CTGCCAGAGTGTCCCJIGGCAGCCCGGTGTGCGG 
CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA 
CAG CGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGG CCCCG CCTTCGCCCCGC 
TGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAA 
TATCACCGCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAAC 
CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 
lGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGG 
CATCGTCACCGATGGCCGGAGTGGCTGTACACCCIX3CAGCTGTGATCCCCAAGGCGCC 
GTGaMGATGACTGTGAGCAGATGACGGGGCrGTGCTCGTGTAAGCCCGGGGTGGCTG 
GACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGG CCCCG CGGGCTGTGAAGC 
TGACGCTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGC 
GTGG AG G AGTCTGG CT CAG CC CACTGTGTCTG CCCG ATGCT CACCTGT CCAG AGGCCA 
ACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 
GACC^TCGCCTGCCGCC^GGGCCTGa^TCTCTATCCAGAGCCTGGGCCCGTGCCAG 
GAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAG 
GGCTCCTCCTGAGCCAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAG 
C^GTACCGCAC^CAGCCAGACCACCCCTCCGCCCTCATCGCGACCrCG^ 
AGCX5TCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCC 
CTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGG 
CGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCT^ 
CCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCXSAGAGGGCT^ 
ACAACTCCGCGTTGGGCTG CTGCTCTGATGGGAAGACGCCCTCGCTGGACG CAGAGGG 
CTCC^CTGCCCCGCCACCAAGGTGTTCCAGGGCGTCCTGGAGCTGGAGGGCGTCGAG 
GGCCAGGAGCTGTTCTACACX3CC<X!AGATGGCT 

AG ACAG C CAGGAG CATTG AG AGCACCCTGGACG AC CTCTTCCGG AATTCAGACGTCAA 



ATTGTGGATGTGCACTTTX5ACCCCACCACAGCCrTCAGGGCACCCGACX3 
CCCTGCrrCCGGCAGATCCAGGTGTCCAGGCGCOSGTCCTTGGGGGTGAGGCGGCCGCT 
GCAGGAG CACGTG CGATTT ATGGACTTTG ACTGGTTTCCTG OTTTTATC^ 
ACGTCAGGAGCC^TTGCTGCGGGAGC!CACGGCCyuSAGC!CA 
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CGTCCTCrGCTGTGACCCCTCGGGCCCCGCACCCCAGTCACAOiAGCCAGCCCGTrGC 

C^GACCACX^CAGCCCCCACCACACGTCGGCCCCC^ 

CCCGGfcCGTCGGCCCCCGGCCCCCCAGCAGCCT^ 

GCTTCC^CGGGGGGACCTGCCAGGACTGGGCATTGGGCGGGGGCTTCACCTGCAGCTG 

TTCGAGGGCCGCTCCTTCCTGGCCTTCCCCACCCTCCGCGCCTACCACACGCTGCGCC 
TGGCACTGGAATTCCGGGCGCTGGAGCCTCAGGGGCTGCTGCTGTACAATGGCAACGC 
CCGGGGCAAGGACTTCCTGGCATTGGCGCTGCTAGATGGCCGCGTGCAGCTCAGGTTT 
GACACAGGTTCGGGGCCGGCGGTGCTGACCAGTGCCGTGCCGGTAGAGCCGGGCCAGT 
GGCACCGCCTGGAGCTGTCCCGGCACTGGCGCCGGGGCACCCTCTCGGTGGATGGTGA 
GACCCCTGTTCTGGGCGAGAGTCCCAGTGGCACCGACGGCCTCAACCTGGACACAGAC 
CTCTTTGTGGGCGGCGTACCCGAGGACCAGGCTGCCGTGGCGCTGGAGCGGACCTTCG 
TGGGCGCCGGCCTGAGGGGGTGCATCCGTTTGCTGGACGTCAACAACCAGCGCCTGGA 
GCTTGGCATTGGGCCGGGGGCTGCCACCCGAGGCTCTGGCGTGGGCGAGTGCGGGGAC 
CACCC CTGCCTG CCCAACCCCTGCCATGGCGGGG CCCCATGCCAG AACCTGGAGGCTG 
GAAGGTTCCATTGCCAGTGCCCGCCCGGCCGCGTCGGACCAACCTGTGCCGATGAGAA 
GAGCC CCTGCCAGCCCAACCCCTGCCATGGGGCGGCGCCCTGCCGTGTG CTGCCCGAG 
GGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCCAGACAGCCT 
CGGGGCAGGACGGCTCTGGGCCCTTCCTGGCTGACTTCAACGGCTTCTCCCACCTGGA 
GCTGAGAGGCCTGCACACCATTGCACGGGACCTGGGGGAGAAGATGGCGCTGGAGGCC 
GTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAGAAGACGGACG 
GCAAGGGGGACTTCGTGTCGCTGGCACTGCGGGACCGCCGCCTGGAGTTCCGCTACGA 
CCTGGGCMGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGAGCCTGG 
ACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGACGGCC 

GAAGGAGCCGCTCTACGTAGGGGGCGCTCCCGACITCAGCAAGCTGGCCCGTGCTGCT 
GCCGTGTCCTCTGGCTTCGACGGTGCCATCCAGCTGGTCTCCCTCGGAGGCCGCCAGC 
TGCTGACCCCGGAGCACGTGCTGCGGCAGGTGGATOTCACGTCCTTTGCAGGTCACCC 
CTGCACCCGGGCCTCAGGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAG 
GCTGCCTATGTGTGCCTGTGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGC 
TGGTGGAGAAGTCAGCGGGGGACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGT 
CGAGTACCTCAACGCTGTGACCGAGAGCGAGAAGGCACTGCAGAGCAACCACTTTGAA 
CTGAGCCTGCGCACTGAGGCCACGCAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGG 
AGCGGGCAGACTATGTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAA 
CCTGGGCTCCCAGCCCGTGGTGCTGCGTTCCACCGTGCCCGTCAACACCAACCGCTGG 



CCCCTGTGACCGGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCT 
GTGGCTTGGGGGCCTGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGC 
ACAGGCTTTGTGGGCTGCTTGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCACCTGC 
TGGAGGACGCCGTCACCAAG C CAGAGCTG CGGCCCTGCCCCACCCCATGAGCTGGCAC 
CAGAGCCCCGCGCCCGCT 



ORF Start: ATG at 37 



ORF Stop; TGA at 6196 



SEQIDNO: 96 



2053 aa M W at 2 1 5628.0kD 



NOV32a, 
CG94946-01 Protein Sequence 



MRHGRPVP PGPAAGRPLLPLLWAACVLP GAGGTCP ERALERREEEANWLTGTVE E I 

LNVDPVQHTYS CKVRVWRYLKGKDLVARE SLLDGGNKVVISGFGDPL ICDNQVSTGDT 

RIFFVNPAPPYLWPAHKNELMLNSSIjMRI TLRNLEEVEFCVEDKPGTHFTPVPPTPPD 

ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSWAPVCXSSDASTySNECELQRAQC 

SO^RRIIUjLSRGPTOSRDPCSim'CSFGSTaUlSADGLTASCIiCPATCRGAPEGTVCG 

SDGADYPGBCQLLRRACARQENVFKKFDGPC35PCQGALPDPSRSCRYNPRTRRPEMLL 

RPESCPARQAPVCGDDGVTYEiroCVMGRSGAARGLLLQKVRSGO^O^RDQCPEPCRTO 

AVCLSRRGRPRCS CDRVTCDGAYRP VC AQDGRTYDSDCWRQQAECRQQRAI PS KHQGP 

CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSIiYDPVCGSDGVTYGSACELEAT 

ACTLGRE IQVARKGPCDRCGQCRFGALCEAETGRCVCPS ECVAIiAQPVCGSDGHTYPS 

ECMLHVHACTHQI SLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPP PGPVCGS 

DGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 

I WDEDS EDGPCVCDFSCQS VPGS PVCGSDGVTYSTECELKKARCES QRGLYVAAQGAC 

RGPAFAPLPPVAPLHCAQTpyGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPA 

TGQCSCllPGVGGLRCDRCEPGFmFRGIVlTCRSGCrPCSCTlPC^VRDDCEQMTGLC 

S CKPGVAGPKCGQCPDGRALG PAGCEADASAPATCAEMRCEFGARCVEESGS AHCVCP 

MLTCPEANATKVCGSDGVTYGNE CQLKTI ACRQGLQIS IQSLGPCQEAVAPSTHPTS A 

SVTVTTPGLLLSQALPAPPGALPI^SSTAHSQTTPPPSSRPRTTASVPRTTVWPVLT 

VPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGP 

PVERASCYNSAIX5CCSDGKTPSIJ)AEGSNCPATKVFQGVLELEGVEGQELFYTPEMAD 

PKSELFGETARSIESTLDDLFTWSDVKKDFRSVRLRDIX3PGKSVRAIVDVHFDPTTAF 

RAPDVARALLRQI Q VS RRRS LGVRR PLQ EHVRFMDFDWF PAF ITG ATSG Al AAG ATAR 

ATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPPAPQQPP 

KPCDSQPCFHGGTCQDWAI^GGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTL 

RAYHTLRIJU^EFRALEPQGLLLYNGNARGKDFIJUjALI^^ 

VPVEPGQWHRLELSRHV^RGTLSVDGETPVLGESPSGTIXSLtn^ 

VALERT FVGAGLRGC I RLLDVNNQRLELG I G PGAATRGSGVGECGDHPCL PNP CHGGA 

PCQNLEAGRFHCOjCPPGRVG PTCADEKS PCQPNPCHGAAPCRVLPEGGAQCECPLGRE 
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GTFCQTASGQDGSGPFI^FNGFSHLEI^GI^TIARDLGEKMALEAVFUUIGPSGLLL 
YNGQKTIXSKGDFVSIALRDRRLEFRYDLGKGA^ 

ALRVGDG PRVIX3ES PKSRKVPHTVLNLKEPLYVGGAPDFSKLARAAAVSSGF^ 

VSLGGRQLLTPEHVLRQVDVTSFTlGHPCTRASGHPCIiNGASCVPREAAYVCLCPGGFS 

GPHCEKGLVEKSAGDVDTIAFDGRTFVEYLNAVTESEKALQSNHFELSLRTEATQGLV 

LWSGKATERADYVAIAIVDGHLQLSYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREG 

SLQVGNEAPVTGS S PLGATQLDTDG ALWLGGliPELPVGPALPKAYGTG FVGCLRDWV 

GRHPLHLLEDAVTKPEI*RPCPTP 



SEQ ID NO: 97 



4760 bp 



NOV32b, 
CG94946-02 DNA Sequence 



CCGGCGCGGCCCGCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 
CACCCGGGCCCGCTGCGGGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 
CCTGCCCGGAG CCGGCGG GACATGCCCGGAGCGCGCGCTGGAGCGG CG CG AGG AGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 
ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 
CCGGGAGAGCCTGCTGGACGGCGGCAA(^GGTGGTGATC^GCGGCTTTGGAGACCCC 
CTCATCTGTGACAACC^GGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
C^CCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCAT 
G CGG ATCACCCTG CGGAACCTGG AGG AGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 
CAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACC 
TACAG CAACGAATG CGAGCTGCAG CGGG CGCAGTGCAG CCAGCAGCGCCG CATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGC^GCTT 
CGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 
ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTT 
CGACGGCCCTTGTGACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGT 
GTG AACCCG CG CACGCGGCGC CCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 
CCGATCGGGGGCCGCCCGGGGTCTCCTCCTGCAGAAAGTGCGCTCCGGCCAGTGCCAG 
GGTCGAGACCAGTGCCCGGAGCCCTGCCGGTTCAATGCCGTGTGCCTGTCCCGCCGTG 
GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 
TGCCCAGGACGGGCG CACGTATG ACAGTG ATTG CTGGCGG CAGCAGGCTGAGTGCCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
G CCTCGGGGTG CAGTGTG CATTTGGGG CG ACGTGTGCTGTG AAGAACGGGCAGGCAGC 
GTGTG AATG CCTGCAGG CGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGG CAGCG CGTG CG AGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
TCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGGAGCCCT 
GTGCGAGGCCGAGACCGGGCGCrGCGTGTGCCCCrCTGAATGCGTGGCTTTGGCCCAG 
CCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACG 
CCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGTGAGACCTGTGG 
AGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTGTGTCCCCGG 
TGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTG 
CCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC 
AGGGCCGTGCGAGCAGGCCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGACGGT 
GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGG 
ACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGG 
CTCAGATGGGGTCACCTACAG CACCGAGTGTGAG CTG AAGAAGGCCAGGTGTGAGTCA 
CAGCX3AGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGGCCCCGCCTTCGCCCCGC 
TGCOSCCTGTGGCCCCCTTACACTGTGCCC^GACGCCCrrACGGCn'GCrGCC^GGACAA 
TATCACCGCAGCCCGGGGCGTGGGC CTGGCTGG CTGCCCCAGTGCCTGCCAGTGCAAC 
CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 
CAGGTGTGGGKSGGCCTCAGGTGTGACCGCTGTGAGCCrc 

CATCX3TCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 
GTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCCTGTAAGCCCGGGGTGX3CTG 
GACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGC 
TGACX5CTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGC 
GTGGAGGAGTCTGGCTC^GCCC^CTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCA 
ACGCTAC(^^GGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 
GACCATCGCCTGCCGCCAGGGCXn , GCAAATCTCTATCC!A^ 

GAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAG 



C^GTACCGCACACAGCC^GACCACCCCrcracCCT 

AGCGTCCCCAGGACC^CCGTGTGGCCCGTGCTGACGGTGCCCCCC^CGGC^CCCTCCC 



CGATGAGGAACTG AGCGGGGACCAGGAGGCCAGTG GGGGTGG CTCTGGGGGGCTCGAG 

CCCTTGGAGGGCAGCAGCX5TGGCCACCCCrGGGCCACCrGTCGAGAGGGCTTCCTGCT 

AC^CCCCTGCCATGGGGCGGCGCCCTGCCIGTGTCCTGCCCGAGGGTGGTGCTCAGTG 

CGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCC^GACAGCCTCGGGGCAGGACGGC 

TCTGGGCCCTTCCTGGCTGACTTCAAa^CrrTCrCCCACCT 

ACACCATTGCACXSGGACCTGGGGGAGAAGATGGCGCTGGAG^ 

AGGCCCCAGttKSCCrCCTGCTCTACAACGGGC^GAAGACGGAC^ 
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GTGT CG CTGG CACTGCGGG ACCGC CG CCTGGAGTTCCX5 CTACGACCTGGG CAAGGGGG 
CAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGAGCCTGGACCAGGGTCTCACT 
GGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGACGCCCCCCGTGTGTTGGGG 
GAGTCCCCGAAATCCCGCAAGGTTCCGCAC^CCGTCCTC^CCTGAAGGAGCCGCrCT 




ACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCTGCCGTGTCCTCTGG 
CTTCGACGGTGCCATCCAGCTGGTCTCCCTCGGAGGCCX3CCAGCTGCTGACCCCGGAG 
CACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCCCTGCACCCGGGCCT 
CAGGCCACCCCTGCCTC^TGGGGCCTCCrGCGTCC(XAGGGAGGCTGCCTATGTGTG 
CCTGTGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGAGAAGTCA 
GCGGGGGACX3TGGATACCTTGGCCTTTGACGGGCGGACCTTTGTCGAGTACCTCAACG 
CTGTGACCGAGAGCGAGAAGGC^CTGCAGAGCAACCACTTTGAACTGAGCCTGCGCAC 
TG AGGCCACGCAGGGG CTGGTG CTCTGG AGTGGCAAGG CCACGGAG CGGGC AGACTAT 
GTGG CACTGG CCATTGTGGACGGG CACCTGCAACTG AG CTACAACCTGGGCTCCCAGC 
CCGTGGTGCTGCGTTCCACCGTGCCCGTCAACACCAACCGCTGGTTGCGGGTCGTGGC 
ACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACCGGC 
TCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCTGTGGCTTGGGGGCC 
TGCCGGAGCTGCCCGTGGGCCCAGCACrGCCCAAGGCCTACGGCACAGGCTTTGTGGG 
CTGCTTGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTC 
ACCAAGCC^GAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACCAGAGCCCCGCGCC 
CGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 4732 




SEQIDNO:98 


1565 aa | MW at 163817.1kD 


NOV32b, 
CG94946-02 Protein Sequence 


MRHGRPVP PG PAAGRPLL PLLWAACVL PGAGGTCPERALERREEEANVVLTGT VEEI 
LNVD P VQHTY S C KVRVWRYL KG KD L VARE SLLDGGN FCWI SGFGDPLI CDNQV STGDT 
RIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSWAPVCGSDASTYSNECELQRAQC 
SQQRRIRLLS RGPCGS RD PCSNVTCS FG STCARS ADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCD PCQGAL PD PS RS CRVNPRTRRP EMLL 
RPESCPARQAPVCGDDGVTYENtK^MGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT 
ACTLGRE I QVARKGPCDRCX3 QCRFGALCEAETGRCV C PSECVALAQPVCG SDGHTYPS 
ECMLH\nmCTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGS 
DGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
IWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGAC 
RGPAFAPLPPVAPLHCAQTPYGCCQDNITAARGVGIiAGCPSACQCNPHGSYGGTCDPA 
TGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLC 
SCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCP 
MLTCPEANATK^CGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSA 
SVTVTTPGIJiLSQALPAPPGALPIiAPSSTAHSC^TPPPSSRP 

VPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGP 
PVERASCTOPOTGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNGFS 
HLELRGLHTIARDLGEKMALEAVFIiARGPSGLLLYNGQKTDGKGDFVSLALRDRRLEF 
RYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKGALRVGDGPRVLGESPKSRKVPHTV 
LNLKEPLYVGGAPDFS KLARAAAVS SGFDGAI QLVS LGGRQIiLTPEHVLRQVDVTS FA 
GHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGR 
TFVEYLNAVTESEKALQSNHFELSLRTEATO^LVLWSGKATERADYVALAIVDGHLQL 
SYNLGSQ PVVLRST^ VNTNRWLRWAHREQREGSLQVGNEAPVTGSS PLGATQLDTD 
GALWLGGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLI.EDAVTKPELRPCPTP 




SEQ ID NO: 99 


893 bp 


NOV32c, 
CG94946-03 DNA Sequence 


CCGGCGCGGCCaSCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 

CACCCGGGCCCGCTGCGGGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 

CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 

GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 

ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 

CCGGGAGAGCCrGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGA 

CTCATCTGTGACAACCAGGTGTCCACTrGGGGAC^ 

CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGGTGCTCTGGAGTGGCAAGGC 
CACGGAGCGGGCAGACTATGTGGCACTGGCCATTOT 

TACAACCTGGGCTCCCAGCCCGTGGTGCTGCGTTCCACCGTGCCCGTCAACACCAACC 
GCTGGTTGCGGGTCGTGGCACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAA 
TGAGGCCCCTGTGACCGGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGA 
GCCCTGTGGCT^GGGGGCCTGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCT 
ACGGCACAGGCTTTGTGGGCTGCTTGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCA 




GGCACCAGAGCCCCGCGCCCGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 865 




SEQ ID NO: 100 


276 aa | MW at 29688.8kD 
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NOV32c, 
CG94946-03 Protein Sequence 



MRHGRPVPPGPAAGRPl^PLLVVAAC\njPGAGGTCPERALERRE EEANWLTGTVEE I 
LNVD PVQHTYSCKVRVWRYLKGKDLV3UlESIiU)GGNKVVI SGFGDPL ICDNQVSTGDT 
RIFFVNPAPPYLWPAHKNELVLWSGKATERADYVMATVDGHL 

TVPVNTl^WLRWAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPV 
GPALPKAYGTGFVGCLRDVWGRHPLHLLEDAVTKPELRPCPTP 



SEQ ID NO: 101 



1931 bp 



NOV32d, 
CG94946-04 DNA Sequence 



CCGGCGCGGCCCGCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 
CACCCGGGCCCGCTGCGGGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 
CCTGCCCGG AGCCGG CGGG ACATGCCCGG AG CGCGCGCTGG AG CGG CGCGAGGAGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 
ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGG'IGGC 
CCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCC 
CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTGAT 
GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACG CCTCCTGATG CGTG CCGGGGAATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 
CAAGAAGAG CCCGTG CCCCAGCGTGGTGGCG CCTGTGTGTGGGTCGGACG CCTCCACC 
TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGC CGTGCGGCTCGCGGGACCCCTGCTCC AACGTGACCTG CAGCTT 
CGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 
ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTT 
CGACGGCCCTTGTGACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGT 
GTGAACCCGCGCACGCGGCGCCCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 
CCG ATCGGGGG CCGCCCGGGGTCTCCTCCTG CAGAAAGTGCGCTCCGGCCAGTG CCAG 
GGTCGAGACCAGTG CCCGGAG GC CTGCCGGTTCAATGCCGTGTGCCTGTCCCGCCGTG 
GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 
TGCCCAGGACGGGCG CACGTATG ACAGTGATTGCTGG CGG C AG CAGG CTGAGTG CCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
T CCAGGTGGCGCGCAAAGGACCCTGTG ACCG CTGCGGGCAGTGCCGCTTTGGAG CCCT 
GCCTGTGACCGGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCTG 
TGGCTTGGGGGCCTGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCA 
CAGGCTTTGTGGGCTGCTTGCGGGATGTGGTGGTGGGCCGGC^CCCGCTGCACCroCT 
GGAGGACGCCGTCACCAAGCCAGAGCTGCGGCCCTGCCCCACCCCATG AGCTGGCACC 
AGAGCCCCGCGCCCGCT 



ORF Start: ATG at 37 



ORF Stop: TGA at 1903 



SEQ ID NO: 102 



622 aa 



NOV32d, 
CG94946-04 Protein Sequence 



MWat 66353.9kD 

MRHGRPVPPG PAAGRPLLPLLVVAAC^PGAGGTCPERALERREEE ANVVLTGTVEE I 
LNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKWI SGFGO PLI CDNQVS TGDT 
RIFFWPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKS PCPS WAP VCGSDASTYSNECELQRAQC 
S QQRRI RLLSRGPCGSRD PCSNVTCS FGSTCARSADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGP(^PCQGALPDPSRSCRVNPRTRRPEMLL 
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGP 
CrK3APSPCXGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSIX3VTYGSACELEAT 
ACTLGREIQVARKG PCDRCGQCRFGALPVTGS S PLGATQLDTDGALWLGGLP ELPVG P 
ALPKAYGTGFVGCLRDVWGRHPLHLLEDAVTKPELRPCPTP 
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SEQIDNO: 103 



4697 bp 



NOV32e, 
CG94946-05 DNA Sequence 



ccggcgcggcccgcgcgctcttccgccgcctctcgcat6cgcca.tggccggccggtcc 
cacccgggcccgctgcggggcggccgctgcigcctctccttgtggtggccxx:gtgcgt 
cctgcccggagccggcgggacatgcccx3gagcgcgcgctggagcggcgcgaggaggag 
gcgaacgtggtgctcaccgggacggtggaggagatcctcaacgtggacccggtgcagc 
acacgtactcctgcaaggttcgggtctggcggtacttgaagggcaaagacctggtggc 
ccgggagagcctgctggacggcggcaacaaggtggtgatcagcggctttggagacccc 
ctcatctgtgacaaccaggtgtccactggggacaccaggatcttctttgtgaaccctg 
cacccccat acctgtggccagcccacaag aacg ag ctg atgct caactccagcctcat 
gcggatcaccctgcggaacctggaggaggtggagttctgtgtggaagataaacccggg 
acccacttcactccagtgcctccgacgcctcctgatgcgtgccggggaatgctgtgcg 

GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 

TACAG CAACGAATGCGAG CTGCAGCGGGCGCAGTGCAGCCAGC AGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 
CGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 



G CGAGTGCCAGCTCCTGCG CCGCGCCTGCGCCCGCCAGG AGAATGTCTTCAAGAAGTT 
CGACGGCCCTTGTGACCCCTGTCAGGGCGCCCTCCCTGACCCX5AGCCGCAGCTGCCGT 
GTGAACCCGCGCACGCGGCGCCCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 
CCGATCGGGGGCCGCCCGGGGTCTCCTCCTGCAGAAAGTGCGCTCCGGCCAGTGCCAG 
GGTCGAGACCAGTGCCCGGAGCCCTGCCGGTTCAATGCCGTGTGCCTGTCCCGCCGTG 
GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 
TGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTG AATG CCTG CAGGCGTG CTCGAGCCTCTACG ATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
TCCAGGTGGCG CG CAAAGG ACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGG AGCCCT 
GTGOSAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGGCCCAG 
CCCGTGTGTGGCTCCGACX3GGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACG 
CCTGCACACACCAGATCAGCCTG CACGTGGCCTCAG CTGGACCCTGTG AGACCTGTGG 
AGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTGTGTCCCCGG 
TGTGAGCACCCCCCGCCCGGCCCCX5TGTGTGGCAGCGACGGTGTCACCTACGGCAGTG 
CCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC 
AGGGC CGTG CGAGCAGG CCGAGTGCGGTTCCGGAGG CTCTGGCTCTGGGGAGGACGGT 
GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGG 

CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA 
CAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGGCCCCGCCTTCGCCCCGC 
TGCCGCCTGTGGCrcCCTTACACTGTGCCC^GACGCCCTACGGCTGCTGCCAGGACAA 
TATCACCGCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAAC 
CCCCATGGCTCTTACGGCGGCAC 
CAGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGC 
CATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 
GTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTG 
GACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGC 
TGACGCI^CTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGC 
GTGGAGGAGTCTGGCTCAGCCCACTGTGTCrGCCCGATGCTCACCTGTCCAGAGGCCA 
ACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 
GACCATCGCCTGCCGCCAGGGCCTGCAAATCTCTATCCAGAGCCTGGGCCCGTGCCAG 
GAGGCTGTTGCTCCCAGCACrrC^CCCGAC^TCTGCCTCan'GACTGTGACCACCCCAG 
GGCTCCTCCTGAGCa^C^CTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAG 
CAGTACCGCACACAGCCAGACCACCCCTCCGCCCTCATCX3CGACCTCGGACCACTGCC 
AGCGTCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCC 
CTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAG 
CGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCTCGAG 
CCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCT 
ACAACTCCGCGTTGGGCTGCTGCTCTGATGGGAAGACGCCCTCGCTGGACGCAGAGGG 
CTCCAACTGCCCCGCCACCAAGGTGTTCCAGGGCGTCCTGGAGCTGGAGGGCGTCGAG 
GGCCAGGAGCTGTTCTACACGCCCGAGATGGCTGACCCCAAGTCAGAACTGTTCGGGG 
AGACAGCCAGGAGCATTGAGAGCACCCTGGACGACCTCTTCCGGAATTCAGACGTCAA 
GAAGGATTTCCGGAGTGTCCGCTTGCGGGACCTGGGGCCCGGCAAATCCGTCCGCGCC 
ATTGTGGATGTGCACTTTGACCCCACCACAGCCTTCAGGGCACCCGACGTGGCCCGGG 
CCCTGCTCCGGCAGATCCAGGTGTCCAGGCGCCGGTCCTTGGGGGTGAGGCGGCCGCT 
GCAGGAGCACGTGCGATTTATGGACTTTGACTGGTTTCCTGCGTTTATCACGGGGGCC 
ACGTCAGGAGCCATTGCTGCGGGAGCCACGGCCAGAGCCACCACTGCATCGCGCCTGC 
CX3TCCTCTGCTGTGACCCCTCGGGCCCCGCACCCOU3TCACACAAGCC^GCCCGTTG^ 
CAAGACCACGGCAGCCCCCACCACACGTCGGCCCCCCACCACTGCCCCCAGCCGTGTG 

TGTGTGCCTGT 
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GTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGAGAAGTCAGCGGQ 

GGACGTGGATACCTTGG CCTTTGACGGGCGGACCTTTGTCGAGTAC CTCAACG CTGTG 

ACCGAGAG CGAGAAGG CACTG CAG AG CAACCACTTTG AACTGAGCCTGCGCACTGAGG 

CCACGC^GGGGCTGGTGCTCTGGAGTGGCAAGGCCACGGAGCGGGCAGACTATGTGGC 

ACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAACCTGGGCTCCCAGCCCGTG 

GTGCTGCGTTCCACCGTGCCCGTCAACACCAACCGCTGGTTGCGGGTCGTGGCACATA 

GGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACCX3GCTCCTC 

CCCGCTGGGCGCCACGCAG CTGGACACTGATGGAGCC CTGTGGCTTGGGGGCCTGCCG 

GAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGTGGGCTGCT 

TGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTCACCAA j 

GCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACCAGAGCCCCGCGCCCGCT 


I 


ORF Start: ATG at 37 


ORF Stop: TGA at 4669 




SEQIDNO: 104 


1544aa MW at 1 62003 .7kD 


NOV32e, 
CG94946-05 Protein Sequence 

i 

i 

i 


MRHGRPVPPGPAAGRPLLPLLWAACVLPGAGGTCPERALERREEEANVVLTGTVEEI 
LNVD PVQHTYS CKVRVWR YLKG KDLVARE S LLDGGNKWI SGFGD PI>I CDNQVSTGDT 
RIFF^PAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKS PC PS WAPVCGSDAS TYSNECELQRAQC 
SQQRRIRLLSRGPC^SRDPCSNVTCSFGSTCT^ADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMLL 
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCS CDRVTCDG AYRPVCAQDGRTYDS DCWRQQAECRQQ, RAI P S KHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT 
ACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPS 
E CMLHVHACTHQ I SLHVAS AG PCETCGD AVCAFGAVCSAGQCVCPRCEHP PPGP VCGS 
DGVTYGSACELREAACLQQTQ I EEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
IWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGAC 
RGPAFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPA 
TGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLC 
SCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCP 
MLTCPEANATKVCG SDG VT YGNECQLKTI ACRQG LQI S IQS LGPCQE AVAPS TH PTS A 
SVTWTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLT 
VPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGP 
PVERASCYNSALGCCSDGKTPSIiDAEGSNCPATKVFQGVLELEGVEGQELFYTPEMAD 
PKSELFGF^ARSIESTLDDLFRNSDVKKDFRSVRLRDLGPGKSVRAIVDVHFDPTTAF 
RAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATAR 
ATTAS RLPS S AVTPRAPH PSHTSQPVAKTTAAPTTRRP PTTAPS RVPGRRPPAS CVPR 
EAAYVO^CPGGFSGPHCEKGLVEKSAGDVDTLAFIXSRTFVEYLNAVTESEKALQSNHF 
ELSLRTEATGGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVIjRSTVPVNTNR 
WLRWAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPVGPALPKAY 
GTGFVGCLRDVWGRHPLHLLEDAVTKPELRPCPTP 




SEQ ID NO: 105 


6494 bp 


NOV32f, 


CCGGCGCGGCCCGCGCGCTCTTCCGCCX3CCTCTCGCATGCGCCATGGCCGGCCGGTCC 


CG94946-06 DNA Sequence 

• 


CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 
ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 
CCGGGAGAGCCTGCTGGACGG CGGCAACAAGGTGGTGATCAGCGG CTTTGGAGACCCC 
CT CATCTGTGACAACCAGGTGTC CACTGGGG ACAC CAGGATCTTCTTTGTGAACCCTG 
C^CCCCCATACCTGTGGCCAGCCC^CAAGAACGAGCTGATGCTC^CTCCaGCCTCAT 
GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCX5TCTG 
CAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACC 
TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 
CGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 
ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCrrTCAAGAAGTT 
CGACGGCCCTTGTGACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGT 
GTGAACCCGCGCACGCGGCGCCCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCC 
GG CAGG CGCCAGTGTGTGGGGACGACGGAGT CAC CTACG AAAACG ACTGTGTCATGG G 
CCGATCGGGGGCCGCCCGGGGTCTCCTCCTGCAGAAAGTGCGCTCCGGCCAGTGCCAG 




GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 
TGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCrCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTGAATGCCrTCCAGGCGTGCTCGAGCCTCT^ 

GT CACATACGGCAG CGCGTGCG AGCTGGAGG CCACGGCCTGTACCCTCGGG CGGGAG A 
TCCAGGTGGCGCGCAAAGGACCCTGTGACCG CTGCXK3GCAGTGCCGCTTTGGAGCCCT 
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GTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGGCCCAG 
CCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACG 
CCTGCACACACCAG ATCAGCCTG CACGTGG CCTCT^ CTGGACCCTGTGAGACCTGTGG 

TGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTG 
CCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC 
AGGGCCGTGCGAGCAGGCCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGACGGT 
GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCX3GAGG 
ACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGG 
CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA 
CAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGGCCCCGCCTTCGCCCCGC 
TGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAA 
TATCACCGCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAAC 
CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 
CAGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGG 
CATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 
GTGCGGGATG ACTGTG AGGAG ATG ACGGGG CTGTGCTCGTGTAAG CCCGGGGTGGCTG 
GACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGC 
TGACGCTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGC 
GTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCA 
ACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 
GACCATCG CCTG CCGCCAGGG C CTG CAAATCTCT ATCCAGAGCCTGGGCCCGTGCCAG 
GAGGCTGTTGCTCCCAG CACTC ACCCGACAT CTGCCTCCGTGACTGTGACCACCCCAG 
GGCTCCTCCTGAGCCAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAG 
CAGTACCGCACACAGCCAGACCACCCCTCCGCCCTCATCGCGACCTCGGACCACTGCC 
AGCGTCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCC 
CTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAG 
CGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCTCGAG 
CCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCT 
ACAACTCCGCGTTGGGCTGCTG CTCTGATGGG AAGACGC CCTCGCTGGACGCAGAGGG 
CTCCAACTGCCCCGCCACCAAGGTGTTCCAGGGCGTCCTGGAGCTGGAGGGCGTCGAG 
GGCCAGGAGCTGTTCTACACGCCCGAGATGGCTGACCCCAAGTCAGAACTGTTCGGGG 
AGACAGCCAGGAGCATTGAGAGCACCCTGGACGACCTCTTCCGGAATTCAGACGTCAA 
GAAGGATTTCCGGAGTGTCCGCTTGCGGGACCTGGGGCCCGGCAAATCCGTCCGCGCC 
ATTGTGGATGTGCACTTTGAC CCCACCACAGCCTTCAGGGCAC CCGACGTGGCCCGGG 
CCCTGCTCCGGCAGATCCAGGTGTCCAGGCGCCGGTCCTTGGGGGTGAGGCGGCCGCT 
GCAGGAGCACGTGCGATTTATGGACTTTGACTGGTTTCCTGCGTTTATCACGGGGGCC 
ACGTCAGG AG CC ATTG CTGCGGGAGCCACGGCCAGAGCCACCACTGCATCGCG CCTGC 
CGTCCTCTGCTGTGACCCCTCGGGCCCCGCACCCCAGTCACACAAGCCAGCCCGTTGC 
CAAGACCACGGCAGCCCCCACCACACGTCGGCCCCCCACCACTGCCCCCAGCCGTGTG 
CCCGGACGTCGGCCCCCGGCCCCCCAGCAGCCTCCAAAGCCCTGTGACTCACAGCCCT 
GCTTCCACGGGGGGACCTGCCAGGACTGGGCATTGGGCGGGGGCTTCACCTGCAGCTG 
CCCGGCAGGCAGGGGAGGCGCCGTCTGTGAGAAGGTGCTTGGCGCCCCTGTGCCGGCC 
TTCGAGGGCCGCTCCTTCCTGGCCTTCCCCACCCTCCX3CGCCTACCACACGCTGCGCC 
TGGCACTGGAATTCCGGGCGCTGGAGCCTCAGGGGCTGCTGCTGTACAATGGCAACGC 
CCX5GGGCAAGGACTTCCTGGC^TTGGCGCTGCTAGATGGCCX3CGTGCAGCTCAGGTTT 
GACACAGGTTCGGGGCCGGCGGTGCTGACCAGTGCCGTGCCGGTAGAGCCGGGCCAGT 
GGCACCGCCTGGAGCTGTCCCGGCACTGGCGCCGGGGCACCCTCTCGGTGGATGGTGA 
GACCCCTGTTCTGGGCGAGAGTCCCAGTGGCACCGACGGCCTCAACCTGGACACAGAC 
CTCTTTGTGGGCGGCGTACCCGAGGACCAGGCTGCCGTGGCGCTGGAGCGGACCTTCG 
TGGGCGCCGGCCTGAGGGGGTGCATCCGTTTGCTGGACGTCAACAACCAGCGCCTGGA 
GCTTGGCATTGGGCCGGGGGCTGCCACCCGAGGCTCTGGCGTGGGCGAGTGCGGGGAC 
CACCCCTGCCTGCCCAACCCCTGCCATGGCGGGGCCCCATGCCAGAACCTGGAGGCTG 
GAAGGTTCCATTG CCAGTGCCCGCCCGGCCGCGTCGGAC CAACCTGTGCCGATGAGAA 
GAGCCCCTGCCAGCCCAACCCCTGCCATGGGGCGGCGCCCTGCCGTGTGCTGCCCGAG 
GGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCCAGACAGCCT 
CGGGGCAGGACGGCTCTGGGCCCTTCCTGGCTGACTTCAACGGCTTCTCCCACCTGGA 
G CTG AG AGG CCTG CACACCATTG C ACGGG ACCTGGGGGAGAAG ATGG CG CTGGAG GC C 
GTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAGAAGACGGACG 
GCAAGGGGGACTTCGTGTCGCTGG CACTG CGGGACCGCCGCCTGGAGTTCCGCTACGA 
CCTGGGCAAGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGAGCCTGG 
ACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCrGCGTGTGGGCGACGGCC 
CCCGTGTGTTGGGGGAGTCCCCGAAATCCCGCAAGGTTCCGCACACCGTCCTCAACCT 
GAAGGAGCCGCTCTACGTAGGGGGCGCTCCCGACTOCAGCAAGCTGGCCCGTGCTGCT 
GCCGTGTCCTCTGGCTTCGACGGTGCCATCCAGCTGGTCTCCCTCX3GAGGCCGCCAGC 
TGCTGACCCCGGAGCACGTGCTGCGGCAGGTGGACGTCACG 
CTGCACCCGGGCCTCAGGCCACCCCTXiCCTCAATCGGGCCT 

GCTGCCTATGTGTGCCTGTGTCCCGGGGK3ATTCTCAGGACCGCACTGCGAGAAGGGGC 
TGGTGGAGAAGTCAGCGGGGGACGTGGATACCITGGCCTTTGACGGGCGGACCrTTGT 
CGAGT ACCTCAACG CTGTGAC CG AG AGCGAG AAGGCACTGCAGAG CAACCACT TTGAA 
CTGAGCCTGCGCACTGAGGCCACGCAGGGGCTGGTGCTC^^ 

AGCGGGCAGACTATGTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGCrACAA 
CCTGGGCTCCCAGCCCGTCGTGCTGCXjTTCCACCGTGCCCGTCAACACCAACCGCTGG 
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TTGCGGGTCG TGG CACATAGGGAGCAG AGGGAAGGTTCCCTGCAGGTGGGCAATGAGG 
CCCCTGTGAC CG G CT CCTCCC CG CTGGGCGCCACX3 CAGCTGGACACTGATGGAG CCCT 
GTGG CTTGGTGAGTGTTTTGGGG AGACTAGAGAGGGATGCCCAAGGGTCTCAGATAT C 
CGAGGGACAGACTCCACCCCCCAGCGCCC^CCCTTGAGTCAGGGTGCATGTGAGCCGG 
CGGGCTGGGCTCTCTTCTCCCGCTGTAGCCCCTGCAGTTCCCAGTGCTGTGGGGCCGG 




AGCATCACTGAGTCACAG CCGGGTGACTCCCACTGTCTGTGCTGCAGGGG CCTGCCGG 
AGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGTGGGCTGCTT 
GCGGGATGTGGTGGTGGGCCGGCACCCGCTG CACCTGCTGG AGGACG CCGTCACCAAG 
CCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACCAGAGCCCCGCGCCCGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 6466 


! 


SEQ ID NO: 106 


2 143 aa I MW at 225054.6RD 


NOV32f, 
CG94946-06 Protein Sequence 

i 
i 


MRHGRPVPPGPAAGRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEI 
LNVDPVQHTYSCKVRVWRYIiKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDT 
RIFFWPAPPYLWPAHKNEI^NSSLMRITIiRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCG FG AVCE PNAEG PGRAS CVCKKS PCPS VVAPVCGSDASTYSNECELQRAQC 
SQQRRIRLLSRGPCGS RD PCSNVTCS FGSTCARSADGLTASCLCP ATCRG APEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCDPCC^ALPDPSRSCRVNPRTRRPEMLL 
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCG^jRDQCPEPCRFN 
AVC L S RRGRPRC S CDR VTCDGAYR P VCAQDGRTYDS D CWRQQ AE CRQQ RAI P S KHQG P 
CDQ APS PCLGVQCAFGATCAVKNGQAACECLQACSSLYD PVCGSDG VTYG SACELEAT 
ACTLGREIQVARKGPCDRCGOCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPS 
ECMLHVHACTHQ I S LHVAS AGP CETCGDAVCAFG AVCS AGO. CVCPR CEHP PPG PVCGS 
DGVTYGSACELREAACI^QQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
iWDEDSEDGPCVCDFSCQSVPGSPVCGSIXSVTySTECELKKARCESQRGLYVAAOGAC 
RGP AF APLP P VAPLHCAQTPYG CCQDNITAARGVGLAG CP S ACQCNPHGS YGGTCD PA 
TGQCSCRPGVGGLRCDRCEPGFV^FRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLC 
SCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCP 
MLTCPEANATKVCGSDGVTYGNECQLKTI ACRQGLQIS I QSLGPCQEAVAPSTHPTSA 
SVTVTTPGLLIiSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLT 
VPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGP 
PVERASCYNSALGCCSDGKTPSLDAEGSNCPATKVFC<?VLELEGVEGQELFYTPEMAD 
PKS ELFGETARS I ESTLDDLFRNSDVKKDFRSVRLRDLGPGKSVRAI VDVHFDPTTAF 
RAPDVARALLRQ I QVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAI AAGATAR 
ATTASRL PS SAVT PRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPPAPQQP P 
KPCDSQPCFHGGTCQDWALGGGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTL 
RAYHTLRLALE FRAI»E PQGLLLYNGNARGKDFLALALLDGRVQLRFDTGSGPAVLTS A 
VPVEPGQWHRLELSRHWRRGTLSVDGETPVLGESPSGTDGLNLDTDLFVGGVPEDQAA 
VALERTFVGAGLRGCIRLLDVNNQRLELGIGPGAATRGSGVGECX3DHPCLPNPCHGGA 
PCQNLEAGRFHCQCPPGRVGPTCADEKSPCQPNPCHGAAPCRVLPEGGAQCECPIiGRE 
GTFCQTASGQDGSGPFIJVDE^GFSHLEI^GLHTIARDLGEKMALEAVFLARGPSGIJj^ 
YNGQKTIXSKGDFVSIiALRDRRLEFRYDIXSKGAA 

ALRVGIX3PRVLGESPKSRKVTHTVLNLKEPLYVGGAPDFSKLARAAAV 

VSLGGRQLLTPEHVLRQVDVTS FAGHPCTRASGHPCLNGASCVPREAAYVCLCPGG FS 

GPHCEKGLVEKSAGDVDTLAFIXSRTFVEYLNAVTESEKA^ 

LWSGKATERADYVALArTOGHI^LSYNIX3SQPVVLRSTVPVNTmWL^ 

S LQVGNEAPVTGSS PLGATQLDTDGALWLGEC FGETREGCPRVSDI RGTDSTPQRP PL 

SQGACEPAGWAIiFSRCSPCSSQCCGAGRRVPRCGPPAGHLLVGVPISITESQPGDSHC 

LCCRGLPELPVGPALPKAYGTGFVGCLRDVWGRHPLHLLEDAVTKPELRPCPTP 




SEQ ID NO: 107 


5688 bp 


NOV32g, 
CG94946-07 DNA Sequence 


CCGGCGCGGCCCGCGCX5CTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 

CACCCGGGCCCGCTGCX3GGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 

CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 

G CG AACGTGGTG CTCACCG GG ACGGTGGAGG AG ATCCTCAACGTGGACCCGGTGCAG C 

ACACGTACrCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 

CCGGGAGAGCCTGCTGGA(^GCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCC 

CTCATCTGTGAC^CCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAA 

CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATC 

GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 

ACCCACTTCACTCCAGTGCCTCCGA(XCCTCCTGATGCGTGCCGGG^ 

GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 

CAAGAAGAGCCCGTGCCCC^GCGTGGTGGCXSCCTGTGTGTGGGTCGGACGCCTCCACC 

TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 

TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 

CGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 




ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTT 




GTGAACCCGCGCACX3CGGCGCCCTGAGATGCGCCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACXSAAAACGAC^^ 
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CCGATCGGGGGCCGCCCGGGGTCTCCTCCTGCAGAAAGTQCGCTCCGGCCAGTGCCAG 

GGTCGAGACCAGTGCCCGGAGCCCTGCCGGTTCAATGCCGTGTGCCTGTCCCGCCGTG 

GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACXjGGGCCTACAGGCCCGTGTG 

TG CCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGG CAG CAGGCT GAGTGCCGG 

CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 

GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 

GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 

GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 

TCCAGGTGGCGCGCAAAGX3ACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGGAGCCCT 

GTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGGCCCAG 

CCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACG 

CCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGCGAGACCTGTGG 

AGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTGTGTCCCCGG 

TGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTG 

CCTGCGAGCTACGGGAAG CCG CCTGCCTCCAGCAGACACAGATCGAGGAGG CCCGGGC 

AGGG CCGTGCGAG CAGG CCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGG AGGACGGT 

GACTGTGAGCAGGAGCTGTGCCGG CAGCGCGGTGG CATCTGGGACGAGGACTCXSGAGG 

ACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGG 

CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA. 

CAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGGCCCCACCTTCGCCCCGC 

TGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAA 

TATCACCGC^GCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAAC 

CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 

CAGGTGTGGGGGGCCTCAGGTGTGACCG CTGTGAGCCTGG CTTCTGGAACTTTCGAGG 

CATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 

GTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTG 

GACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGC 

TGACGCTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGC 

GTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCA 

ACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 

GACCATCGCCTGCCG ACGGTGTCAC CTACGCCAGGGCCTGCAAATCTCTATCCAGAGC 

CTGGGCCCGTGCCAGGAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGA 

CTGTGAC(^CCCCAGGGCTCCTCCTGAGCCAGGC^CTGCCGGCCCCCCCCGGCGCCCT 

CCCCCTGGCTCCCAGCAGTACCGCAC^CAGCCAGACCACCCCTCCGCCCTCATCGCGA 

CCTCGG ACCACTG CCAGCGTCCCCAGGACCACCGTGTGG CCCGTGCTGACGGTGCCCC 

CCACGGCACCCTCCCCTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAG 

CACTGATGG AAGCAG CGATG AGGAACTGAG CGGGG ACCAGGAG G C CAGTGGGGGTGG C 

TCTGGGGGG CCCGAG CCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCG 

AGAGGGCTrCCTGCTACAACCCCTGCCATGGGGCGGCGCCCTGCCGTGTGCTGCGCGA 

GGGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCCAGACAGCC 

TCGGGGCAGGACGGCTCTGGGCCOTCCTGGCTGAerTCAACGGCTTCTCCCACCTGG 

AGCTGAGAGGCCTGCACACCTTTG CACGGGACCTGGGGGAGAAGATGGCG CTGGAGGT 

CGTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCXCTACAACGGGCAGAAGACX5GAC 

GG CAAGGGGGACTTCGTGTCGCTGGCACTGCGGGACCGCCGCCTGGAGTTCOSCTACG 

ACCTGGGCAAGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGAGCCTG 

GACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGACGGC 

CCCCGTGTGTTGGGGGAGTCCCCGGTTCCXSCACACCGTCCTCAACCrGAAGGAGCCGC 

TCTACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCTGCCGTGTCCTC 

TGGCTTCGACGGTG CCATCCAG CTGGTCTCCCTCGGAGGCCGCCAGCTGCTGACCCCG 

GAGCACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCCCTGCACCCGGG 

CCrCAGGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAGGCTGCCTATGT 

GTGCCTGTGTCCCGGGGGATTCTCAGGACCGCAGTGCGAGAAGGGGCTGGTGGAGAAG 

TCAGCGGGGGACG TG G ATACCTTGG C CTTTG ACGGG CGGACCTTTGTCGAGTACCTCA 

ACGCTGTGACCX^GAGCGAGAAGGCACTGCAGAGCAACCACTT^ 

CACTGAGGCCACGCAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGGAGCGGGCAGAC 

TATGTGGCACTGGCGATTGTGGACGGGCACCTGCAACTGAGCrACAACCTGGGCTCCC 

AGCCCGTGGTGCnrcGTTCCACCGTGra^ 

GGCACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACC 

GGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCTGTGGCTTGGGG 

GCCTGCCGGAGCTGCCCGTGGGCCC^GCACTGCCCAAGGCCrACGGCACAG^CTTTGT 

GG^CTGCTTGCGGGATGTGGTGGTGGGCCGGCACCCXSCTGCACCTGCT 

GTCACCAAGCCAGAGCIGCGGCCCTGCCCCAC^CC^TG AGCTGGCACCAGAGCCCCGC 

GCCCGCTCTAATTATTTTCTATTTTTGTAAACTTGTTGCT 

GCCTGAGTGTTGGCCGGAGGGACltSCTGGCCCGGCCTCCCTTCCGTCCAGGCAG 

GCTGCAGACAGACCTAGTGCrGAGGGATGGACAGGCGAGGTGGCAGCGTGGAGGGCTC 

GGCGTGGATGGCAGCCTCAGGACACACACCCCTGCCTCAAGGTGCTGAGCCCCCGCCT 

TG CACTG CGCCTGCCC CACGGTGTCCCCGCCGGGAAGCAG CCC CGG CTCCTG AATCAC 

CCTCGCTCCGTC^GGCGGGACTCGTGTCrCAAAAAGGAAGGGGCTGCTGAGGTCTGAT 

GGGGCCCTTCCTCCGGGTGACCCCACAGGGCCTTTCCAAGCCCCTATTTGAGCTGCTC 

CTTCCTGTGTGTGCTCTGGACCCTGCCTCGGCCTCCTGCGCC^U^TACIXSTGACTTCCA 

AACAATGTTACTGCTGGGCACAGCTCTGCGTTGCTCCCGTGCrccCTGCGCCAGCCCC 

AGGCTGCTGAGGAGCAGAGGCCAGAC'CAGGGCCGATCTGGGTGTCCTGACCCTCAGCT 

GGCCCTCCCCAGCC^CCCTGGAC^TGACCGTATCCCTCTGCCACACCCCAGGCCCTGC 
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GAGGGGCTATCGAGAGGAGCTCACTGTGGGATGGGGTTGACOTCTGCCGCCTGCCTGG 
GTATCTGGGCCTGX3CCATGGCTGTGTTCTrCATGTGTTGATTTTAT^ 
GTGGTGGGTCTCATCTTTCCCATCTCGCCTGAGA^ 
AATCCTCCCCACAGCGTCAGTGAAAGTCGTCCrTGTCTCAG^ 

CAGTG TCTG AC CAAGG TCAAGGGG CAGGTGCAGAGGTGGCAGGGATGG CTCCGAAG CC 
AGAA 





ORF Start: ATG at 37 ORF Stop: TGA at 4735 




SEQ ID NO: 108 


1566 aa 


MWatl64102.4kD 


NOV32g, 
CG94946-07 Protein Sequence 


MRHGRPVPPGPAAGRPLLPLLWAACVLPGAGGTCPERALERREEEANVVLTGTVEEI 
LNVDPVQHTYS CKVRVWRYLKGKDLVARESLLDGGNKWI SGFGDPLI CDNQVSTGDT 
R I F FVN PAP PYXiWP AHKNELMLNS SLMRI TLRNLE E VE FCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSWAPVCGSDASTYSNECELQRAQC 
S QQRRIRLLSRG PCGSRDPC SNVTCS FGSTCARS ADGLTAS CLC PATCRGAP EGTVCG 
S DG ADYPGECQLLRRACARQENVFKKFDGPCD PCQG ALPD PSRS CRVNPRTRRPEMRL 
RPESCPARQAPVCGDDGVTYEITOCVMGRSGAARGIiLLQiCVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCS CDRVTCHX5AYRP VCAQDGRTYDSDCWRQQAECRQQRAI PS KHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT 
ACTLGREIQVARKGPO)RCGQCRFGALCEAETGRCVCPSEC^ALAQPVCGSDGHTYPS 
ECMLHVHACTHQI SLHVAS AGPCETCGDAVCAFGAVCSAGQCVCPRCEHPP PGPVCGS 
DGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
IWDEDSEIX3PCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGAC 
RG PTFAPLP P VAPLHCAQTPYGCCQDN I TAARGVGLAGCP S ACQCNPHGS YGGTCD PA 
TGQCS CRPGVGGLRCDRCE PG FWNFRG I VTDGRSG CT PCS CDPQGAVRDDCEQMTG LC 
S CKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCE FGARCVEESGS AHCVCP 
MLTCPEANATKVCGSDGVTYGNECQLKTIACRRCHLRQGLQISIQSLGPCQEAVAPST 
HPTSASVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTV 
WPVLTVPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGPEPLEGSSV 
ATPGPPVERASCYNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLAD 
FNGFSHLELRGLHTFARDI^EKWALEWFLARGPSGLLLY1JGQKTDGKGDFVS3jALRD 
RRLE FR YDLG KGAAVI RSRE P VTLG AWTRVSLERNGRKG ALRVGDGPRVLGES P VPHT 
VLNLKEPLYVGGAPDFS KIiARAAAVS SGFDGAIQLVSLGGRQLLTPEHVLRQVDVTS F 
AGHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDG 
RT FVE YLNAVTES EKALQS NH F ELS LRTEATQGLVLWSG KATERADYVALAI VDGHLQ 
L S YNLG SQPVVIjRSTVP VNTNRWLRWAHREQREG SLQVGNEAPVTGS S PLGATQLDT 
DG ALWLGGL P ELP VGP ALP KAYGTG FVGCLRDVWGRH PLHLLEDAVTKPELRPCPTP 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 32B. 



Table 32B. Comparison of NOV32a against NOV32b through NOV32g. 


Protein Sequence 


NOV32a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV32b 


34.. 1536 
34..1549 


1072/1557(68%) 
1128/1557 (71%) 


NOV32c 


1913..2053 
136..276 


141/141 (100%) 
141/141 (100%) 


NOV32d 


34..549 
34..549 


505/516(97%) 
505/516(97%) 


NOV32e 


34.. 1536 
34.. 1528 


1170/1515(77%) 
1201/1515(79%) 


NOV32f 


34..2002 
34..2002 


1758/1969 (89%) 
1758/1969 (89%) 


NOV32g 


34..1536 
34..1550 


1081/1562(69%) 
1134/1562(72%) 
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Further analysis of the NOV32a protein yielded the following properties shown 
Table 32C. 



Table 32C Protein Sequence Properties NOV32a 


PSort analysis: 


0.7618 probability located in outside; 0.1900 probability located in lysosome 
(lumen); 0.1000 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 34 and 35 



A search of the NOV32a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 32D. 
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Table 32D. Geneseq Results for NOV32a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV32a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAW26609 


Human agrin - Homo sapiens, 492 
aa. [W0972181 1-A2, 19-JUN-1997] 


1 591. .2053 
22..492 


460/471 (97%) 
461/471 (97%) 


0.0 


AAB93754 


Human protein sequence SEQ ID 
NO:13424 - Homo sapiens, 413 aa. 
[EP 1 0746 1 7-A2, 07-FEB-200 1 ] 


583..968 
1..386 


381/386(98%) 
384/386 (98%) 


0.0 


AAY73993 


Human prostate tumor EST fragment 

derived protein #1 80 - Homo 
sapiens, 4 16 aa. [DEI 9820 1 90-A 1 , 
04-NOV-1999] 


1634..2053 
1..416 


414/420 (98%) 
414/420 (98%) 


0.0 


AAB31889 


Amino acid sequence of a human 
protein - Homo sapiens, 4393 aa. 
[WO200105422-A2, 25-JAN-2001] 


1355..2052 
3639..4393 


252/794 (31%) 
352/794(43%) 


7e-88 


ABG23265 


Novel human diagnostic protein 
#23256 - Homo sapiens, 4436 aa. 
[WO200175067-A2, 1 l-OCT-2001] 


1355..2051 
3672..4435 


252/803 (31%) 
350/803 (43%) 


7e-85 



In a BLAST search of public sequence datbases, the NOV32a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 32E. 



Table 32E. Public BLASTP Results for NOV32a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV32a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


000468 


AGRIN PRECURSOR - Homo 
sapiens (Human), 2026 aa 
(fragment). 


24..2053 
1..2026 


2022/2030(99%) 
2022/2030 (99%) 


0.0 


P25304 


Agrin precursor - Rattus 
norvegicus (Rat), 1959 aa. 


160..2053 
51..1959 


1558/1914(81%) 
1663/1914(86%) 


0.0 


P31696 


Agrin precursor - Gallus gallus 
(Chicken), 1955 aa. 


128.2050 
1..1952 


1234/1970 (62%) 
1479/1970 (74%) 


0.0 


Q90404 


Agrin - Discopyge ommata 
(Electric ray), 1328 aa 
(fragment). 


716..2051 
1..1325 


733/1353 (54%) 
932/1353 (68%) 


0.0 


Q96IC1 


UNKNOWN (PROTEIN FOR 
IMAGE:3544662) - Homo 
sapiens (Human), 488 aa 
(fragment). 


1562..2053 
1..488 


486/492 (98%) 
486/492 (98%) 


0.0 
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PFam analysis predicts that the NOV32a protein contains the domains shown in 
the Table 32F. 



Table 32F. Domain Analysis of NOV32a 


Pfam Domain 


NOV32a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Kazal 


201. .246 


25/61 (41%) 
36/61 (59%) 


7.2e-l8 


Kazal 


276.321 


21/62 (34%) 
33/62 C53%^ 


5.1e-13 


Kazal 


346.393 


18/61 (30%) 
33/61 (54%^ 


7.9e-I2 


Kazal 

i 

| 


420..465 


21/61 (34%) 
38/61 (62%} 


4.1e-16 


J Kazal 


494..538 


24/61 (39%) 
38/61 (6^>%) 


3.6e-l? 

i 


r - V 
Kazal 


559..603 


19/61 (31%) 
38/61 (62%) 


1.5e-18 


Kazal 


624..668 


26/62 (42%) 
37/62 (60%) 


I.5e-I7 


Kazal 


709..754 


24/62 (39%) 
40/62 (65%) 


l.2e-16 


laminin_EGF 


797..848 


28/61 (46%) 
46/61 (75%) 


l.2e-20 


lamininJEGF 


851. .895 


21/59 (36%) 
37/59 (63%) 


4e-ll 


Kazal 


927..973 


25/62 (40%) 
41/62 (66%) 


5.3e-18 


SEA 


11 34.. 1256 


39/132 (30%) 
112/132 (85%) 


1.4e-36 


EGF 


1337.. 1370 


16/47(34%) 
24/47 (51%) 


0.00054 


laminin_G 


1404..1535 


70/162(43%) 
119/162(73%) | 


3.1e-53 


EGF 


1557.. 1589 


16/47 (34%) 
27/47 (57%) 


5.1e-06 


EGF 


1596.. 1628 


16/47(34%) 
25/47 (53%) 


0.0002 


Iaminin_G 


1672..1807 


70/161 (43%) 
123/161 (76%) 


5.1e-51 


EGF 


1 826.. 1860 


14/47 (30%) 


2.3e-06 
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25/47 (53%) 




laminin_G 


1905..2036 


59/161 (37%) 
125/161 (78%) 


1.7e-50 



The NOV33 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 33 A. 



Table 33A. NOV33 Sequence Analysis 




SEQIDNO:109 [ 3354 bp 


NOV33a, 
CG95 165-01 DN A Sequence 

1 

i 

i 
i 

i 


TGTGGC^GGAGGCGATGCGGCGCCGCCGCTArCTfirr,fi^^ 

GGGCGGCGGAGACGGGCTGCCGCGGTCCCGGGACTGGCTCTACGAGTCCTACTACTGC 
ATGAGCCAGCAGCACCCGCTCATCGTCTTCCTGCTGCTCATCGTCATGGGCTCCTGCC 
TCGCCCTGCTCGCCGTCTTCTTCGCGCTCGGCCTGGAAGTTGAAGACCATGTGGCGTT 
TCTAATAACAGTTCCAACTGCCCTGGCGATTTTCTTTGCGATATTTATCCTGGTCTGC 
ATCGAGTCTGTGTTTAAGAAGCTGCTGCGCCTCTTCTCGTTGGTGATATGGATATGCC 
TTGTTGCCATGGGATACCTGTTCATGTGTTTTGGAGGCACCGTCTCTCCCTGGGACCA 
GGTATCGTTCTTCCTCTTCATCATCTTCGTGGTGTACACC7VTGCTGCCCTTCAACATG 
CGAGACGCCATCATTGCCAGCGTCCTCACCTCCTCCTCCCACACCATCGTGCTAAGCG 
TCTGCCTGTCTGCAACACCGGGAGGCAAGGAGCACCTGGTCTGGCAGATCCTGGCCAA 
TGTGATCATTTTCATCTGTGGG AAC CTGGCGGG AG CCTACCATAAGCACCTCATGGAA 
CTCGCTCTTCAGCAAACATATCAGGACACCTGTAATTGCATCAAGTCGCGGATCAAGT 
TGGAATTTGAAAAACGTCAACAGGAGCGGCTTCTGCTCTCCCTGCTGCCGGCCCACAT 
CGC CATGGAGATG AAAG CGG AG ATCATC CAGAGGCTGCAGGGCCCCAAGGCGGGCCAG 
ATGGAGAACACAAATAACTTCCACAACCTGTATGTGAAGCGGCATACAAACGTGAGCA 
TCTTATACGCTGACATCGTTGGCTTTACCCGGCTGGCAAGTGACTGCTCCCCGGGAGA 
ACT AGT C CACATG CTG AATG AG CT C TT TGG AAAGTTTGATCAAATTG CAAAGG AG AAT 
GAATGCATGAGAATTAAAATTTTAGGAGACTGCTACTACTGTGTATCTGGACTCCCTA 
TATCTCTCCCTAACCATGCCAAGAACTGTGTGAAAATGGGGCTGGACATGTGTGAAGC 
CATAAAGAAAGTGAGGGATGCTACTGGAGTTGATATCAACATGCGCGTGGGCGTGCAT 
TCTGGGAATGTCCTGTGTGGCGTGATTGGTCTGCAGAAGTGGCAATATGATGTGTGGT 
CACATGATGTGACCTTGGCCAACCACATGGAAGCTGGAGGGGTCCCTGGACGTGTTCA 
CATTTCTTCTGTCACCCTGGAGCACTTGAATGGCGCTTATAAAGTGGAGGAGGGAGAT 
GGTGACATTAGGGACCCATATTTAAAACAGC^CCTGGTGAAAACCTACTTTGTGATCA 
ACCCCAAGGGAGAACGACGGAGCCCCCAGCATCTCTTCAGACCTCGCCACACCCTTGA 
TGGAGCCAAAATGAGGGCCTCGGTCCGC^TGACCCGGTACTTGGAGTCCTGGGGGGCA 
GCCAAGCCCTTTGCACACCTACATCACAGGGACAGCATGACCACAGAGAACGGCAAGA 
TCAGCACCACGGATGTACCCATGGGTCAGCATAATTTTCAAAATCGCACCTTAAGAAC 
CAAGTC71CAAAAGAAGAGATTTGAAGAAGAATTGAATGAAAGGATGATTCAAGCAATT 
GATGGGATTAATGCACAGAAGCAATGGCTCAAGTCTGAAGACATTaiGAGAATCTCAC 
TGCTTl"i'CTATAACAAAGTACTAGAAAAAGAGTACCGGGCCACGGCACTGCCAGCGTT 
CAAGTATTATGTGACTTGTGCCTGTCTCATATTCTTCTGCIATCTTCATT 

CTCGTGCTGCCAAAAACGTCTOTCCTGGGCATCTCCTTTGGGGCTGCGTTTCTCTTGC 
TGGCCTTCATCCTCTTCGTCTGCT^ 

CTCTCCCCTGCTCATGTGGCTTTTGAAGTCCTCGGGCATCATTGCCAACCGCCCCTGG 
CCACGG ATCTCTCTCACGAT CATCACCACAG CCATCATATTAATG ATGG CCGTGTTCA 
ACATGTTTTTCCTGAGTGACTCAGAGGAAACAATCCCT^ 

CACAAGCTTT TCAG CCTCAAATAATCAGGTGGCGATTCTGCGTGCG CAGAATTTATTT 

TTCCTCCCGTACTITATCTACAGCTGCAT/rCTGGGACTGATATCCrGTTCC^ 

TG CGGGTAAACTATGAGCTGAAGATG T TGATCATGATG GTGG C CTTGGTGGGCTACAA 

CACCATCCTACTCCACACCCACGCCCACGTCCrGGGCGACTACAGCCAGGTCTT 

GAGAGACCAGGCATTTGGAAAGACCTGAAGACCATGGGCTCTGTGTCTCTCrrCTATAT 

TCTTCATCaCAltccttgTTCTGGGTAGACAGA^ 

CTTATGGAAGAACAAATTCAAAAAAGAGCGGGAG<^GATAGAGACCATGGAGAACCTG 
AACCG CGTGCTG CTGGAG AACGTGCTTCCCGCGCACGTGG CTGAG CACTTCCTGG CCA 
GGAGCCTGAAGAATGAGGAGCrATACCACCAGTCCTATGACTGCGTCTGTGTCATGTT 
TGCCTCCATTCCGGATTT C AAAG AATTTT ATACAG AATCCGACGTG AACAAGGAGGG C 
TTG<3AATGCCTTC^GCTCCTGAACGAGATCATCGCrrGACTTTGATGATCTTCTT^ 
AGCCAAAATTCAGTGGAGTTG AAAAG ATTAAGACCATTGG CAGCACATACATGG CAG C 
AACAGGTCTGAGCGCTGTGCCCAGCCAGGAGCACTCCCAGGAGCCCGAGCGGCAGTAC 
ATGCACATTGGCACC^TGGTGGAGTTTGCrTTTGCCCTGGT^ 

TCAACAAGOICTCCTTCAACGACTTCAAATTGCGAGTGGGTATTAACCATGGACCTGT 
GATAG CTGGTGTGATTGG AGCTCAGAAG CCACAATATGATATCTGGGG CAACACTGTC 
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AATGTGG CCAGTAGGATGGACAG CACCGGAGTCCTGGACAAAATACAGGTTAC CG AGG 
AGACGAGCCTCGTCCTGCAGACCCTCGGATACACGTGCACCTGTCGAGGAATAATCAA 
CGTGAAAGGAAAGGGGGACCTGAAGACGTACTTTGTAAACACAGAAATGTCAAGGTCC 
CTTTCCCAGAG CAACGTGG CAT CCTGAAGAGTCACCTT CATTTTGGCAAGAAGACTGT 
ATTTTCAGGAAGGTATCACACACTTTCTGACTGCAACTTCTGTCCCTT 





ORF Start: ATG at 15 


ORF Stop: TGA at 3273 




SEQ ID NO: 110 


1086 aa 


MWat 122956.2kD 


NOV33a, 
CG95165-01 Protein Sequence 


MRRRRYLRDRSEEAAGGGDGLPRSRDWLYESYYCMSQQHPLIVFLIJ^IVMGSCI^LA 
VFFALGLETVEDHVAFLITVPTALAI FFAI FILVCI ESVFKKLLRLFSLVI WICLVAMG 
YLFMCFGGTVS PWDQVSFFLFI I FWYTMLPFNMRDAI I ASVLTSSSHTI VLSVCLSA 
TPGGKEHLVWQIIANVIIFICGNLAGAYHKHLMELALQQTYQDTCNCIKSRIKLEFEK 
RQQERLLLSLLPAHI AMEMKAEI IQRLQGPKAGQMENTNNFHNLYVKRHTNVS ILYAD 
IVGFTRLASDCSPGELVHMLNELFGKFDQIAKENECMRIKILGDCYYCVSGLPISLPN 
HAKNCVKMGLDMCEAI KKVRDATGVDINMRVGVHSGNVLCGVIGLQKWQYDVWSHDVT 
LANHMEAGGVPGRVHISSVTLEHLNGAYKVEEGDGDIRDPYLKQHLVKTYFVINPKGE 
RRSPQHLFRPRHTIJDGAKMRASVRMTRYLESWGAAKPFAHIJiHRDSMTTENGKISTTD 
VPMGQHNFQNRTLRTKSQKKRFEEELNERMIQAIDGINAQKQWLKSEDIQRISLLFYN 
KVLE KEYRATALPAFKYYVTCACL I FFCI F I VQ I LVLPKTSVLG I S FGAAFLLIiAFI L 
FVCFAGQLLQCSKKASPLLMWLLKSSGI IANRPWPRI SLTI ITTAI ILMMAVFNMFFL 
SDSEETI PPTANTTNTSFS ASNNQVAI LRAQNLFFLPYFI YSCI LGLI SCSVFLRVNY 
ELKMLIMMVALVGYOTILLHTHAHVLGDYSQVLFERPGIWKDLKTMGSVSLSIFFITL 
LVLGRQNEYYCRLDFLWKNKFKKEREE I ETMENLNRVLLENVLPAHVAEHFLARSLKN 
EELYHQS YD CVCVMF AS I PDFKE FYTE SD VNKEGL ECLRLLNE 1 1 ADFDDLLSKP K FS 
GVEKIKTIGSTYMAATGLSAVPSQEHSQEPERQYMHIGTMVEFAFALVGKLDAINKHS 
FNDFKLRVGINHGPVIAGVIGAQKPQYDIWGNTVNVASRMDSTGVLDKIQVTEETSLV 
LQTLGYTCTCRGI INVKGKGDLKTYFVNTEMSRSLSQSNVAS 



Further analysis of the NOV33a protein yielded the following properties shown in 
Table 33B. 



Table 33B, Protein Sequence Properties NOV33a 


PSort analysis: 


0.8000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.3000 
probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 65 and 66 



A search of the NOV33a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 33C. 
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Table 33C. Geneseq Results for NOV33a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV33a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


A A Ffi9Q"i 

8 


nuuidn aaenyiate cyclase 25678 - 

Homo sapiens, 1086 aa 
[WO200144453-A1, 2 1 -JUN-200 1 ] 


I. .1086 
1..I086 


1086/1086(100%) 
1086/1086(100%) 


0.0 


6 


Adenylyl cyclase type 1I-C2 C2 alpha 
domain - Homo sapiens, 1090 aa 
[US6107076-A, 22-AUG-2000] 


1..1086 
1..1090 


i UJ7/ 1 \jy\J {yj/o) 

1062/1090 (97%) 


0.0 


AAR9456 j 
0 


Rat adenylyl cyclase - Rattus sp, 1090 
aa. [WO9608260-A1, 2I-MAR-1996] 


1..1086 
1..1090 


1039/1090 (95%) 
1062/1090 (97%) 


0.0 


AAU0I92 
4 


Human adenylate cyclase polypeptide - 
Homo sapiens, 1077 aa. 
[WO200125448-A1, 12-APR-2001] 


22.. 1078 
I0..1069 


605/1070 (56%) 
772/1070 (71%) 


0.0 


AAB0200 

! 8 

j 


Type IV adenylyl cyclase - Homo 
sapiens, 1064 aa. [US6 107076- A, 22- 
AUG-2000] 


22.. 1082 
I0..1062 


609/1072(56%) 
769/1072(70%) 


0.0 

1 



In a BLAST search of public sequence datbases, the NOV33a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 33D. 
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Table 33D. Public BLASTP Results for NOV33a 


Protein 
Accession 
! Number 

j 


Protein/Organism/Length 


NOV33a 
Residues/ 
Match Residues 


IH<*nt itipo/ 

Similarities for 
the Matched 
Portion 


Expect 
Value 


P26769 

i 


Adenylate cyclase, type 11 (EC 
4.6.1.1) (ATP pyrophosphate-lyase) 
(Adenylyl cyclase) - Rattus 
norvegicus (Rat), 1090 aa. 


L.I 086 
1.1090 


1039/1090 f95%) 
1062/1090(97%) 


0.0 


Q08462 

I 


Adenylate cyclase, type 11 (EC 
4.6.1.1) (ATP pyrophosphate-lyase) 
(Adenylyl cyclase) - Homo sapiens 
(Human), 887 aa (fragment). 


200.. 1086 
1..887 


887/887 (100%) 
887/887(100%) 


0.0 


f — — — — * — 
] Q91WF3 

i 


SIMILAR TO ADENYLYL j 22. .1082 
CYCLASE 4 (ADENYLYL 1 0 . 1 075 
CYCLASE TYPE 4) (EC 4.6. I.I) - j 
Mus musculus (Mouse), 1077 aa. j 


612/1074 (56%) 
780/1074 (71%) 


0.0 


i CAC37757 


SEQUENCE 2 FROM PATENT | 22.. 1078 
WOO 125448- Homo sapiens 10.. 1069 
(Human), 1077 aa. j 


605/1070(56%) 
772/1070(71%) 


0.0 


P26770 


Adenylate cyclase, type IV (EC j 22.. 1082 i 609/1072(56%) 
4.6.1.1) (ATP pyrophosphate-lyase) j 10.. 1062 j 769/1072(70%) 
(Adenylyl cyclase) - Rattus j 
norvegicus (Rat), 1064 aa. j 


0.0 



PFam analysis predicts that the NOV33a protein contains the domains shown in 
the Table 33E. 



Table 33E. Domain Analysis of NOV33a 


Pfam Domain 


NOV33a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Sodcu 


388.395 


6/8 (75%) 
8/8(100%) 


0.73 


guanylate_cyc 


276..460 


71/226 (31%) 
151/226 (67%) 


2.7e-68 


guanylate_cyc 


873..1073 


89/227 (39%) 
186/227 (82%) 


2.le-96 



Example 34. 



The NOV34 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 34A. 
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Table 34A. NOV34 Sequence Analysis 




SEQ ID NO: 111 


3117bp 


NOV34a, 
CG95 175-01 DNA Sequence 

i 

! 
i 


CCTCCCCAGTAGCTGGGACTATGGGAGCGTGCCACC^TGCCrGGTTAATTTTTGTATT 

TTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTTGTCTTCCCCTCTCCTTAGTT 

ATCCTCCTGGATTCCAAAGCCTCCCAGGCCGAGCTGGGCTGGACTGCACTGCCAAGTA 

ATGGGTGGGAGGAGATCAGCGGCGTGGATGAACACGACCGTCCCATCCGCACGTACCA 

AGTGTGCAATGTGCTGGAG CCCAACCAGG ACAACTGGCTGCAGACTGG CTGG ATAAGC 

CGTGGCCGCGGGCAGCGCATCTTCGTGGAACTGCAGTTCACACTCCGTGACTGCAGCA 

GCATCCCTGGCG CCGCGGG T ACCTGCAAGGAGACCTTCAACGTCTACT AC CTGG AAAC 

TGAGGCCGACCTGGGCCGTGGGCGTCCCCGCCTAGGCGGCAAAAT CGACACG ATCG CG 

GCGGACGAGAGCTTCACGCAGGGCGACCTGGGTGAGCGCAAGATGAAGCTGAACACAG 

AGGTGCG CG AGATCGG ACCGCTCAGCCGG CGGGGTTTCCACCTGG CCTTT CAGG ACGT 

GGGCGCATGCGTGGCGCTTGTCTCGGTGCGCGTCTACTACAAGCAGTGCCGCGCCACC 

GTGCGGGGC CTGGCCACGTTCCCAGCCACCGCAGCCGAG AGCG CCTTCTCCACACTGG 

TGGAAGTGGCCGGAACGTGCGTGGCGCIACrCGGAAGGGGAGCCTGGCAGCCCCCCACG 

CATGCACTGCGGCGCCGACGGCGAGTGGCTGGTGCCTGTGGGCCGCTGCAGCTGCAGC 

GCGGGATTCCAGGAGCGTGGTGACTTCTGCGAAGGTATCTGTCCCCCAGGGTTTTACA 

AGGTGTCCCCGCGGCGGCCCCTCTGCTCACCGTGCCCAGAGCACAGCCGGGCCCTGGA 

AAACGCCTCCACCTTCTGCGTGTGCCAGGACAGCTATGCGCGCTCACCCACCGACCCG 

CCCTCGGCTTCCTGCACCCGTCCGCCGTCGGCGCCGCGGGACCTGCAGTACAGCCTGA 

GCCGCTCGCCGCTGGTGCTGCGACTGCGCTGGCTGCCGCCGGCCGACTCGGGAGGCCG 

CTCGGACGTCACCTACTCGCTGCTGTGCCTGCGCTGCGGCCGCGAGGGCCCGGCGGGC 

GCCTGCGAGGGGCCGCGCGTGGCCTTCCTACCGCGCCAGGCAGGGCTGCGGGAGCGAG 

CCGCCACGCTGCTGCACCTGCGGCCCGGCGCGCGCTACACCGTGCGCGTGGCCGCGCT 

CAACGGCGTCT CGGGCCCGGCGGCCGCCG CGGGAACCACCTACGCGCAGGTCACCGTC 

TCCACCGGGCCCGGGGGTAAGGCCGTCCGCGCCCCCCACCCCGAGGCCACCGCGCCTG 

CCGCCCCTGCGCCCTCTTGGGGCCGCCCCGTCGGTCCTGCGGGATCAGCGCCCTGGGA 

GGAGG ATGAG AT CCG CAGGG ACCGAGTGGAACCCCAGAGCGTGTCCCTGTCGTGG CGG 

GAGCCCATCCCTGCCGGAGCCCCTGGGGCCAATGACACGGAGTACGAGATCCGATACT 

ACGAGAAGGTGAGTGCGCAGAGTGAGCAGACTTACTCCATGGTGAAGACAGGGGCGCC 

C^C^GTCACOSTGATTTTCCTCCCAGCTGCCTCAGGGTCCAGGGACCAGAGCCCCGCC 

ATTGTCGTCACCGTAGTGACCATCTCGGCCCTCCTCGTCCTGGGCTCCGTGATGAGTG 

TGCTGGCCATTTGGAGGAGGAGGCCCTGC^GCrrATGGa^AAGGAGGAGGGGATGCCCA 

TGATGAAGAGGAGCTGTATTTCCACTGTGAGTTGGCTGGGAAAGTCCCAACACGTCGC 

ACATTCCTGGACCCCCAGAGCTGTGGGGACCTGCTG^GGCrGTGCATCTGTTCGCCA 

AGGAACTGGATGCGAAAAGCGTCACGCTGGAGAGGAGCCTTGGAGGAGGCAAGCTGGG 

CGGGCGGTTTGGGGAGCTGTGCTGTGGCTGCITGCAGCTCCCCGGTCGCCAGGAGCr 

CTCGTAGCCGTGCACATGCTGAGGGACAGCGCCTCCGACTCAC^GAGGCTCGGCTTCC 

CGTTGTTACCCG AGGTAGG GGAAG CAC CTTG ATGATTGTCACCG AGTACATGAG C CAT 
GGGGCCCTGGAC^CTTCCTC^GGCMC^GCACGAGGGGCAGCTGGTGGCTGGGCAAC 
TGATGGGGTTGCTGCCTGGGCTGG CAT CAGCCATGAAGTATCTGTCAGAGATGGGCTA 
CGTTCACCGGGGCCTGGCAGCTCGCCATGTGCTGGTC^ 
ATCTCTGGCTTCGGGraGGGCCCCO^GACCGATCAGAG 

TGAGGCTACAGAGTGGCCGGAGCCCRGCGCTATGGGCCGCTCCCX5AGACACTTCAGTT 
TGGCCACITC^GCTCTGCCAGTGACGTGTGGAGCrTCGGCATC!ATCATGTGGGAGGTG 
ATGGCCTTTGGGGAGCGGCCTTACTGGGACATGTCTGGCCJ^GACGTGGTGATCAAGG 
CTGTGGAGGATGGCTTCCGGCTGCCACCCCCCAGGAACTGTC 

ACTAATGCTCGACTGCTGGCAGAAGGACCCAGGtGAGCGGCCCAGGTTCTCCCAGATC 
CACAGCATCCTGAGCAAGATGGTGCAGGACCC^GAGCCCCCCAAGTGTGCCCTGACTA 
CCTGTCCCAGGCCTCrGACCCGCAGGCCrCCCACT 

C^CCTTCCCCTCCTTTGGCTCTGTGGGCXSCGTGGCTGGAGGCCCTGGACCTGTGCCGC 
TACAAGGACAGCTTCGCGG CTGCTGGCTATGGGAGCCTGGAGG CCGTGGCCGAGATGA 
CTAGCCAGGACCTGGTGAGCCTAGGC^TCTCTTTGGCTGAAC^TCGAGAGGCCCTCCT 
CAGCX3GGATCAGCGCCCTGCAGGCACX3AGTGCTCCAGCT 
GTGTG^GTGGACCCCATTCTTCCAAGGCAGGACTCCGGTGGGG 
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ORF Start: ATG at 21 


ORF Stop: TGA at 3078 




SEQIDNO: 112 


1019 aa 


MWat 110412,5kD 


NOV34a, 
CG95 175-01 Protein Sequence 


MGACHHAWLI FVFLVEMGFHHVGQACLPLSLVI LLDS KASQAELGWTALPSNGWEEI S 
GVDEHDRPIRTYQVCNVLE PNQDNWLQTGWI S RGRGQRI FVELQFTLRDCS S I PGAAG 
TCKETFN\ryYLETEADLGRGRPRLGGKI DTI AADES FTQGDLGERKMKLNT EVRE IGP 
LSRRGFHLAFQDVGACVALVSVRVYYKQCRATVRGLATFPATAAESAFSTLVEVAGTC 
VAHSEGEPGSPPRMHCGADGEWLVPVGRCSCSAGFQERGDFCEGICPPGFYKVSPRRP 
LCS PC P EHSRALENAST FCVCQDS YARS PTD PP S ASCTRP PS APRDLQYS LS RS P LVL 
RLRWLPPADSGGRSDVTYSLLCLRCGREGPAGACEGPRVAFLPRQAGLRERAATLLHL 
RPGARYTVRVAALNGVSGPAAAAGTTYAQVTVS TG PGG KAVRAPHPE ATAP AAPAPSW 
GRPVGPAGSAPWEEDEIRRDRVEPQSVSLSWREPIPAGAPGANDTEYEIRYYEKVSAQ 
SEQTYSMVKTGAPTVTVIFLPAASGSRDQSPAIVVTVVTISALLVI^SVMSVI^IWRR 
RPCSYGKGGGDAHDEEELYFHCELAGKVPTRRTFLDPQSCGDLLQAVHLFAKELDAKS 
VTLERSLGGGKLGGRFGELCCGCLQLPGRQELLVAVHMLRDSASDSQRLGFLAEALTL 
GQFDHSHIVRLEGWTRGRGSTLMIVTEYMSHGALDGFLRQRHEGQLVAGQLMGLLPG 
LASAMKYLSEMGYVHRGLAARHVLVSSDLVCKISGFG^ 

S PALWAAPETLQFGHFSS ASDVWSFGI I MWEVMAFGERPYWDMSGQDWIKAVEDGFR 
LPPPRNCPNLLHPJjMLDCWQKDPGERPRFSQIHSILSKMVQDPEPPKCALTTCPRPLT 
RRPPTPLADRAFSTFPSFGSVGAWLEALDLCRYKDSFAAAGYGSLEAVAEMTSQDLVS 
IiG ISLAEHREALLSGI SALQARVLQLQGQGVQV 



Further analysis of the NOV34a protein yielded the following properties shown in 
Table 34B. 



Table 34B. Protein Sequence Properties NOV34a 


. PSort analysis: 


0.4600 probability located in plasma membrane; 0.1000 probability located in 
endoplasmic reticulura(membrane); 0.1000 probability located in endoplasmic 
reticulum (lumen); 0.1000 probability located in outside 


SigrialP analysis: 


Cleavage site between residues 43 and 44 



A search of the NOV34a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 34C. 
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Table 34C. Geneseq Results for NOV34a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV34a Residues/ 
Match Residues 


i u en u Lies/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM47209 

i 

! 

i 


Human NOV3 protein - Homo 

sapiens, 1000 aa. 
[ WO200 1 7485 1 - A2, 1 1 -OCT- 
2001] 


7..I019 
9.. 1000 


933/1054 (88%) 
936/1054 (88%) 


0.0 


AAU03553 


Human protein kinase #53 - 
Homo sapiens, 1009 aa. 
[WO200138503-A2, 3 1-MAY- 
2001] 


24..1019 
18.1009 


913/1048 (87%) 
916/1048 (87%) 


0.0 


AAW0342I 


Mouse developmental kinase I - 
Mus sp, 998 aa. [WO9621013- 
Al, ll-JUL-1996] 


7..1019 
9..998 


522/1055(49%) 
694/1055(65%) 


0.0 


AAR85092 

i 


EPH-like receptor protein 
tyrosine kinase HEK1 1 - Homo 
sapiens, 998 aa. [W09528484- 
Al,26-OCT-1995] 


7..1019 
9..998 


520/1055(49%) 
696/1055 (65%) 


0.0 


j AAR85090 

i 

I 

I 


EPH-like receptor protein 
tyrosine kinase HEK7 - Homo 
sapiens, 991 aa. [W09528484- 
Al,26-OCT-1995] 


28..1010 
33..985 


457/1018(44%) 
641/1018(62%) 


0.0 



In a BLAST search of public sequence datbases, the NOV34a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 34D. 
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Table 34D. Public BLASTP Results for NOV34a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV34a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAD 13085 


SEQUENCE 5 FROM 
PATENT WOO 174851 - 
Homo sapiens (Human), 
lOOOaa. 


7.1019 
9.. 1 000 


933/1054 (88%) 
936/1054 (88%) 


0.0 


CAD23752 


SEQUENCE 1 FROM 
PATENT WO0208253 - 
Homo sapiens (Human), 
974 aa (fragment). 


33..I019 
I..974 


925/1027 f90%^ 
926/1027 (90%) 


0 O 


CAD23753 


SEQUENCE 3 FROM 
PATENT WO0208253 - 
Homo sapiens (Human), 
1022 aa (fragment). 


33..1019 
1..I022 


930/1071 (86%) 
931/1071 (86%) 


0.0 


CAD23754 


SEQUENCE 5 FROM 
PATENT WO0208253 - 
Homo sapiens (Human), 
647 aa (fragment). 


33..653 
1..623 


575/661 (86%) 
576/661 (86%) 


0.0 


( CAD23755 

1 

i 


SEQUENCE 7 FROM 
PATENT WO0208253 - 
Homo sapiens (Human), 
695 aa (fragment). 


33..653 
I..67I 


580/705 (82%) 
581/705 (82%) 


0.0 



PFam analysis predicts that the NOV34a protein contains the domains shown in 
the Table 34E. 



Table 34E. Domain Analysis of NOV34a 


Pfam Domain 


NOV34a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


EPHJbd 


32..203 


115/178(65%) 
156/178 (88%) 


4.4e-119 


m3 


33L.426 


25/98 (26%) 
66/98 (67%) 


2e-06 


pkinase 


671. .903 


70/269 (26%) 
167/269(62%) 


l.le-37 


SAM 


942.. 1 006 


27/68 (40%) 
50/68 (74%) 


2.9e-15 
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Example 35. 

TheNOV35 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 35A. 



Table 35A. NOV35 Sequence Analysis 




SEQ ID NO: 113 


765 bp 


NOV35a, 
CG95693-0I DNA Sequence 


ATGGTGCTCTTTGGCTGGCTTTTTCGCAGAGTTCGCAGAGAGAATGTTATCTTTGGCA 
TTCTAACAGTGATGTCAATACAAGGTTATGCAAACCTCTGTAATCAATGGAGCATAAC 
AGGAGAATTTAATGATTTGCCTCAGGAAGAACTTTTACAGTGGATCAAATACAATACC 
GTACCAGGTGACAACTCAGTGGGTCCAGCTCTTGCTGGACAGGCAGAGTGGTCTCTCC 
TCATTTCCACCACTACCACAGGCCCGTGGG AGGTAATG CCTGG CTACCG CCAACATTC 
ACTTAAGGCTGAAGGACCTTTCAGTCAGCTTGTGGTGAGTGCTGCCAGGCCTGGAACT 
CACCCirCAGGGCAGTGGGCTCTCTTCCAGCCCAGGGCAGATGCTGTCTTTGCAGGTG 
CCATGCCTACAATGGCAAGTGTCAAGCTGTCTACACTTCATCCCATTGTGAATCATCC 
ACATT ACGAAG ATG C AG ACTTGAG AAATTG TGAATTACGTTACTCACATGG AAAAAGG 
AAATTTGCAAATGTGATTAAATTAATGCTCATGAGATGTGGAGATTATCCTTATTATC 
CAGGG ATC ATGGG ATGTGGTCTG CACACTCAACTG AGAAGGTC CAGTAATGGCAAGGC 
AGTCAC C AGGGGACCCATGTC CTC AG AGG AC CTGGCTC CTCAGAGAAACCGCAGGTTC 
ACTGGGGGAGATTATCAGGACTTTAGACCTCAGGATGAAGAGACACACTCCAGGAGGG 
GCTGTGGATAG 




ORF Start: ATG at 1 


ORF Stop: TAG at 763 




SEQ ID NO: 114 


254 aa 


MWat2853L2kD 


NOV35a, 
^CG95693-01 Protein Sequence 

1 


MVLFGWLFRRVRRENVI FGILTVMS IQGYANLCNQWSITGEFNDLPQEELLQWIKYNT 
VPGDNSVGPALAGQAEWSLLISTTTTGPWEVMPGYRQHSLKAEGPFSQLWSAARPGT 
HPSGQWALFQPRADAVFAGAMPTMASVKLSTLHPIVNHPHYEDADLRNCELRYSHGKR 
KFANVIKI^LMRCGDYPYYPGIMGCGLHTQLRRSSNGKAVTRGPMSSEDLAPQRNRRF 
TGGD YQDFRPQDEETHS RRG CG 



Further analysis of the NOV35a protein yielded the following properties shown in 
Table 35B. 



Table 35B. Protein Sequence Properties NOV35a 


PSort analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.3592 
probability located in lysosome (lumen); 0.2463 probability located in microbody 
(peroxisome); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignaiP analysis: 


Cleavage site between residues 14 and 15 



A search of the NOV35a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 35C. 
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Table 35C. Geneseq Results for NOV35a 


Geneseq 

IHpntifiPr 


Protein/Organism/Length [Patent ft, 
L/atej 


NOV35a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAW85723 


Novel protein (Clone AX56_28) - 

nomo sapiens, 1 / I aa. 
[WO9920644-A1, 29-APR-1999] 


3..100 
2_>..l !:> 


59/98 (60%) 
64/98 (65%) 


4e-23 


AAG03191 


Human secreted protein, SEQ ID 
NO: 7272 - Homo sapiens, 102 aa. 
[EP1033401-A2, 06-SEP-2000] 


3..58 
23..78 


46/56 (82%) 
48/56 (85%) 


le-19 


AAM4I346 


Human polypeptide SEQ ID NO 
6277 - Homo sapiens, 618 aa. 
[WO2001533I2-A1, 26-JUL-2001] 


3..60 
434..491 


36/58 (62%) 
45/58 (77%) 


4e-15 


AAM39560 


Human polypeptide SEQ ID NO 
2705 - Homo sapiens, 6 1 8 aa. 
[ WO200 1 533 1 2-A 1 , 26-JUL-200 1 ] 


3..60 
434..491 


36/58 (62%) 
45/58 (77%) 


4e-15 


AAG68230 


Cycline-dependent human kinase 14 
protein SEQ ID NO:2 - Homo 
sapiens, 129 aa. [WO200175013- 
A2, ll-OCT-2001] 


I30..180 
5..56 


38/52 (73%) 
41/52 (78%) 


3e-12 



In a BLAST search of public sequence datbases, the NOV35a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 35D. 
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Table 35D. Public BLASTP Results for NOV35a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV35a Residues/ 
Match Residues 


IHpntitip*;/ 

Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96LZ9 

• 


CDNA FLJ32949 FIS, CLONE 
I bb I U008020, WEAKLY 

SIMILAR TO DPY-19 
PROTEIN - Homo sapiens 
(Human), 758 aa. 


3..100 
576..668 


59/98 (60%) 
64/98 (65%) 


le-22 


094954 


KIAA0877 PROTEIN - Homo 
sapiens (Human), 580 aa 
(fragment). 


3..60 
396..453 


36/58 (62%) 
45/58 (77%) 


Ie-14 


S44629 


F22B7. 10 protein - 
Caenorhabditis elegans, 628 aa. 


90..183 
464..551 


41/96 (42%) 
51/96 (52%) 


le-09 


P34413 


Protein dpy-I9 - Caenorhabditis 
elegans, 683 aa. 


90..183 
519..606 


41/96 (42%) 
51/96 (52%) 


le-09 


Q9VWR8 


CG6659 PROTEIN - Drosophila 
melanogaster (Fruit fly), 872 aa. 


I21..I58 
721. .758 


17/38(44%) 
22/38 (57%) 


0.077 



PFam analysis predicts that theNOV35a protein contains the domains shown in 
the Table 35E. 



Table 35E. Domain Analysis of NOV35a 



Pfam Domain 



NOV35a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 36. 

The NOV36 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 36A. 
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Table 36A. NOV36 Sequence Analysis 




SEQIDNO:115 


3500 bp 


NOV36a, 
CG958 14-01 DNA Sequence 


GCGCTGCAGTTCTCCCGGCTATGCATGGGCTGCGGCTCCGGCTCTAGCACAGGCACCA 

GCCGCCGCCGCACCCGGCCCCAGCGCCCACCGTCTGCATGTGCCCGCCGTAGCCGTCT 

GCCCAGCCCGCAGCCCGCGCTCCACGGAGCGCTGGAGACCACCGTGGGGGGCCCCTTC 

TGCCCTCGAGAGAAGCGGTCTTGGAGGTATGGATTTAGGTGGTTGGATTTTTTCCGTG 

GATCTATCAATTCAC^ATTCGAATTTGGAAGAAAGAAGGAAAACATGACGTCTCCAGC 

CAAATTCAAAAAGGATAAGGAGATCATAGCAGAGTACGATACTCAGGTCAAAGAGATC 

CGTGCTCAGCTCACAGAGCAGATGAAATGCCTGGACCAGCAGTGTGAGCTTCGGGTGC 

AACTGTTGCAGGACCTCCAGGACTTCTTCCGAAAGAAGGCAGAGATTGAGATGGACTA 

CTCCCGCAACCTGGAGAAGCTGGCAGAACGCTTCCTGGCCAAGACACGCAGCACCAAG 

GACCAGCAATTCAAGAAGGATCAGAATGTTCTCTCTCCAGTCAACTGCTGGAATCTCC 

TCTTAAACCAGGTGAAGCGGGAAAGCAGGGACCATACCACCCTGAGTGACATCTACCT 

GAATAATATCATTCCTCGATTTGTACAAGTCAGCGAGGACTCAGGAAGACTCTTTAAA 

AAGAGTAAAGAAGTCGGCCAGCAGCTCCAAGATGATTTGATGAAGGTCCTGAACGAGC 

TCTACTCGGTCATGAAGACATATCACATGTACAATGCCGACAGCATCAGTGCTCAGAG 

CAAACTAAAGGAGGCGG AG AAG CAGGAGG AG AAG CAAATTGGTAAATCGGTAAAGC AG 

GAGGACCGGCAGACCCCACGCTCCCCTGACTCCACGGCCAACGTTCGCATTGAGGAGA 

AACATGTC CGGAGGAG CTCAGTGAAGAAG ATTGAGAAGATG AAGGAGAAGCACCAAGC 

CAAGTACACGGAGAATAAGCTGAAGGCCATCAAAGCCCGGAATGAGTACTTGCTGGCT 

TTGGAGGCAACCAATGCATCTGTCTTCAAGTACTACATCCATGACCTATCTGACCTTA 

TTGATCAGTGTTGTGACTTAGGCTACCATGCAAGTCTGAACCGGGCTCTACGCACCTT 

CCTCTCTGCTGAGTTAAACCTGGAACAGTCGAAGCATGAGGGTCTGGATGCCATCGAG 

AATGCAGT AGAAAACCTGGATGCCACCAGTGAC AAGCAGCGCCTCATGG AGATG T ACA 

ACAACGTCTTCTGCCCCCCTATGAAGTTTGAGTTTCAGCCCCACATGGGGGATATGGC 

TTCCCAGCTCTGTGCCCAGCAG CCTGTCCAG AGTG AGCTGGTACAG AG ATG C C AACAA 

CTGCAGTCTCGCTTATCCACTCTAAAGATTGAAAACGAAGAGGTAAAGAAGACAATGG 

AGGCCACCCTGCAAACCATCCAGGACATTGTG ACTGTCGAGG ACTTTG ATGTG T CTG A 

CTGCTTCCAGTACT^GCAACTCCATGGAGTCCGTCAAGTCCACGGTCTCTGAAACCTTC 

ATGGGCAAGC CCAGCATTGCTAAGAGGAGAGC C AACCAGCAAGAGACAGAGCAGTTTT 

ATTTCACAAAAATGAAAGAGTACCTGGAGGGCAGGAACCTCATCACCAAGTTACAAGC 

CAAGCATGACCTTCTGCAGAAAACCCTGGGAGAAAGTCAGCGGACAGATTGCAGTCTA 

GCCAGGCGC^GCTC^CrGTGAGGAAACAGGACTCCAGCCAGGCAATTCCTCTAGTGG 

TGGAAAGCTGTATCCGGTTTATCAGCAGACACGGACTACAGCATGAAGGAATTTTCCG 

GGTGTCAGGATCCCAGGTGGAAGTGAATGACATCAAAAATGCCTTTGAGAGAGGAGAG 

GACCCCCTGGCTGGGGACCAGAACGACCATGACATGGATTCCATAGCTGGTGTCCTGA 

AGCTTTACTTCCGGGGGCTGGAACACCCTCTCTTCCCCAAGGACATCTTTCATGACCT 

GATGGCCTGCGTC^CAATGGACMCCTGCAGGAGAGAGCrrCTGCACATCCGGAAAGTC 

CTCCTAGT CCTGCCCAAAACC ACTCTG ATTATCATGAGATACCTCTTTG CCTT CCTCA 

ATCATTTATCACAGTTCAGTGAAGAGAAC^TGATGGACCCCTACAACCTCGCCATCTG 

CTTCGGGCCCTCGCrAATGTCAGTGCCAGAGGGCCACGACCAGGTGTCCTGCCAAGCC 

CACGTGAATGAGCTGATCAAAACCATCATCATCCAGCATGAGAACA 

CCAGGG AG CTGG AGGGCCCTGTCTACAG CAGAGG AGGAAG CATGG AGG ATTACTGTG A 

TAGCCCTCATGGAGAGACTACCTCGGTTGAAGACTCAACCCAGGATGTGACCGCAGAG 

CACCACACGAGCGATGACGAATGTGAGCCCATCGAGGCCATTGCCAAGTTTGACTACG 

TGGGCCAXaACAGCCCGAGAGCTCsrCvJTT 1 

GCGGGCTTCCGACGACTGGTGGGAAGGCCGGCACAATGGCATCGACGGACTCATCCCC 

CATCAGTACATCGTGGTCCAAGACACCGAGGACGGTGTCGTGGAGAGGTCCAGCCCCA 

AGTCTGAGATTGAGGTCATTTCTGAGCCACCTGAAGAAAAGGTGACAGCCAGAGCGGG 

GGCCAGCTGTCCCAGTGGGGGTCATGTAG CCG ATATTTATCTTG CAAACATCAACAAG 

CAAAGGAAGCGTCCAGAATCTGGGAGC!ATCCGGAAAACTTTTCGGAGTGACAG 

GGCTGAGCAGTTCCCTGACTGACTCCTCCTCCCCAGGGGTGGGGGCTAGCTGCCGCCC 

ATCCTCCCAG CCCATCATGAGCCAGAG CCTCCCCAAAGAAGGGCCAGATAAGTGTTCC 

ATCAGTGGGCACGGGAGCCTCAACTCCATCAGCCGCCACTCATCCCTGAAGAATCGGC 

TGG ATAGT CCACAG ATCCGG AAGACTG C(^CAGCGGG AAGGT G^AAAAGCTTCAATAA 

CCATCGGCCCATGGACCCTGAGGTCATTG CTCAGGATATTGAGG CAACAATG AACTCG 

GCCCTGAATG AGCTACGGGAACTAGAACGGCAGAGCAGTGT CAAACACAC CCCTG ACG 

TGCTTCTGGACACCTTGGAGCCCCTCAAAACCTCCCCAGTGGTGGCCCCCACGTCAGA 

GCCCTCCAGCCCTCTGCACACCC^GCTCCT 

AGCGCCAGTACTGCTCGGGACATCGCCTGCGCCTTCCGGCCTGTCAAGTCTGTCAAGA 

TGGCTGCCCCGGTOtf^CCACCAGCCACACGGCCC^ 

AAATGCCACTAGCCCTGGTGTCAACTCATCA^ 

TGTACTGTCTGAGGGATAAT 
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ORFStart:ATGat25 


ORF Stop: TGA at 3490 




SEQIDNO: 116 


1155aa 


MWat 130304.7kD 


NOV36a, 
CG958 14-01 Protein Sequence 


MGCGSGSSTGTSRRRTRPQRPPSACARRSRLPSPQPALHGALETTVGGPFCPREKRSW 
RYGFRWLDFFRGS INSQFEFGRKKENMTSPAKFKKDKEI I AEYDTQVKEIRAQLTEQM 
KCLDQQCELRVQLLQDLQDFFRKKAEIEMDYSRN^ 

NVLSPVNCWNLLtiNQVKRESRDHTTLSDIYLNNI I PRFVQVSEDSGRLFKKSKEVGQQ 
LQDDLMKVLNELYSVMKTYHMYNADS I SAQSKLKEAEKQEEKQI GKSVKQEDRQTPRS 
PDSTANVRI EEKHVRRS SVKKI EKMKEKHQAKYTENKLKAI KARNEYLIiALEATNASV 
FKYYIHDLSDL IDQCCDLGYHASLNRALRTFLSAELNLEQSKHEGLDAI ENAVENLDA 
TSDKQRLMEMYNNVFCPPMKFEFQPHMGDMASQLCAQQPVQSELVQRCQQLQSRLSTL 
KIENEEVKKTMEATLQTIQDIVTVEDFDVSDCFQYSNSMESVKSTVSETFMGKPSIAK 
RRANQQETEQFYFTKM KE YLEGRNL I TKLQAKHDLLQ KTLGESQRTDCSLARRS STVR 
KQDSSQAI PLWESCI RFI SRHGLQHEGI FRVSGSQVEVND I KNAFERGEDPLAGDQN 
DHDMDSIAGVLKLYFRGLF^PLFPKDIFHDLMACVTMDNLQERALHIRlOn^VLPKTT 
LI I MRYLFAFLNHLSQ FS E ENMMD PYNLAI CFG PS LM S VPEGHDQVSCQAHVNELI KT 
III QHENI FPS PRELEG PVYSRGGSME DYCDS PHGETTSVEDSTQDVTAEHHTSDDEC 
E P I EAI AKFDYVGRTARELS FKKG ASLLLYQRASDDWWEGRHNG IDGLI PHQYI WQD 
TEDGVVERSSPKSEIEVISEPPEEKVTARAGASCPSGGHVADIYIiANINKQRKRPESG 
SIRKTFRSDSHGLSSSLTDSSSPGVGASCRPSSQPIMSQSLPKEGPDKCSISGHGSLN 
S I S RHS S LKNRLD S PQI RKTATAGRS KS FNNHRPMDP EVI AQD I EATMNS ALNELREL 
ERQSSVKHTPDWLDTLEPLKTSPWAPTSEPSSPLHTQLLKDPEPAFQRSASTAGDI 
ACAFRPVKSVKMAAPVKPPATRPKPTVFPKTNATS PGVNS STSPQSTDKSCTV 



Further analysis of the NOV36a protein yielded the following properties shown in 
Table 36B. 



1 — ' — . . 

Table 36B. Protein Sequence Properties NOV36a 

i 


j PSort analysis: 


0.9400 probability located in nucleus; 0.4936 probability located in mitochondrial 
matrix space; 0.3000 probability located in microbody (peroxisome); 0.2087 
probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV36a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 36C. 
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Table 36C. Geneseq Results for NOV36a 


Identifier 


r i uici 11/ v^rganibin/ l,ciih,ui 
[Patent #, Date] 


1>VJ V JKja IxCjIUUCoi 

Match Residues 


Identities/ 

ouiiiidi iiiv^o i \j\ 

the Matched 
Region 


Value 


AAM79887 


Human protein SEQ ID NO 
jjjj nomo sapiens, lL/yH da. 
[WO200157190-A2, 09-AUG- 
2001] 


62.; 1155 
1 1004 


1088/1094 

1090/1094 
(99%) 


0.0 


ABG 18053 


Novel human diagnostic 
protein # 1 8044 - Homo 
sapiens, i uvd aa. 
[WO200175067-A2, 11-OCT- 
2001] 


63..1155 
3.. 1095 


1089/1093 
(99%) 

(99%) 


0.0 


ABG05128 


Novel human diagnostic 

nrofpin \ 10 - Hnmn Qfinien^ 

1095 aa. [WO2001 75067- A2. 
ll-OCT-2001] 


63..1155 
3..1095 


1089/1093 
C99%) 

1089/1093 
(99%) 


0.0 


ABG 18053 


Novel human diagnostic 
protein #18044 -Homo 
sapiens, 1095 aa. 
[WO2001 75067- A2, ll-OCT- 
2001] 


63.. 1155 
3.1095 


1089/1093 
(99%) 

1089/1093 
(99%) 


0.0 


ABG05128 


Novel human diagnostic 
protein #51 19 - Homo sapiens, 
1095 aa. [WO2001 75067- A2, 
ll-OCT-2001] 


63.. 11 55 
3.. 1095 


1089/1093 
(99%) 

1089/1093 
(99%) 


0.0 



In a BLAST search of public sequence datbases, the NOV36a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 36D. 
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Table 36D. Public BLASTP Results for NOV36a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV36a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


075044 


KIAA0456 PROTEIN - Homo 
sapiens (Human), 1095 aa 
(fragment). 


63..1155 
3..1095 


1089/1093 (99%) 
1089/1093(99%) 


0.0 


Q9P2P2 


KJAAI304 PROTEIN - Homo 
sapiens (Human), 1051 aa 

\il agllicul^. 


107.. 1155 
12..1051 


719/1063 (67%) 
850/1063 (79%) 


0.0 


CAC22407 


SEQUENCE 47 FROM 
PATENT WO0075320 - 
Homo sapiens (Human), 1075 
aa. 


85..1151 
K.1062 


642/1079 (59%) 
819/1079 (75%) 


0.0 


Q925I2 

1 


RHO GTPASE- 
ACTIVATING PROTEIN - 
Mus musculus (Mouse), 987 
aa. 


204.. 1151 
8..974 


554/983 (56%) 
713/983 (72%) 


0.0 


Q91Z69 


GAP1 - Mus musculus 
(Mouse), 714 aa (fragment). 


447.. 1155 
3..7I4 


439/721 (60%) 
545/721 (74%) 


0.0 



PFam analysis predicts that the NOV36a protein contains the domains shown in 
the Table 36E. 



Table 36E. Domain Analysis of NOV36a 


Pfam Domain 


NOV36a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


FCH 


106..204 


27/109 (25%) 
81/109 (74%) 


2.2e-19 


RhoGAP 


589..741 


54/170(32%) 
128/170 (75%) 


4.4e-53 


SH3 


815..869 


20/58 (34%) 
41/58(71%) 


1.7e-ll 



Example 37. 



The NOV37 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 37A. 
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Table 37 A. NOV37 Sequence Analysis 




SEQIDNO: 117 


3790 bp 


NOV37a, 
CG95824-01 DNA Sequence 

» 


CAAACAG CGATGTCC CAGACGCAG GACTAC G AGTGCAGGAG CCATAATG T CG AC CTG C 

CGGAGTCGAGGATTCCAGGGTCGAACACTCGGTTGGAGTGGGTGGAGATCATCGAACC 

GCGCACCCGCGAGCGCATGTACGCCAACCTGGTCACCGGTGAGTGCGTGTGGGACCCG 

CCGGCCGGCGTCCGCATCAAGCGCACCAGCGAGAACCAGTGGTGGGAGCTGTTCGACC 

CCAA(^CGTCCCGCTTCTACTACTACAATGCCAGCACGCAGCGCACGGTGTGGCACCG 

GCCGCAGGGCTGCGACATCATCCCGCTGGCCAAGCTGCAGACGCTGAAGCAGAACACG 

GAGTCCCCGCGCGCCTCGGCGGAGAGCAGCCCCGGGCGCGGCAGCAGCGTCAGCCGTG 

AGGGCAGCACCAGCTCCTCCCTGGAGCCCGAGCCCGACACTGAGAAAGCGCAGGAGTT 

GCCAGCGAGGGCCGGGCGGCCCGCGGCGTTTGGGACAGTGAAGGAGGACAGCGGCAGC 

TCTTCACCACCAGGAGTGTTCCTTGAGAAGGACTATGAGATTTACCGGGATTACAGTG 

CGGACGGCCAGCTTCTTCACTACAGGACCTCCTCGCTGCGGTGGAACTCGGGCGCCAA 

AGAGCGCATGCTCATCAAGGTCGCTGATCGGGAGCCCAGCTTCCTCGCCGCCCAGGGC 

AATGGCTACGCCCCAGACGGCCCACCTGGGGTCCGCTCCCGCAGACCCTCCGGCAGCC 

AGCACTCACCCAGCCTGCAGACCTTCGCCCCGGAGGCTGACGGCACCATCTTCTTCCC 

AGAGAGGAGGCCGTCACCCTTCCTGAAGAGGGCCGAGCTCCCAGGGAGCAGCTCCCCG 

CTGCTGGC CCAG CCCCGAAAG CCCTCCGGGGACTCGCAG CCCTCCTCCCCGCGCTATG 

GCTATGAACCCCCGCTCTACGAGGAGCCCCCAGTGGAGTACCAGGCCCCCATCTACGA 

TGAGCCCCCCATGGACGTGCAATTCGAGGCTGGCGGGGGCTACCAGGCCGGCTCTCCC 

CAGCGGTCGCCGGGCCGTAAGCCCCGGCCGTTCCTCCAGCCCAACAAGCAGGGCCCCC 

CCTCGCCCTGCCAGCAGCTGGTGCTCACCAAGCAGAAGTGTCCCGAGCGCTTCCTGAG 

CCTGGAGTACAGTCCCGCCGGCAAGGAGTACGTGCGGCAGCTGGTCTACGTGGAGCAG 

GCGGGCTCCAGCCCCAAGCTGCGCGCCGGCCCGCGGCACAAGTACGCGCCCAACCCCG 

GCGGTGGTTCGTACTCCTTGCAGCCCAGCCCCTGCCTGCTGAGGGACCAGCGCCTGGG 

CGTCAAGTCCGGAGACTACAGCACCATGGAGGGACCTGAGCTGCGGCACAGCCAGCCG 

CCCACGCCGCTGCCACAGGCCCAGGAGGATGCCATGTCCTGGTCCAGCCAGCAGGACA 

CCCTGTCCTCCACAGGCTACTCCCCGGGCACGCGCAAGCGGAAGAGCAGAAAGCCCTC 

TTTGTGCCAAGCCACCAGCGCCACCCCCACTGAGGGCCCCGGGGACCTGCTTGTGGAG 

CAGCCCCTGGCCGAGGAACAGCCCCCGTGCGGGACCAGCCTCGCCCCCGTGAAGCGAG 

CGGAAGGTGAGGCCGAAGGGGCGCGGGGCGCGGCCGAGCCCTTCCTGGCGCAGGCT CG 

GCTGGCCTGGGAGGCGCAGCAGGCCCACTTCCACATGAAGCAGAGGAGCAGCTGGGAC 

TCCCAG CAGGACGGCTCTGGCTACGAGAG CG ACGGCGCCCTGCCACTG C CCATG CCCG 

GGCCGGTGGTGCGGGCCTTCAGCGAGGACGAGGCGCTGGCCCAGCAGGAGAACAGGCA 

CTGG AGGAGGGG C ACCTTCG AGAAGCTAGG CTT CCCCCAG ATCCTGCTGGAGAAGAGC 

GTCTCCGTGCAGACCAACCTGGCCTCACCAGAGCCCTACCTCCACCCCTCACAGTCTG 

AGGACCTCGCTGCCTGTGCCCAGTTCGAGAGCAGCCGGCAGAGCCGCAGCGGCGTTCC 

CAGCTCCAGCTG CGTCTTCCCCACTTTCACGCTGCG CAAGCCCTCCTCGGAGACGGAC 

ATCGAGAACTGGGCCTCCAAGCACTTCAACAAGC!ACACGCAGGGC 

AGGTGTCCATCGCCAACOTGCTGGCCTGGAGCAGCGAGTCCATCAAGAAGCCCATGAT 

CGTGACAAGCGACCGGCACGTGAAGAAGGAGGCCTGCGAGCTCITCAAGCrGATCCAG 

ATGTAC^TGGGTGACCGGOSGGCC^GGCCGACCC^CTGC^CGTGGCCCTGGAGGTGG 

CCACCAAGGGCTGGAGCGTGCAGGGCCTGCGGGACGAGCTCTACATCCAGCTGTGCCG 

GCAGACCACCGAGAACTTCCGCCTGGAGAGCCTGGCCCGCGGCTGGGAGCTCATGGCC 

ATCTGCCTGGCCTTTTTCCCGCCCACCCCCAAGTTCC^ 

TCTACCGGCACATGGACCCCGTCAATGACACTAAAGGGGTGGCGATAAGCACGTATGC 

C^GTACTGTTACC^CAAGCTACAGAAGG(^GCCCTGACCGGGGCCAAGAAGGGGCTG 

AAGAAGCCCAACGTGGAGGAGATCCGGCATGCCAAGAACGCCGTGTTCAGCCCGTCCA 

TGTTCGGCAGCG(^CTG(^GGAGGTCATGGGCaTGCAGAGAGAGCGCTACCCCGAGCG 

CCAGCTGCCOTGGGTGCAGACACGGCTCTCTGAGGAGGTGCTGGCGCTCAACGGTGAC 

C AGACAG AGGG CATCTTCAGGGTC C CTGGGU ACA 1 1 tiAU AtiLT i. bMluLLL J. bAAbL 

TGCAGGTGGACCAGTGGAAGGTGCCCACAGGCCTGGAAGACCCCCACGTCCCTGCGTC 

CCTGCTGAAGCTGTGGTACCGGGAGCTGGAGGAGCCCCTGATCCCGCACGAGTTCTAC 

GAGCAGTGCATCGCGCACTACGACAGCCCCGAGGCGGCGGTGGCCGTGGTGCACGCGC 

TGCCCCGCATC^CCGC^TGGTGCTGTGCrACCTCATCCGCTTCCrrGCAGGTCTTCGT 

GCAGCCGGCCAACGTCGCGGTCACCAAGATGGATGTCAGCAACCTGGCCATGGTGATG 

GCGCCOACTGCTTGCGCTGCa^TCCGACGACCCGCXSCGTC^TCT 

GCAAGGAGATGTCCTTCCTGCGGGTGCTCATCCAGCACCTGGACACCAGCTTCATGGA 

GGGTGTGCTGGAGCGGGGGCGCCCGGGGACAGC^aGGATGTCCrGCCGCCCCrCAGCCA 

GGCCX3AACTCCGCACTCGCTCTCCCGGCAGAGGGGCCAGAATCGCCCGGCCCAGCCCT 

GGAGCCCCCTCC^CTCCCCCAGGCCCCTGGCCCCGGCGCTCCCC^CGTCTTCTGCCTG 

GTCTGAGGGTGCAGCCAGGGCACAGCAGCGGCGGGGAGGGCGCCTCTGGCCCCCCACC 

TCACX3GCCAG1TCCCGCGGGC!ACCGCCTCGCCCTCCGCTGGCCGCGGGTCAGCTCCGA 

GAAAGTGCCTTCTGTGTCCTGGAGCCGAGCGACGCTGCCTCCTTGGGGCCGGGCTGCC 

TCCCTGTGGCTCCTGCGCGCCCTGGCCTGGGCCTC 

TCCCTTTCTCCTGTCCTCGTCCTGGCCTG(^GCrCTTCCCAGCCC 

CGACCTCTCCCaSCCTCCTCTCCCTCCCTCGGCCCGTGGTCCCCAGCTGGTGACTGCT 

CAGGAGTTTGGGGGCTCCAG 
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ORF Start: ATGat 10 


ORF Stop: TGA at 3763 




SEQ IDNO:118 


1251 aa 


MWatl38786.4kD 


NOV37a, 
CG95824-01 Protein Sequence 

i 
\ 

i 

i 

i 

1 

i 
i 


MSQTQDYECRSHNVDLPESRI PGSNTRLEWVEI I EPRTRERMYANIiVTGECVWDPPAG 
VRI KRTSENQWWELFDPNTSRFYVYNASTQRTVWHRPQGCDI I PLAKLQTLKQNTES P 
RASAESSPGRGSSVSREGSTSSSLEPEPDTEKAQELPARAGRPAAFGTVKEDSGSSSP 
PGVFLEKDYEIYRDYSADGQLLHYRTSSLRWNSGAKERMLIKVADREPSFLAAQGNGY 
APDGPPGWSRRPSGSQHSPSLQTFAPEADGTIFFPERRPSPFLKRAELPGSSSPLLA 
QPRKPSGDSQPSSPRYGYEPPLYEEPPVEYQAPIYDEPPMDVQFEAGGGYQAGSPQRS 
PGRKPRPFLQPNKQGPPSPCQQLVLTKQKCPERFLSLEYSPAGKEYVRQLVYVEQAGS 
SPKLRAGPRHKYAPNPGGGSYSLQPSPCLLRDQRLGVKSGDYSTMEGPELRHSQPPTP 
LPQAQEDAMSWSSQQDTLSSTGYSPGTRKRKSRKPSLCQATSATPTEGPGDLLVEQPL 
AEEQPPCGTSIiAPVKRAEGEAEGARGAAEPFLAQARLAWEAQQAHFHMKQRSSWDSQQ 
DGSGYESDGAL PL PM PG P WRAFS E DE ALAQQENRHWRRGTFE KLGFPQ ILLEKS VS V 
QTNLASPEPYLHPSQSEDIiAACAQFESSRQSRSGVPSSSCVFPTFTLRKPSSETDIEN 
WASKHFNKHTQGLFRRKVS I ANMiAWSSES IKKPMI VTSDRHVKKEACELFKIjIQMYM 
GDRRAKADPLHVALEVATKGWSVQGLRDELYIQLCRQTTENFRLESLARGWELMAICIi 
AFFP PT PKFHS YLEGYI YRHMD PVNDTKG VAI ST YAKYCYHKLQKAALTGAKKGLKKP 
NVEEIRHAKNAVFSPSMFGSALQEVMGMQRERYPERQLPWVQTRLSEEVLALNGDQTE 
GIFRVPGDIDEVNALKLQVDQWKVPTGLEDPHVPASLLKLWYRELEEPLIPHEFYEQC 
IAHYDS PEAAVAWHALPRINRMVLCYLI RFLQVFVQPANVAVTKMDVSNLAMVMAPN 
CLRCQSDDPRVIFENTRKEMSFLRVLIQHLDTSFMEGVLERGRPGTGGMSCRPQPGRT 
PHSLSRQRGQNRPAQPWS PLHS PRPLAPAL PTSS AWS EG AARAQQRRGGRLWP PTSRP 
VPAGTASPSAGRGSAPRKCLLCPGAERRCLLGAGLPPCGSCAPWPGPCPAAPVSPSLS 
PVLVLACSSSQPRESFPTCPRLLSLPRPWPSW 



Further analysis of the NOV37a protein yielded the following properties shown in 
Table 37B. 



i — ■ — ^ i 

j Table 37B. Protein Sequence Properties NOV37a 


PSort analysis: 


0.7000 probability located in nucleus; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV37a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 37C. 
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Table 37C. Geneseq Results for NOV37a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV37a Residues/ 
Match Residues 


Identities/ 
Similarities fhrthf* 

<J 1111 1 IUI 1 tl WO 1UI LI 

Matched Region 


Expect 
Value 


ABB59882 

; 


Drosophila melanogaster 
polypeptide SEQ ID NO 6438 - 
Drosophila me!ano°aster, 1309 
aa. [WO200171042-A2', 27- 
SEP-2001] 


681. .1083 
897.. 1309 


213/418(50%) 
282/418(66%) 


e-114 


AAE03048 


Human preoptic regulatory 
factor-2 (hPORF^ nrotein #1 - 

Homo sapiens, 75 aa. 
[WO200142464-A2, I4-JUN- 
2001] 


1009. J 083 
1 7S 


75/75(100%) 
7S/7S (\ no%^ 

/ Jl 1 J \ 1 \J\J /O J 


7e-36 


ABG06669 


Novel human diagnostic 
protein #6660 - Homo sapiens, 
322 aa. [WO2001 75067- A2, 
ll-OCT-2001] 


872..1091 
28..253 


76/230 (33%) 
1 11/230(48%) 


2e-l7 


ABG06669 

I 


Novel human diagnostic 
protein #6660 - Homo sapiens, 
322 aa. [WO200 1 75067-A2, 
ll-OCT-2001] 


872.. 1091 
28..2S3 


76/230 (33%) 
1 1 1/230 (48%) 


2e-17 


1 AAB42926 


Human ORFX ORF2690 
polypeptide sequence SEQ ID 
NO:5380 - Homo sapiens, 903 
aa. [WO200058473-A2, 05- 
OCT-2000] 


928.. 1250 
84..447 


100/377 (26%) 
142/377 (37%) 


2e-13 



In a BLAST search of public sequence datbases, the NOV37a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 37D. 



237 



WO 03/010327 



PC17US02/14199 



Table 37D. Public BLAST? Results for NOV37a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV37a Residues/ 

lvjaLvll rvcbiuuco 


Identities/ 
Similarities for the 
Matched Portion 


Expect 

V dlUC 


Q9C0H5 

i 


KIA A 1 68 8 PROTEIN - 
Homo sapiens (Human), 
1094 aa (fragment). 


1..1083 
12.. 1094 


1083/1083 (100%) 
1083/1083 (100%) 


0.0 


Q9VDE9 


CG3421 PROTEIN - 
Drosophila melanogaster 
(Fruit fly), 1309 aa. 


68 1. .1083 
897..1309 


213/418(50%) 
282/418(66%) 


e-1 13 


PI 8890 


Putative preoptic regulatory 

f*»rtnr-9 nrpntircr»r fP/^P F 9\ 
laciur-z. prcLUibor Uivr-Z ) 

- Rattus norvegicus (Rat), 75 
aa. 


1009.. 1083 


74/75 (98%) 
idi to \yy /o) 


3e-35 


Q9BKW0 

! 


HYPOTHETICAL 36.4 
KDA PROTEIN - 
Caenorhabditis elegans, 317 
aa. 


798..932 
179..307 


50/135 (37%) 
80/135 (59%) 


3e-20 


AAH23344 

! 

| 


SIMILAR TO 
HYPOTHETICAL 
PROTEIN 
DKFZP564BI 162 - Mus 
musculus (Mouse), 654 aa. 


872..1091 
22..247 


74/230 (32%) 
109/230(47%) 


4e-17 



PFam analysis predicts that the NOV37a protein contains the domains shown in 
the Table 37E. 



Table 37E. Domain Analysis of NOV37a 


Pfam Domain 


NOV37a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


WW 


65..95 


12/31 (39%) 
19/31 (61%) 


0.0054 


MyTH4 


761. .879 


34/123(28%) 
85/123 (69%) 


1.4e-19 


RhoGAP 


909..1056 


52/171 (30%) 
105/171 (61%) 


7.3e-19 



Example 38. 



The NOV38 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 38A. 
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Table 38A. NOV38 Sequence Analysis 




SEQ ID NO: 119 


1269 bp 


NOV38a, 
CG96198-0I DNA Sequence 

• 


CT^ATGTTCCATTTAGGTGAAGAAAGCCCTGTGTGTCCTCGTCCAACACCGGTCTTG 
ACAAAGCCCCCCGGGCAGCATCCTTTTCTCGCCACACTGCAGCAAGCTCCTCCCTTAG 
TTGAAGCAGAAGCTCCCACCTTCTCGGCAGCAGCCAAGGTCCGAGTGGGAGTGACCAT 


* 
i 

\ 
i 

j 


CX3GGAACCCAGCCAGCTCCGCCCCTCTCCGGTCAGGAGACTCTTCATCCATTTAGCAG 

CCGCCG ACTTACTAGTCACTTTTGTGGTTATG CCCCTAGATG CCACCTGGAATATCAC 

TGTTCAATGGCTGGCTGTGGACATCGCATGTCGGACACTGATGTTCCTGAAACTAATG 

GCCACGTATT CTGCAG CTTTCCTG CCTGTGGTCATTGG ATTG G ACCGCCAGGCAGCAG 

TACTCAACCCGCTTGGATCCCGTTCAGGTGTAAGGAAACTTCTGGGGGCAGCCTGGGG 

ACTTAGTTTCCTGCTTGCCTTCCCCCAGGTGGCTGGCCCAGTCCCTTTCACTCAGTGT 

GTCACCAAAGGCAGCTTCAAGGCn'CAATGGCAAGAGACCACCTATAACCTCTTCACCT 

TCTGCTGCCTCTTTCTGCTGCCACTGACTGCCATGGCCATCTGCTATAGCCGCATTGT 

CCTCAGTGTGTCCAGGCCC CAGACAAGGAAGG CTGGTGAATTTGCCCTCCCCCGCTCC 

TTTGACAATTGTCCCCGTGTTCGTCTCCGGGCCCTGAGACTGGCCCTGCTTATCTTGC 

TGACCTTCATCCTCTGCTGGACACCTTATTACCTACTGGGTATGTGGTACTGGTTCTC 

CCCCACCATGCTAACTGAAGTCCCTCCCAGCCTGAGCCACATCCTTTTCCTCTTGGGC 

CTCCTC^TGCTCCTTTGGATCCTCTCCTCTATGGGGCCTTCACCTCCGCCCGCCTCA 

GCCTCCCAAAGTGCTGGGATTACAGGCGTG AG CCACCGCG CCCAGCCAG CTTTGACTA 

TTTTTATACTCCTCACAATTCTCAAGCACTTCrTAGGCAGAGCACAAAAACAAAA 

AAAAAAAAACTTAGAAATATATGAATGCAAGAGATGGGAAGAGGAAAGACCATTTCCC 

AACTAGCTCAGAGTAGCCTGATCGTAACTCCAAACAGTTCACAAAATTCTCTTAGGCT 

GAAACTATTCATCTTTCCTAAATGTCCCTCATGTCTGCTAAAATCACTTTC 




ORF Start: ATG at 5 


ORF Stop: TGAat 1124 


| . 


SEQ ID NO: 120 


373 aa 


MWat4l692.8kD 


NOV38a, 
;CG96 198-01 Protein Sequence 

i 


MFHIiGEESPVCPRPTPVLTKPPGQHPFLATLQQAPPLVEAEAPTFSAAAKVRVGVTIV 
LFVS S AGGNLAVLWS VTRREPS QLRPS PVRRL F I HL AAADLLVTFVVM PLDATWN I TV 
QWIAVDIACRTLMFLKLMATYSAAFLPWIGLDRQAAVLNPLGSRSGVRKLLGAAWGL 
S FLLAF PQVAG PVPFTQ CVTKG S F KAQWQETT YNLFTFCCLFLIiPLTAMAI CYSRI VL 
SVSRPQTRKAGEFALPRS FDNCPRVRI^RALRLALLILLTF I LCWTPYYLLGMWYWFS P 
TMLTEVPPSLSHI LFLLGLLNAPLDPIiLYGAFTSARLSLPKCWDYRREPPRPAS FDYF 
YTPHNSQALLRQSTKTKKKKKLRNI 


r- "" 


SEQ ID NO: 121 


1551 bp 


NOV38b, 
CG96 198-02 DNA Sequence 


ATGTTTATCTCATCCCATTACAAGGTAAGTTCCAGAGGAGCAGAGGCCTTGTTTGTCT 
TTT CTT TT TTTT TG AGACGGT ATTT CACTGTTGTTG CCCAGG CTAGAGT A CAATGG CA 
CGATCTCGGCTC^CCACAAACTCTGCCTCCCGGGTTCAAGCGATTCrrCCTGCCTCAAC 
CTCCCGAGTAGCTGGGATTACAGGCATGCGCCACCACACCCGGCTAATTTTGTATTTT 
TAGTAGAGACGGGGTTTCTCCTGCCCACCTTCTCGGCAGCAGCCAAGGTCCGAGTGGG 
AGTGACCATTGTGCTGTTTGTTTCTTCGGCTGGAGGGAACCTGGCAGTCCTGTGGTCA 
GTGACACGGCGGGAACCCAGCCAGCTCCGCCCCTCTCCGGTCAGGAGACTCTTCATCC 
ATTTAGCAGCCGCCGACTTACTAGTCACTTTTGTGGTTATGCCCCTAGATGCCACCTG 
GAATATCACTGTTCAATGGCTGGCTGTGGACATCGCATGTCGGACACTGATGTTCCTG 
AAACTAATGG CCACG TATT CTGCAG CTTT CCTGC CTGTG GTCATTGGATTG GACCG C C 
AGGCAGCAGTACTCAACCCGCTTGGATCCCGTTCAGGTAAAAGAAAAAAGAAAGCAAT 
TATGGTTCCTGGCTGGGCGCTGAGGTTCATG CCTGTAATCCCAG CACTTTGGAAGG CT 
GAGGTGAGCAGATC^GGCCCAGTCCCTTTCACTGAGTGTGTCACCAAAGGCAGCTTCA 
AGGCTCAATGGCAAGAGACC^CCTATAACCTCTTC^CCTTCrGCTGCCTCITTCrGCT 
GCCACTGACrGCCATGGCCATCTGCTATAGCCGCATTGTCCTCAGTGTGTCCAGGGAA 
TTTGCCCTCCCCCGCTCCTTTGACAATTGTCCCCGTGTTCGTCTCCGGGCCCTGAGAC 
TGGCCCTGCTTATCTTGCTGACCTTC^TCCrCTGCTGGAC^CCTTATTACCTACTGGG 
TATGTGGTACTGGTTCTCCCCCACCATGCTAACTGAAGTCCCTCCCAGCCTGAGCCAC 
ATCCTTTTCCTITTTGGGCCTCCTCAATGCTCCT^ 

TCACCCITGGCrGCCGAAGATTATTCACTGGATCTCTGAATACCC^GGCCCCCTCC^C 

CATGGCCAGCCGGGGTGGGGGCCGGGGTCGTGGCCGGGGCCAGTTGACCTTCAACGTG 

GAGGCCGTGGG (^TTGGGCTCCACAGTAGGTACTGTG ATCATTTTAAAAAT CAGTTGG 

TTAC^TACCTACTAAAAATCCTTGTTGTTGTTGTTTTGAGACGTA 

ACC CAGGCT AAG AGTG CAGTGGTG CAGT CTCGGCTCACTG CATCCTCTGCCTCCCGGG 

TTCAAGCAATTCTCCTGCCTCAAGCCTCCTGAGTACCTGGGA 

AGCCTCCTGAGTACCTGGGACTACAGGCGCCCCCCCCCACACACACACACACACCCGG 
CTAATTTTTGTATTTTTAGTAGAGACGGGATTTCACCATGTTG 
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ORF Start: ATG at 1 


ORF Stop:TAA at 1510 




SEQ ID NO: 122 


503 aa 


MW at56199.8kD 


NOV38b, 
CG96 198-02 Protein Sequence 

! 

! 


MFISSHYKVSSRGAEALFVFSFFLRRYFTWAQARVQWHDLGS PQTL PPGFKRFS CLN 
LPSSWDYRHAPPHPANFWLVETGFLLPTFSAAAKVRVGVTIVLFVSSAGGNLAVLWS 
VTRREPSQLRPSPVRRIiF IHIJUU^LLVTFVVMPLDATVJN ITVQWLAVDI ACRTLMFL 
KLMAT YS AAFL PWI GLDRQAAVLNPLGSRSG KRKKKA I MVPGWALRFMPV I P ALWKA 
EVSRSGPVPFTQCVTKGSFKAQWQETTYNLFTFCCLFLLPLTAMAICYSRIVLSVSRE 
FALPRSFDNCPRWLRAIiRIJUjLILLTFILCWTPYYLLGMWYWFSPTMLTEVPPSLSH 
ILFLLGLLNAPLDPLLYGAFTLGCRRLFTGSLNTQAPSTMASRGGGRGRGRGQLTFNV 

eavgiglhsrycdhfknqlvtyllkilwwlrrslalsprlrvqwcslgslhplppg 
fkqfsclkppeylglqapskppeylglqapppththtpg 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 38B. 



Table 38B. Comparison of NOV38a against NOV38b. 


< ! Protein Sequence 

i 


NOV38a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


! NOV38b 

1 


4S..323 
86..369 


214/291 (73%) 
219/291 (74%) 



Further analysis of the NOV38a protein yielded the following properties shown in 
Table 38C. 



Table 38C Protein Sequence Properties NOV38a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.3000 
probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 70 and 71 



A search of the NOV38a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 38D. 
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Table 38D. Geneseq Results for NOV38a 


Geneseq 
Identifier 


Protein/Organism/Length 
TPatent # Datel 


NOV38a Residues/ 


Identities/ 
Similarities for 

Region 


Expect 
Value 


AAU10819 


Human Type 11 GnRH-R 
splice variant 1 protein #2 - 
Homo sapiens, 377 aa. 
FWO2G0178796-A1 25- 
OCT-2001) 


40..323 
26..322 


278/297 (93%) 
280/297 (93%) 


e-156 


AAU10816 


Human Type 11 GnRH-R 
splice variant 3 protein - 
Homo sapiens, 366 aa. 

f WO700 1 7X7Q6-A 1 9 5- 

OCT-2001] 


40..323 
15.31 1 


278/297 (93%) 
280/297 (93%) 


e-156 


AAU10815 


Human Type 11 GnRH-R 
splice variant 2 protein - 
Homo sapiens, 376 aa. 
[WO200 178796- A 1,25- 
OCT-2001] 


40..323 
25..321 


278/297 (93%) 
280/297 (93%) 


e-156 

i 


AAU10814 • 


Human Type II GnRH-R 
splice variant 1 protein #1 - 
Homo sapiens, 377 aa. 
[WO200178796-Al,25- 
OCT-2001] 


40..323 
26..S22 


278/297 (93%) 
280/297 (93%) 


e-156 


AAU10813 


Human Type II GnRH-R - 
Homo sapiens, 379 aa. 
[WO200178796-Al,25- 
OCT-2001J 


40..323 
28..324 


278/297 (93%) 
280/297 (93%) 


e-156 



In a BLAST search of public sequence datbases, the NOV38a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 38E. 
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Table 38E. Public BLASTP Results for NOV38a 


Protein 
Accession 
j Number 


Protein/Organism/Length 


NOV38a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96P88 


TYPE II GONADOTROPIN- 
RELEAS1NG HORMONE 
RECEPTOR - Homo sapiens 
(Human), 379 aa (fragment). 


40.. 323 
28..324 


281/297 (94%) 
283/297 (94%) 


e-158 


CAD 12280 


SEQUENCE 9 FROM 
FA I ENT WOO 178796 - Homo 
sapiens (Human), 366 aa. 


40..323 
15..311 


278/297 (93%) 
280/297(93%) 


e-156 


CAD 1 2279 


SEQUENCE 7 FROM 
PATENT WO01 78796 - Homo 
sapiens (Human), 376 aa. 


40..323 
25..321 


278/297(93%) 
280/297 (93%) 


e-156 


CAD12278 


SEQUENCE 5 FROM 
PATENT WOO 1 78796 - Homo 
sapiens (Human), 377 aa. 


40..323 
26..322 


278/297 (93%) 
280/297 (93%) 


e-156 


CAD 12277 


SEQUENCE 3 FROM 
PATENT WO01 78796 - Homo 
sapiens (Human), 379 aa. 


40..323 
28..S24 

i 


278/297 (93%) 
280/297.(93%) 


e-156 



PFam analysis predicts that the NOV38a protein contains the domains shown in 
the Table 38F. 



Table 38F. Domain Analysis of NOV38a 


Pfam Domain 


NOV38a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tmJ 


66.319 


60/287 (21%) 
191/287(67%) 


9.7e-41 



Example 39. 



The NOV39 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 39A. 
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Table 39A. NOV39 Sequence Analysis 




SEQ ID NO: 123 


1047 bp 


NOV39a, 
CG96231-01 DNA Sequence 

! 

t 

! 

1 

! 

I 


ATGTTTGGCCCCGCTAAAGGTCGCCATTTTGGAGTCCACCCGGCGCCTGGTTTCCCCG 
GCGGCGTCTCCCAACAGGCTGCCGGGACCAAAGCTGGCCCCGCGGGTGCCTGGCCTGT 
GGGCAGCCGGACCGACACGATGTGGCGGCTCCGCTGCAAGGCCAAGGACGGCACCCAT 
GTTTTG CAGGGGCTGTC CAGCCGGACCCGGGTGCX3GGAACTCCAGGGCCAAATTGCCG 
CCATCACCGGGATCGCCCCCGGCGGTCAGCGAATCCTCGTCGGATACCCTCCCGAGTG 
CCTGGATCTCAGCAATGGGGATACCATTCTGGAAGACTTGCCCATCCAATCTGGTGAC 
ATGCTGATCATTGAAGAAGACCAAACCAGGCCCAGAAGTTCACCTGCATTTACTAAAC 
GTGGTGCTTCTAGTTACGTCAGGGAAACTTTGCCTGTGCTTACCAGAACCGTGGTCCC 
AGCAGACAACTCTTGCCTCTTTACTAGTGTGTACTATGTCGTCGAAGGAGGAGTCTTG 
AATCC AGCTTGTG CCCC TGAG ATGAGACGCCTCATAGCACAAATTGTAGC AAGCGATC 
CAGACTTCTATAGTGAGGCAATACTGGGAAAAACAAATCAAGAGTACTGTGACTGGAT 
CAAAAGGGATGACACTTGGGGAGGAGCAATAGAGATATCGATTTTGTCCAAGTTTTAC 
CAATGTGAAATATGTGTAGTGGATACACAGACAGTAAGAATTGATCGTTTTGGGGAAG 
ATGCAGGATATACCAAAAGGGTTCTGCTTATTTATGATGGCATCCACTATGATCCACT 
TCAGCGTAACTTCCCTGATCCAGATACACCTCCTCTGACCATTTTCTCCTCTAATGAT 
GATATTGTTCTTGTACAAGCACTGGAATTAGCAGATGAAGCTAGAAGAAGGAGACAGT 
TTACTGATGTCAACCGCTTCACCCTGAGATGCATGGTATGTCAGAAAGGATTAACTGG 
ACAAGCAGAAGCAAGGGAACATGCCAAGGAGACAGGCCATACCAACTTTGGAGAAGTG 
TGA 


< 

j 


ORF Start: ATG at 1 


ORF Stop: TGA at 1045 


i 

5 ... . 


SEQ ID NO: 124 


348 aa 


MW at 38322.0kD 


\ NOV39a, 
!CG96231-01 Protein Sequence 


MFG PAKGRH FGVH P APG F P GG VSQQAAGTKAG P AG AWPVGSRTDTMWRLRC KAKDGTH 
VLQGLSSRTRVRELQGQIAAITGIAPGGQRILVGYPPECLDLSNGDTILEDLPIQSGD 
ML HE EDQTR PRS S P AFTKRG ASS YVRETL P VLTRTWPADN S CLFTSVYYWEGGVL 
NPACAPEMRRLIAQIVASDPDFYSEAILGKTNQEYCDWIKRDDTWGGAIEISILSKFY 
QCEICWDTQTVRIDRFGEDAGYTKRVLLIYDGIHYDPLQRNFPDPDTPPLTIFSSND 
DIVLVQALEU^EARRRRQFTDVNRFTLRCMVCQKGLTGQAEAREHAKETGHTNFGEV 


i 


SEQ ID NO: 125 


1047 bp 


! NOV39b, 
CG96231-02 DNA Sequence 


ATGTTTGGCCCCGCTAAAGGTCGCCATTTTGGAGTCCACCCGGCGCCTGGTTTCCCCG 
GCGGCGTCTCCCAACAGGCTGCCGGGACCAAAGCTGGCCCCGCGGGTGCCTGGCCTGT 
GGGCAGCCGGACCGACACGATGTGGCGG CT CCGCTGCAAGGCCAAGGACGGCACCCAT 
GTTTTGCAGGGGCTGTCCAGCCGGACCCGGGTGCGGGAACTCCAGGGCCAAATTGCCG 
CCATCACCGGGATCGCCCCCGGCGGTCAGCGAATCCTCGTCGGATACCCTCCCGAGTG 
CCTGGATCTCAGCAATGGGGATACCATTCTGGAAGACTTGCCCATCCAATCTGGTGAC 
ATGCTGATCATTGAAGAAGACCAAACCAGGCCC^GAAGTTCACCTGCATTTACTAAAC 
GTGGTGCTTCTAGTTACGTCAGGGAAACTTTGCCTGTGCTTACCAGAACCGTGGTCCC 
AGCAGACAACTCTTGCCTCTTTACTAGTGTGTACTATGTCGTCGAAGGAGGAGTCTTG 
AATCCAGCTTGTGCCCCTGAGATGAGACGCCTCATAGCACAAATTGTAGCAAGCGATC 
CAGACTTCTATAGTGAGGCAATACTGGGAAAAACAAATCAAGAGTACTGTGACTGGAT 
C AAAAGGG ATG ACACTTGGG G AGG AG CAATAGAG ATAT CGATTT TGTC CAAGTTTTAC 
CAATGTGAAATATGTGTAGTGGATACACAGACAGTAAGAATTGATCGTTTTGGGGAAG 
ATGCAGGATATACCAAAAGGGTTCTGCTTATTTATGATGGCATCCACTATGATCCACT 
TCAGCGTAACTTCCCTGATCC^GATACACCTCCTCTGACCATTTTCTCCTCTAATGAT 
GATATTGTTCTTGTACAAGCACTGGAATTAGCAGATGAAGCTAGAAGAAGGAGACAGT 
TTACTGATGTCAACCGCTTCACCCTGAGATGCATGGTATGTCAGAAAGGATTAACTGG 
ACAAGCAG AAGCAAGGGAACATGCCAAGG AG ACAGG C CAT AC CAACTTTGG AGAAGTG 
TGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 1045 




SEQ ID NO: 126 


348 aa 


MW at 38322.0kD 


NOV39b, 
CG96231-02 Protein Sequence 


MFGPAKGRHFGVHPAPGFPGGVSQQAAGTKAGPAGAWPVGSRTDTMWRLRCKAKDGTH 
VLQGLS SRTRVRELQGQI AAITGI APGGQR ILVG YPPECLDLSNGDTI LEDLP IQSGD 
ML 1 1 EEDQTRPRSS PAFTKRGASS YVRETLPVLTRTWPADNS CLFTSVYYWEGGVL 
NPACAPEMRRLI AQIVASDPDFYS EAILGKTNQEYCDWI KRDDTWGGAI EI S ILSKFY 
QCEI CWDTQTVRI DRFGEDAGYT KRVLLI YDG I HYDPLQRNF PDPDTPPLTI FSSND 
DIVLVQALEIiADEARRRRQFTDVNRFTLRCMVCQKGLTGQAEAREHAKETGHTNFGEV 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 39B. 
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Table 39B, Comparison of NOV39a against NOV39b. 


Protein Sequence 


NOV39a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV39b 


1 ..348 
I. .348 


344/348 (98%) 
344/348 (98%) 



Further analysis of the NOV39a protein yielded the following properties shown i 
Table 39C. 





Table 39C. Protein Sequence Properties NOV39a 


PSort analysis: 


0.3000 probability located in nucleus; 0.2271 probability located in lysosome 
(lumen); 0.1 109 probability located in microbody (peroxisome); 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV39a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 39D. 
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Table 39D. Geneseq Results for NOV39a 




Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV39a Residues/ 
Match Residues 


Identities/ 
Similarities 
for the 

lviaiLtiivu 

Region 


Expect 
Value 


t 

! ABB60807 

i 

i 
! 


Drosophila melanogaster 
polypeptide SEQ ID NO 9213 
- Drosophila melanogaster, 
347 aa. [ WO200 1 7 1 042-A2, 
27-SEP-2001] 


49..348 

7.347 
* 


144/343 
(41%) 
196/343 


2e-64 


AAG53575 

i 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 68223 - 
Arabidopsis thaliana, 146aa. 
[EP1033405-A2, 06-SEP- 
2000] 


148..271 
4.. 124 


63/124 
(50%) 
87/124 

[py/o) 


8e-31 


| AAG20460 

! : 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 22659 - 
Arabidopsis thaliana, 146 aa. 

2000] 


148..271 
4..124 


63/124 
(50%) 
87/124 
(69%) 


8e-31 


AAG53576 

< 
i 

i 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 68224 - 
Arabidopsis thaliana, 122 aa. 
[EP1033405-A2, 06-SEP- 
2000] 


179..271 
8.. 100 


52/93 
(55%) 
67/93 
(71%) 


le-24 


AAG20461 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 22660 - 
Arabidopsis thaliana, 122 aa. 
[EP1033405-A2, 06-SEP- 
2000] 


179..271 
8.. 100 


52/93 
(55%) 
67/93 
(71%) 


le-24 



In a BLAST search of public sequence datbases, the NOV39a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 39E. 
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Table 39E. Public BLAST? Results for NOV39a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV39a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAL39440 


GM14814P-Drosophila 
melanogaster (Fruit fly), 
347 aa. 


49..34S 
7..347 


144/343(41%) 
196/343 (56%) 


5e-64 


Q9VRJ9 


CG4603 PROTEIN - 
Drosophila melanogaster 
(Fruit fly), 347 aa. 


49.348 
7..347 


144/343 (41%) 
196/343 (56%) 


5e-64 


Q9LPT6 


F11F12.1 PROTEIM 
(HYPOTHETICAL 23.4 
KDA PROTEIN) - 

Arabidopsis thaliana 
(Mouse-ear cress), 208 aa. 


148..347 
4..206 


96/206 (46%) 
131/206 (62%) 


le-44 


0 13974 

i 

i 
i 

i 

i 

i 


HYPOTHETICAL 35.7 
KDA PROTEIN 
C24C9.14IN 
CHROMOSOME 1 - 
Schizosaccharomyces 
pombe (Fission yeast), 
329 aa. 


48..347 
5..328 


101/333 (30%) 
161/333 (48%) 


4e-36 


P43558 


Hypothetical 33.5 kDa 
protein in SEC53-FET5 

intergenic region - 
Saccharomyces cerevisiae 
(Baker's yeast), 301 aa. 


92..347 
39..299 


90/278 (32%) 
144/278 (51%) 


7e-33 



PFam analysis predicts that the NOV39a protein contains the domains shown in 
the Table 39F. 



Table 39F. Domain Analysis of NOV39a 


Pfam Domain 


NOV39a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


OTU 


154..269 


40/135 (30%) 
87/135 (64%) 


l.le-18 



Example 40. 



The NOV40 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 40A. 
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Table 40A. NOV40 Sequence Analysis 




SEQ ID NO: 127 


591 bp 


NOV40a, 


AACAGAGGTTCAGCTGGGAGACCCCTCACACACAGGTGAAGGCGTGGCTGTAGAACCT 


CG96260-01 DNA Sequence 


CTTCTGCTCCCCCACCCAC^GCAGTTTCTGGCAGTTTCAGAGGAGGGTCAAACACATC 


! 

! 


ATAAAGGGATCCCCGTGGATGACACTGACAGGCACAGCCCCTCATCTCCCTCTCCCTA 
CGAGAAGCTGAAGGAGTTCAGCTGCCAGCCTGTGTTGAACAGCTACCAGTTCCACATC 




GGCTGAAGGCCTGTGAGTGTGACAAGCAATCCGTGCACTGCTTCAAAGAGAGCCTGCC 
CACCT ATG AG AAAAACTT CAAG C AG TTCT CCAG CC AG CCCAGG TGTGG CAG AC AT AAG 
CCCTGGTGCTAGGGACACCACAGGGTCCCTCTCATCATCCAGCATCCGCTCTAGTGTT 
GCTCTTCCAGG 




ORF Start: ATG at 61 


ORF Stop: TAG at 532 




SEQ ID NO: 128 


157 aa 


MWatl7585.0kD 


NOV40a, 
CG96260-0! Protein Sequence 


MDPLVSSGMKVIAILTLLLFCSPTHSSFWQFQRRVKHITGRSAFFSYYGYGCYCGLGD 
KGIPVDDTDRHSPSSPSPYEKLKEFSCQPVLNSYQFHIVNGAWGGCTLGPGASCHCR 
LKACECDKQSVHCFKESLPTYEKNFKQFSSQPRCGRHKPWC 



Further analysis of the NOV40a protein yielded the following properties shown in 
Table 40B. 



( — ■ — — ' ' — ' 1 

! Table 40B. Protein Sequence Properties NOV40a 

i 


PSort analysis: 


0.8200 probability located in endoplasmic reticulum (membrane); 0.2222 probability 
located in microbody (peroxisome); 0.1900 probability located in plasma membrane; 
0.1000 probability located in endoplasmic reticulum (lumen) 


i 

J SignalP analysis: 


Cleavage site between residues 27 and 28 



A search of the NOV40a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 40C. 
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Table 40C. Geneseq Results for NOV40a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV40a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


? AAG63223 


Amino acid sequence of a 
human lipid metabolism 
enzyme - Homo sapiens, 1 52 
aa. [WO200I53468-A2, 26- 
JUL-2001] 


11..157 
2.. 152 


139/151 (92%) 
140/151 (92%) 


le-82 


A AT? f\1(\dd 

r\r\ IVDj \J l t l t 


RPLA2-8 - Rattus sp, 1 58 aa. 
[WO9502328-A, 26-JAN- 
1995] 


1-157 
1..158 


106/160 (66%) 
121/160 (IWA 


le-55 


AAB 12536 


Mouse secretory 
phospholipase A2 protein 
sequence SEQ ID NO: 14 - 
Musmusculus, 144 aa. 
[WO200034486-A1, I5-JUN- 
2000] 


17.. 157 
10.. 144 


48/141 (34%) 
70/141 (49%) 


le-17 


AAB 11994 

i 

i 


Mouse secreted phospholipase 
A2 - Mus musculus, 144 aa. 
[JP2000166544-A, 20-JUN- 
2000] 


'17..I57 
10.. 144 


48/141 (34%) 
70/141 (49%) 


le-17 


AAB56432 


Human prostate cancer antigen 

protein sequence SEQ ID 
NO: 1010 - Homo sapiens, 164 
aa.[WO200055174-Al,2N 
SEP-2000] 


9..150 
21..164 


48/150 (32%) 
72/150(48%) 


2e-16 



In a BLAST search of public sequence datbases, the NOV40a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 40D. 
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Table 40D. Public BLASTP Results for NOV40a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV40a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Rxneet 
Value 


B54762 


phospholipase A2 (EC 3.1.1.4) 
RPLA2-8 precursor - rat, 158 aa. 


1..157 
1..158 


106/160 (66%) 
121/160 (75%) 


2e-55 


P39878 


Group IIC secretory phospholipase 
A2 precursor (EC 3. 1.1.4) 
(Phosphatidylcholine 2- 
acylhydrolase GIIC) (GIIC 
SPLA2) (PLA2-8)(14kDa 
phospholipase A2) - Rattus 
norvegicus (Rat), 1 50 aa. 


9.. 157 
1 .150 


99/152 (65%) 
114/152 (74%) 


8e-52 


P48076 


Group IIC secretory phospholipase 
A2 precursor (EC 3. 1 . 1 .4) 
(Phosphatidylcholine 2- 
acylhydrolase GIIC) (GIIC 
sPLA2) (PLA2-8)(14kDa 
phospholipase A2) - Mus 
musculus (Mouse), 1 50 aa. 


9.. 1 57 
1..150 


98/152 (64%) 
114/152 (74%) 


9e-51 


A54762 


phospholipase A2 (EC 3. 1 . 1 .4) 
MPL2-8 - mouse, 130 aa 
(fragment). 


27.. 157 
130 


90/132 (68%) 
103/132(77%) 


5e-48 


A60718 


phospholipase A2 homolog, non- 
pancreatic - human, 50 aa 
(fragment). 


22..68 
4..50 


46/47 (97%) 
47/47 (99%) 


6e-23 



PFam analysis predicts that the NOV40a protein contains the domains shown in 
the Table 40E. 



Table 40E. Domain Analysis of NOV40a 


Pfam Domain 


NOV40a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


phoslip 


27..68 


19/43 (44o/ 0 ) 
31/43 (72%) 


1.7e-13 


phoslip 


77.. 150 


20/82(24%) 
52/82(63%) 


4.2e-06 



Example 41. 



The NOV41 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 41 A. 
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Table 41 A. NOV41 Sequence Analysis 




SEQ ID NO: 129 


952 bp 


NOV41a, 
CG96364-01 DNA Sequence 


CCTGCCGCCTGCCTCCTCTTTCCTTTCAACATGACAGATGCCGCTGTGTCCTTCACCA 
AGGACTTCCTGGCAGGTG<5AGTGGCAGCAGCCATCTCCAAGATGGGGGTAGCGCCC7VT 
CGAGAGGGTCAAGCTGCTGCTGGAGGTACAG CATGCCAGCAAGCAGATCACCG CAGAT 
ATGCAATACAAGGGCATTATAGACTGCGTGGTCCATATTCTCAAGGAGCAGGGAGTCC 
TGTCCTTCTGGCACCGTAACCTGGTCJy\TGTCCTCAGATACTTCCCCACCCAGGCTCT 
CAACTTGGCCTTCAAAGATAAACACAAGCAGATCTTCCTGGGTGGTGTGGACAAGAGG 
ACCCAGTTTGGGAGCTACTTTGCAGGGAATCCGGCATCAGGTGGTGCCGCTGGGGCCA 
CATCCTTGTGTTtTGTGTACCCTCTTGATTTTGCCCATACCTGTCTAGCAGCTGACGT 
GGGTAAAGCTGGAGCTGAAAGGGAATTCCGAGGCCTCGGTAACTGCCTGGTTAAGATC 
TACGAAACTGATGGGATTAAGGGCCTGTACCAAGGCTTTAACGTGTCTGTGCAGGGTA 
TTATCATCTACCGAGCTGCCTACTTTGGTATCTATGACACTGCAAAGGGAATGCTTCC 
GGATCCCAGGAACACTCACATCATCATCAGCTGGATGATCACTCAGACTGTCACTGCC 
ATTGCTGGGTTGACTTCCCATCCATTTGACACCGTTCGATGCCGCATGATGATGCAGT 
CAGGGCGCAAAGGAATTGACGTCATGTACACAGGCACGCTTGACTGCTGGAGGAAGAT 
TGTTGGTGATGAAGGAGG CAAAGATTTTTTCAAGGGTGCATGGTCCAGTGTT CTCAGA 
GGCATGGATGGTGCTTTTGTGCTTGTCTTGTATGGTGAAATCAAGAAGTACACATAAG 
TTATTTCCTGGGATTTCCCCCACT 




ORF Start: ATG at 3 1 


ORF Stop: TAA at 925 




SEQ ID NO: 130 


298 aa 


MW at 32668.6kD 


NOV41a, 
CG96364-01 Protein Sequence 

i 

| 


MTDAAVS FTKDFLAGGVAAAI SKMGVAPI ERVKLLLEVQHASKQITADMQYKG I IDCV 
VHILKEQ<3VLSFWHRNLVNVLR^ 

PASGGAAGATSLCFVYPLD FAHTCLAADVGKAGAEREFRGLGNCLVKI YETDG I KGLY 
QGFNVSVQGIIIYRAAYFGIYDTAKGMLPDPRNTHIIISWMITQTVTAIAGLTSHPFD 
TVRCRMMMQS GRKG I DVMYTGTLDC WRK I VG D EGGKDFF KG AW S S VLRGMDGAFVLVL 
YGEIKKYT 




SEQ ID NO: 131 


745 bp 


! NOV41b, 

' CG96364-03 DNA Senuenrp 

!' 


CCGCAGCGCCGTAGTCAAACCGAACCCGGCCCAGTCCCGTCCTGCAGCAGTCTGCCTC 
CTTCTTTCMCATGACAGATGCCGCATTGTCCTTCGCCAAGGACTTCCTGGCAGGTGG 
AGTGGCCGCAGCCATCTCCAAGACGGCGGTAGCGCCCATCGAGCGGGTCAAGCTGCTG 
CTGCAGGTGCAGCATGCCAG CAAGC AG ATCACTG CAGAT AAG CAATACAAAGGCATTA 
T AG ACTGCGTGGTCCGTATT CCCAAGGAGCAGGAAGTTCTGTCCTTCTGG CGCGGTAA 
CCTGGCCAATGTCATCAGAT ACTTCCCCACC CAGGCTCTTAACTTCG CCTTCAAAGAT 
AAATACAAGCAGATCTTCCTGGGTGGTGTGGACAAGAGAACCCAGTTTTGGCGCTACT 
TTGCAGGGAATCTGGCATCGGGTGGTGCCGCAGGGGCCACATCCCTGTG'riTTGTGTA 
CCCTCTTGATTTTGCCCGTACCCGTCTAGCAGCTGATGTGGGTAAAGCTGGAGCTGAA 
AGGGAATTCCGAGGCCTCGGTGACTGCCTGGTTAAGATCTACAAATCTGATGGGATTA 
AGGGCCTGTACCAAGGCCTTGACTGCTGGCGGAAGATTGCTCGTGATGAAGGAGGCAA 
AGCTTTTTTCAAGGGTGCATGGTCCAATGTTCTCAGAGG CATGGGTGGTG CTTTTGTG 
CTTGTCTTGTATGATGAAATCAAGAAGTACACATAAGTTATTTCCTAGG 




ORF Start: ATG at 70 


ORF Stop: TAA at 730 




SEQ ID NO: 132 


220 aa 


MWat24272.9kD 


NOV41b, 
CG96364-03 Protein Sequence 


MTD AALS FAKD FLAGG VAAAI S KTAVAP I ERVKLLLQVQHAS KQITADKQYKGI IDCV 
VRI PKEQEVLSFWRGNLANVIRYFPTQALNFAFKDKYKQI FLGGVDKRTQ FWRYFAGN 
LASGGAAGATSLCFVYPLDFARTRLAADVGKAGAEREFRGLGDCLVKIYKSDGIKGLY 
QGLDC^RKIARDEGGKAFFKGAWSNVUIGMGGAFVLVLYDEIKJ<YT 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 41B. 



Table 41B. Comparison of NOV41a against NOV41b. 


Protein Sequence 


NOV4 la Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV41b 


1..178 
1..178 


142/178(79%) 
150/178 (83%) 



250 



WO 03/010327 PCT/US02/14199 



Further analysis of the NOV41a protein yielded the following properties shown in 
Table 41 C. 



Table 41C. Protein Sequence Properties NOV41a 


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 03600 probability located in 
mitochondrial matrix space; 0.3000 probability located in mitochondrial 
interrnembrane space; 0.2883 probability located in lysosome (lumen) 


SignalP analysis: ; 


No Known Signal Sequence Predicted 



A search of the NOV41a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 4 1 D. 



Table 41D. Geneseq Results for NOV41a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NO V4 1 a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU 10379 


Human adenine nucleotide 
translocator 2 (ANT2) - Homo 

sapiens, 298 aa. 
[WO200185944-A2, 15-NOV- 
2001] 


1..298 
1..298 


261/298 (87%) 
274/298 (91%) 


e-150 


AAU01199 


Human adenine nucleotide 
transIocator-2 (ANT-2) protein - 

Homo sapiens, 298 aa. 
[WO200132876-A2, 10-MAY- 
2001] 


1..298 
1..298 


261/298 (87%) 
274/298 (91%) 


e-150 


AAY71032 


Human adenine nucleotide 
translocator ANT2 - Homo 
sapiens, 298 aa. 
[WO200026370-A2, 11-MAY- 
2000] 


1..298 
1.298 


261/298 (87%) 
274/298(91%) 


e-150 


AAU10380 


Human adenine nucleotide 
translocator 3 (ANT3) - Homo 

sapiens, 298 aa. 
[WO200185944-A2, 15-NOV- 
2001] 


1..296 
L.296 


242/296 (81%) 
267/296 (89%) 


e-I43 


AAU01200 


Human adenine nucleotide 
translocator-3 (ANT-3) protein - 

Homo sapiens, 298 aa. 
[WO200132876-A2, 10-MAY- 
2001] 


1..296 
1..296 


242/296(81%) 
267/296 (89%) 


e-143 
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In a BLAST search of public sequence datbases, the NOV41a protein was found to 
have homology to the proteins shown in the BLAST? data in Table 41E. 



Table 41E. Public BLASTP Results for NOV41a 


ri oiein 
Accession 
j Number 


Protein/Organism/Length 


NOV4 la Residues/ 
Match Residues 


Identities/ 
Similarities for the 
jviaicnea r onion 


Expect 
Value 


P05141 


ADP,ATP carrier protein, 
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide translocator2) 
(ANT 2) - Homo sapiens 
(Human), 298 aa. 


1..298 
I. .298 


263/298 (88%) 
275/298 (92%) 


e-151 


Q09073 

i 
! 


A DP, ATP carrier protein, 
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide trans locator 2) 
(ANT 2) - Rattus norvegicus 
(Rat), 298 aa. 


1..298 
L.298 


261/298 (87%) 
274/298 (91%) 


e-150 


A29132 


ADP,ATP carrier protein T2 - 
human, 298 aa. 


L.298 
1..298 


261/298 (87%) 
274/298 (91%) 


e-I50 


j P5I88I 

i 
j 


A DP, ATP carrier protein, 
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide translocator 2) 
(ANT 2) - Mus musculus 
(Mouse), 298 aa. 


I. .298 
L.298 


260/298 (87%) 
273/298 (91%) 


e-149 


BAB84673 \ 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 2 - Bos 
taurus (Bovine), 298 aa. 


L.298 
L.298 


259/298 (86%) 
273/298 (90%) 


e-149 



PFam analysis predicts that the NOV41a protein contains the domains shown 
the Table 4 IF. 



in 





Table 41F. Domain Analysis of NOV41a 


Pfam Domain 


NOV4 la Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mitotan* 


7.. 105 


34/125 (27%) 
85/125 (68%) 


4.6e-22 


mitocarr 


112..208 


35/125 (28%) 
82/125 (66%) 


7.4e-22 


mitq_carr 


209..298 


27/125 (22%) 
66/125 (53%) 


3.2e~09 



252 



WO 03/010327 



PCT/US02/14199 



Example 42. 

The NOV42 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 42A. 



{ — — • 

Table 42A. NOV42 Sequence Analysis 




SEQIDNO: 133 


936 bp 


NOV42a, 
CG96422-01 DNA Sequence 


GCCACCATGACGGAAGAAGCCATCTCCTTCGCCAAGGACTTCCTAGCTGGAGGrATrA 
CCGCCGCTATCTCCAAGACAGCCGTGGCATCGATCAAGCGGGTCCAGCTGCTGCTGCA 
GATGCAGCACGCCAGCATGCCGATGGCCGCCGCCAAGCAGTGCAAGGGCATCGTGGAC 




cc^tgtcatccgctactccccc^cgc^gcccro^tttcgccttcmggataagta 

caagcagatcttcctggcgggcgtggacaagcacacgcagttctgcaggtactttgcg 

ggcjvacctggcctctggcggcacggccgtcgtgtatcccctggatttcaccagaaccc 

gcctggcagccgacgtgggaaagtcaggcacggagcgcgagttccgaggcctgggaga 

ctgcctagtgaagatcjm3caagtccgacggcatccggggcct 

gtctccgtgcaggccatcatcatctaccaggcagcctacttcagggtgtacgatacgg 

ccaatggcatgttccccgaccccaagaacacacacatcctggtgagctggatgaccgc 

gcagaccgtgacggccgtggctggcgtgctctccaaacccttcgacacagtgcggcgg 

cggacgatgatgcagtcccggcgcaaaggagctgacatcatgtacacgggcaccgtcg 

ACrGTTGGAGGAAGATCTTCAGAGATGAGCGGGGCGAGGCCTTTTTCAAGGGTGTGTG 

gtccaacgccctcaaaggcatgggggtgggggcgggcttcgtgctggtcctgtacgac 
gagcrgaaggtcagctaagggccttggcatcctccacacacacacauaggaaccaag 
agaaccac 




ORF Start: ATG at 7 


ORF Stop: TAA at 886 




SEQIDNO: 134 


293 aa 


MW at 32354.3kD 


NOV42a, 
CG96422-01 Protein Sequence 


MTEQAI S FAKD FLAGG I TAAI SKTAVAS I KRVQLLLQMQHASM PMAAAKQCKG IVDCI 
VRI PKDQGVLS FWRGNLANVI RYSPTQALNFAFKDKYKQI FLAGVDKHTQFCRYFAGN 
IiASGGTAWYPLDFTRTRLAADVGKSGTERE FRGLGDCLVKI SKSDG IRGLYQGFSVS 
VQAI 1 1 YQAAYFRVYDTANGMFPDPKNTH ILVSWMT AQTVTAVAGVLSKPFDTVRRRT 
MMQS RR KG AD I MYTGTVD CWRK I FRD ERG EAF F KGVWSN AL KG MGVG AG FVL. VL YD E L 
KVS 



Further analysis of the NOV42a protein yielded the following properties shown in 
Table 42B. 



Table 42B. Protein Sequence Properties NOV42a 


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located 
in cytoplasm; 0.2809 probability located in lysosome (lumen); 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV42a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 42C. 
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Table 42C Geneseq Results for NOV42a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV42a Residues/ 
Match Residues 


Identities/ 

^imilaritiAC 4~nr tlif* 
OiiilJlal lllco lUl UlC 

Matched Region 


Expect 
Value 


A AU 10380 

i 

i 

I 


Human adenine nucleotide 
translocator3 (ANT3)- 
Homo ^aniens ^95? aa 
[WO200185944-A2, 15- 
NOV-2001] 


1..291 
L.295 


253/297 (85%) 
264/297 (88%) 


e-140 


AAU01200 


Human adenine nucleotide 

trarKloratnr-^ fAWTV^ 
u aiuii/waiui j j j j 

protein - Homo sapiens, 298 
aa. [WO2001 32876- A2, 10- 
MAY-2001] 


1..291 


253/297 (85%) 

'X/l/OOT /COO/A 


e-140 


AAM41427 


Human polypeptide SEQ ID 
NO 6358 - Homo sapiens, 
323 aa. [WO200153312-AI, 
26-JUL-2001] 


1..291 
26..320 


253/297 (85%) 
264/297 C88%) 


e-140 


AAM39641 

! 


Human polypeptide SEQ ID 
NO 2786 - Homo sapiens, 
298 aa. [WO200153312-A1, 
26-JUL-2001] 


I. .291 
1..295 


253/297 (85%) 
264/297 (88%) 


e-140 


1 AAY71033 


Human adenine nucleotide 
translocator ANT3 - Homo 
sapiens, 298 aa. 
[WO200026370-A2, 11- 
MAY-2000] 


1..291 
1..295 


253/297 (85%) 
264/297 (88%) 


e-140 



In a BLAST search of public sequence datbases, the NOV42a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 42D. 
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Table 42D. Public BLASTP Results for NOV42a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV42a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


PI 2236 


ADP,ATP carrier protein, liver 
isoform T2 (ADP/ATP 
translocase 3) (Adenine 
nucleotide translocator 3) (ANT 
3) - Homo sapiens (Human), 
298 aa. 


1..291 
1..295 


253/297 (85%) 
264/297 (88%) 


e-140 


P32007 


ADP,ATP carrier protein, 
isoform T2 (ADP/ATP 
translocase 3) (Adenine 

mif^lf^rttiH** tr2*nclr\r , a+r»r *X\ ( A \IT 

iiuticuiiuc ir dnbiut/dior j ) ^/\i> l 

3) - Bos taurus (Bovine), 298 
aa - 


1..291 
1.295 


246/297 (82%) 
261/297 (87%) 


e-136 


P05141 


ADP,ATP carrier protein, 
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide translocator 2) (ANT 
2) - Homo sapiens (Human), 
298 aa. 


1..291 
1..295 


234/297 (78%) 
256/297 (85%) 


e-132 


BAB84673 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 2 - Bos 
taurus (Bovine), 298 aa. 


1..291 • 
1..295 


233/297 (78%) 
254/297 (85%) 


e-131 


Q09073 


ADP,ATP carrier protein, 
fibroblast isoform (ADP/ATP 

translocase 2) (Adenine 
nucleotide translocator 2) (ANT 
2) - Rattus norvegicus (Rat), 
298 aa. 


I. .291 
1..295 


232/297 (78%) 
255/297 (85%) 


e-131 
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PFam analysis predicts that the NOV42a protein contains the domains shown in 
the Table 42E. 



Table 42E. Domain Analysis of NO V42a 


Pfam Domain 


NOV42a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mito_carr 


7.. 105 


33/125 (26%) 
83/125 (66%) 


9.2e-23 


mito_carr 


112.202 


38/125 (30%) 
78/125 (62%) 


1.9e-18 


mito_carr 


203. .293 


25/125 (20%) 
69/125(55%) 


2.9e-05 



Example 43. 

The NOV43 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 43 A. 



Table 43A. NOV43 Sequence Analysis 




SEQ ID NO: 135 


858 bp 


NOV43a, 
CG96442-01 DNA Sequence 


ATGTTATGGGTATCTTTGATCTCTGAATTTGAAGGAGGGCTTACTGAGTGTGTCTGGC 

TCATTGG CACTCCTGAGGCAGAACACTTTTCAGGCAAGG GAGTACTACCTGTG CTGTT 

CAAATTACGTCTTGTGGGTGGCCAACTACCTCATCTGAATGCAGGGCTTGTGGTGGTT 

TGGTATCGGAACTCATCATC^TCCTTTCCAACCTCCATAGGGCACCTTC 

AGAAACTGGAAAGAAAAATGCTTGCTCTAGGGCTCCTTGGGCAGTGGCTCAGTGGCAT 

CTGCCTC^GTGGCTCCCCTTTTCCATACTCCT 

GGGTATTTAGAAGTGGCCrrACTTTCATGTCAAGGAAGAAAGTAAGAGAGAGAGAGAGA 
GAGAGGAGAACGGAGAACATTCTCCCC^TCAAAATGTCCOTTCCCAAATTCTATC^G^ 
GCCTCITCTAGGCTTTGAGAGCATGTACTGTGTACACGCTGCTATCACTGCATTTCAT 
CCCCrATGGCTGTCCCCTTC^CTGCACGGGGCAGCCCTGTGCTTTCCGTTTTTCAAGA 
TCGAATGTAGCIAAACAAGTTATCTATGCAGTTAATGATGATTTATTGAGGGCCTACAA 
AACGGGCACAGTGGCAACGGCTTCGGGGAAGAAAAGAGGGCAAGGTGCACGGGAAGAA 
ATGAGCTCAAAGGAGATTGTGCCAATTACTGCACCTGTCAGTGTTCAACAACAGATAA 
AATTAATTCGGAAACCCAAACAGTCTGCAGAAAACCGAACACAATTATCTAAGAAATT 
AAGTTATACCGTTGCCATGACCTTTAATCAATCATCCAACCGTTAA 




ORF Start: ATG at 1 


ORF Stop: TAA at 856 




SEQ ID NO: 136 


285 aa 


MWat31776.3kD 


NOV43a, 
CG96442-01 Protein Sequence 


MLWVSLISEFEGGLTECVWLIGTPEAEHFSGKGVLPVLFKLRLVGGQLPHLNAGLVW 
WYRNSSSSFPTSIGHLPEQQKLERKMLALGIXGQW^ 

GYLEVAYFHVKEESKREREREENGEHSPHQNVPSQILSVPLLGFESMYCVHAAITAFH 
PLWLS PSLHGAALCFPFFKI ECSKQVI YAVNDDIiLRAYKTGTVATASGKKRGQGAREE 
MSSKEIVPITAPVSVQQQIKLIRKPKQSAENRTQLSKKLSYTVAMTFNQSSNR 



Further analysis of the NOV43a protein yielded the following properties shown in 
Table 43B. 
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Table 43B. Protein Sequence Properties NOV43a 


PSort analysis: 


0.4010 probability located in microbody (peroxisome); 03000 probability located in 
nucleus; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


Cleavage site between residues 16 and 17 



A search of the NOV43a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 43C. 



Table 43C. Geneseq Results for NOV43a 




Geneseq 
Identifier 


Protein/Organis 

m/Length 
[Patent #, Date] 


NOV43a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched 
Region 


Expect 
Value 


| ABG01918 

i 
1 

I 


Novel human 

diagnostic 
protein #1909- 
Homo sapiens, 

434 aa. 
[WO2001 75067- 
A2, 11-OCT- 
2001] 


59. 140 
219..314 


30/100(30%) 
41/100(41%) 


5.8 


ABG01918 


Novel human 

diagnostic 
protein #1909- 
Homo sapiens, 

434 aa. 
[ WO2001 75067- 
A2,ll-OCT- 
2001] 


59.. 140 
219..314 


30/100(30%) 
41/100(41%) 


5.8 


ABG26879 


Novel human 
diagnostic 
protein #26870- 
Homo sapiens, 

800 aa. 
[WO200175067- 
A2,ll-OCT- 
2001] 


91-156 
328..393 


20/68 (29%) 
31/68(45%) 


7.6 


ABG26879 


Novel human 
diagnostic 
protein #26870 - 
Homo sapiens, 

800 aa. 
[WO200175067- 
A2, ll-OCT- 
2001] 


91.. 156 
328..393 


20/68(29%) 
31/68(45%) 


7.6 
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AAY08899 


S. peregrina 


66..111 


17/46(36%} 


9 9 




cysteine 


286.327 


25/46(53%) 






protease-like 








protein 1 - 










Sarcophaga 










peregrina, 550 








i 


aa. 








I 


[JP11146789-A, 








t 


02-JUN-1999] 









In a BLAST search of public sequence datbases, the NOV43a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 43D. 



Table 43D. Public BLASTP Results for NOV43a 

f 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV43a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9X935 


CONSERVED 
HYPOTHETICAL 
PROTEIN - Streptomyces 
coelicolor, 309 aa. 


I3L.192 
192..248 


19/62 (30%) 
27/62 (42%) 


8.4 


Q9RNK7 


HYPOTHETICAL 
PROTEIN - Zymomonas 
mobilis, 415 aa. 


14I..178 
221. .258 


12/38(31%) 
24/38 (62%) 


8.4 



PFam analysis predicts that the NOV43a protein contains the domains shown in 
the Table 43R 



Table 43E. Domain Analysis of NOV43a 



Pfam Domain 



NOV43a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 44. 

The NOV44 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 44A. 



Table 44A. NOV44 Sequence Analysis 



SEQ ID NO: 137 



999 bp 
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NOV44a, 
CG96501-01 DNA Sequence 



GGTTTTAATGCTTCCGGGTTGCCACTG CAGTGGCATTTCTGACTGTGGGAGCCTCAGT 



TTCCCAGTCATTGATGAG CCTGAG CAGCAGCAAAGTACCACTGTAAG CCATGAGATGT 
CTCGTCTGAACTGGAAACCCTTTGTATATGACGCCCTTGCCTCTATCACTGCTGAGTT 
TGGAACTTTCCCCATGGACCTTGCCAAAACACGACIT 

lTGTCCGTTTCAAAGAAACAAAATATAGACGGATGTTTCATGCTTTGTTTTGGATCT 
ATAAAGCGGAGGGGGGATTGGCTCTGTATTCAGGAATTGCTCCTGTTTTGCAAAGACA 
AGCATCATATGGCACCATTAAAATTGGGATTTACCAAAGCTTGAAGCAATTATCTGTA 
GAACGTTTAGAAGATGAAACTCTTTTAATCAACATGATCTGTGGGGTAGTGTCAGGGG 
TGATATTTTCCACTATAGCCAATCCCACCGATGTTCTAAAGATTCGAATGCAGGCTCA 
AGGAAGTTTGTTCCAAGGGAGCATGATTGGCAGCTTCATCGATATATACCAACAGGAA 
GGCACCAGGGGTCTGTGGAGGAGTGTGGTTCCAACTGCTCAGCATGCTGCCATCGTTG 
TGGGAGTAGAGCTACCAGTCTATGATTTTACTAAGAAGCACTTAATATTGTCAGGAAT 
GATGGAAGACACAACTTTAACTCACTTTGTTTCCAGCTTTACATATGGTTTGGCTGGG 
GCTCTTGCCTCTAATCCAGGTGATGTGGCAGGCACTCACGTGATGAACCAGAGGGCAA 
TCGTGGGACATGTGGATCTCTATAAGGGCACTTTGGATGGTATTTTAAAAATGTGGAA 
ACATGAGGGCTTTTTTTTGTATTCTAAAGGATTTTGGCCAAACTGGCTTTGGCGTGGA 
CCCTGGAACATCATTCTTAAAATTACATATGAGAAGCTCAAGAGGCTTTA AATCTAAG 
GACTGAATTATAT 





ORF Start: ATG at 8 


ORF Stop: TAA at 977 




SEQIDNO: 138 


323 aa 


MWat36200.6kD 


NOV44a, 
CG96501-01 Protein Sequence 

i 


ML PGCHCS GISD03SLSFPVIDE PEQQQSTTVSH EM S RLNWKPFVYDALAS ITAE FGT 
FPMDLAKTRLQVQGQSIDWFKETKyRRMFHALFWIYKAEGGLALYSGIAPVLQRQ 
YGTI KIGI YQSLKQLS VERLEDETLLI NM I CGWSGVI FSTI ANPTDVLKIRMQAQGS 
LFQGSM I GS FID I YQQEGTRGLWRS WPTAQHAAI WGVELPVYDFTKKHLILSGMME 
DTTLTHFVSSFTYGLAGAIiASNPGDVAGTHVMNQRAIVGPTVpLYKGTLDGILKMWKHE 
GF FLYS KGFWPNWLWRGPWNI ILKITYEKLKRL 



Further analysis of the NOV44a protein yielded the following properties shown in 
Table 44B. 



Table 44B. Protein Sequence Properties NOV44a 


PSort analysis: 


0.7480 probability located in microbody (peroxisome); 0.7000 probability located in 
plasma membrane; 0.2000 probability located in endoplasmic reticulum 
(membrane); 0.1000 probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV44a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 44C. 
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Table 44C. Geneseq Results for NOV44a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV44a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAY94669 


Murine uncoupling protein 
isoform mUCP5S amino 
acid sequence - Mus 
musculus, 325 aa. 
[WO200032624-A2, 08- 
JUN-2000] 


27..323 
27..323 


257/297 (86%) 
266/297 (89%) 


e-144 


AAY94668 


Murine uncoupling protein 
isoform mUCP5L amino 
acid sequence - Mus 
musculus, 322 aa. 
[WO200032624-A2, 08- 
JUN-2000] 


27..323 
24..320 


257/297 (86%) 
266/297 (89%) 


e-144 


AAE06056 


Human gene 16 encoded 
secreted protein HM1AP86, 
SEQIDNO:118-Homo 

sapiens, 334 aa. 
[WO200151504-A1, 19- 
JUL-2001] 


27..323 
36..332 


258/297 (86%) 
264/297 (88%) 


e-144 


AAY87079 


Human secreted protein 
sequence SEQ ID NO: 1 1 8 
- Homo sapiens, 335 aa. 
[WO200004140-Al,27- 
JAN-2000] 


27..323 
36..332 


258/297 (86%) 
264/297(88%) 


e-144 


AAY94666 


Human uncoupling protein 
isoform hUCP5S amino 
acid sequence - Homo 

sapiens, 322 aa. 
[WO200032624-A2, 08- 
JLTN-2000] 


27.323 
24..320 


258/297 (86%) 
264/297 (88%) 


e-144 



In a BLAST search of public sequence datbases, the NOV44a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 44D. 
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Table 44D. Public BLASTP Results for NOV44a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV44a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9Z2B2 


Brain mitochondrial carrier 

protein-1 (BMCP-1) 
(Mitochondrial uncoupling 
protein 5) (UCP 5) (Solute 
carrier family 25, member 
14) - Mus musculus (Mouse), 
325 aa. 


27..323 
27..323 


257/297 (86%) ■ 
266/297 (89%) 


e-144 


095258 


Brain mitochondrial carrier 

protein-1 (BMCP-1) 
(Mitochondrial uncoupling 
protein 5) (UCP 5) (Solute 
carrier family 25, member 
14) - Homo sapiens 
(Human), 325 aa. 


27..323 
27..323 


258/297 (86%) 
264/297 (88%) 


e-144 


Q9JMH0 


BRAIN MITOCHONDRIAL 

CARRIER PROTEIN-1 - 
Rattus norvegicus (Rat), 322 
aa. 


27..323 
24..320 


255/297 (85%) 
265/297 (88%) 


e-143 


| Q9EP88 


BRAIN MITOCHONDRIAL 
CARRIER PROTEIN 

BMCP1 (BRAIN 
MITOCHONDRIAL 
CARRIER PROTEIN-1) - 
Rattus norvegicus (Rat), 325 
aa. 


27..323 
27..323 


255/297 (85%) 
265/297 (88%) 


e-143 


Q9CR58 


4933433D23RIK PROTEIN 
- Mus musculus (Mouse), 
291 aa. 


36..323 
1..289 


205/289 (70%) 
231/289(78%) 


e-113 



PFam analysis predicts that the NOV44a protein contains the domains shown in 
the Table 44E. 
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Table 44E. Domain Analysis of NOV44a 


Pfam Domain 


NOV44a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mito_carr 


39..138 


36/125 (29%) 
75/125 (60%) 


1.5e-I7 


mito can* 


140..231 


27/125 (22%) 
74/125 (59%) 


2.8e-2l 


. mito_carr 


234.323 


29/125 (23%) 
70/125 (56%) 


0.00021 



Example 45. 

The NOV45 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 45 A. 



J — — — ■ , . 

Table 45A. NOV45 Sequence Analysis 


i 


SEQ ID NO: 139 


1065 bp 


| NOV45a, 

! CG96557-01 DNA Sequence 


ATGCTCCGCAGACTGGCCCGAGCGGCGGCCGTGGTGGGCCTGCGCAGCGACAAGGTGA 

CCAGGCCCGGGCCGGCCATGAGGTGCGTCATGGCCXjAGGCGGTCGTGGGGGACGACGA 

CTACGGCGAAGACCGGGTCCGTGATGAGCTGCAAGAAATGGCTGTGGAGCTGCTAGGG 

CTGGGGCGC^CCCTCTTTGTGCCCTCCTUVCACTATGGCCAACCTCATCTCTGGGTCTG 

ACG ACTGCAGGTGCCTCGGCTGCGTGTC C^CCAAG AAGCAGCATTGCAGCAGCTT CTC 

CAGCAGGCGATGCTCTGTGGAGGGGCCTCC7VGGCGCAGAGGGCTGGA 

CTCCCTCCCAGAGCTGCCCCACTGCACTCTGGACTGGCTGAGCTGGAGAGGATGATCA 

CGCGGGGCCTCCAGAGTCCCTACCACC^GGTCTGCXSAGCT 

ACACAGC^TCTCGGGGGTCCGGATCCTCCCCATCMCTACCTACGCCAGGTGCACCTC 
CTGGCCCAGACCTATGGGGCCTGGGTCCACCTGGACAGGGCCCAGCTGATGAACGTGG 




CTGCCTCTCCAAGGGCCTGGGCGCCCCAGTGGGGTCCCTGGCTGGGAGGCCCAAGGAC 

TTCATTGAAGAAGCCTGGCGCCTCCAAAAAGCCCTGGGTGGAAGCATGCACCAGGTGG 

GGATGCrGGCTGOSGTGGCCCTGCTGTGCCGTa^GGGTCCCCCAGaiGGCCTGCAGG 

CTTCTGGATCCTGGGACTCAGGGTGGCCATTGAGTCACTAGGGCGGGACCCAGGGCAC 

CAGCTGCTI^CAGCAGACCCTACTCAGCGCAGAGTCCAGCCTGGGGA^ 

ACGGCCAGGATTGTGATGCTCTGCAGAGTCTGGCAGAGCCTCTAACCTGGGTCTTCCA 

TCTGCTCACTGAGTCCTCCAACAATCCCATGATGTCCCTACTTCACAATGAGGAAACA 

GAGG CCCAGAGAGGTCAAT AG 




ORF Start: ATG at 1 


ORF Stop: TAG at 1063 




SEQ ID NO: 140 


354 aa 


MW at38439.8kD 


NOV45a, 
CG96557-01 Protein Sequence 


MLRRLARAAAWGLRSDKVTRPGPAMRCVMAEAW 

LGRTLFVPSNTMANLI SGSDDCRCLGCVSTKKQHCS S FS SRRCS VEGPPGAEGWS ALP 
LPPRAAPLHSGLAELERM I TRGLQS PYHQVCELI CLENSHS I SGVRILP INYLRQVHIi 
liAQTYGAWVHIJDRAQLMNVVIiAIJIVPPTYIVEHCDSVSFCLS KGLGAPVGSLAGRPKD 
FI EEAWRLQKALGGSMHQVGMLAAVALLCRRGS PSRPAGFWI LGLRVAI ESLGRDPGH 
QI^CK?rLI^AESSIX3KPSVDGQDa3AL^^ 
EAQRGQ 



Further analysis of the NOV45a protein yielded the following properties shown in 
Table 45B. 
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Table 45B. Protein Sequence Properties NOV45a 


PSort analysis: 


0.8673 probability located in mitochondrial matrix space; 0.5542 probability located 

in mitochondrial inner membrane; 0.5542 probability located in mitochondrial 
intermembrane space; 0.5542 probability located in mitochondrial outer membrane 


Signal P analysis: 


Cleavage site between residues 17 and 18 



A search of the NOV45a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 45C. 



Table 45C. Geneseq Results for NOV45a 


j Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV45a Residues/ 
Match Residues 


Identities/ V 
Similarities for the 
Matched Region 


Expect 
Value 


AAG42377 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 52844 - 

Arabidopsis thaliana, 356 aa. 
[EPI033405-A2, 06-SEP-2000] 


I2..260 
4..24S 


98/252 (38%) 
129/252 (50%) 


4e-36 


AAG42376 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 52843 - 

Arabidopsis thaliana, 358 aa. 
[EP1033405-A2, 06-SEP-2000] 


12..260 
6..250 


98/252 (38%) 
129/252 (50%) 


4e-36 


AAG42378 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 52845 - 

Arabidopsis thaliana, 339 aa. 
[EP1033405-A2, 06-SEP-2000] 


26..260 
1..231 


88/238 (36%) 
119/238(49%) 


5e-31 


AAG 10781 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 9239 - 
Arabidopsis thaliana, 332 aa. 
[EP1033405-A2, 06-SEP-2000] 


26..260 
1..231 


94/241 (39%) 
125/241 (51%) 


5e-31 


AAG73780 


Human colon cancer antigen 
protein SEQ ID NO:4544 - 
Homo sapiens, 272 aa. 
[WO200122920-A2, 05-APR- 
2001] 


10..268 
3..251 


91/261 (34%) 
135/261 (50%) 


2e-30 



In a BLAST search of public sequence datbases, the NOV45a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 45D. 
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Table 45D. Public BLASTP Results for NOV45a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV45a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


T00716 


L-allo-threonine aldolase 
homolog F22013.ll - 
Arabidopsis thaliana, 484 aa. 


12..260 
90..334 


98/252 (38%) 
129/252 (50%) 


le-35 


AAL86346 


HYPOTHETICAL 38.9 KDA 
PROTEIN - Arabidopsis thaliana 
(Mouse-ear cress), 358 aa. 


12..260 
6..250 


98/252 (38%) 
129/252 (50%) 


le-35 


Q9FRS2 


F22013.ll - Arabidopsis 
thaliana (Mouse-ear cress), 352 
aa. 


12..260 
. 6..250 


98/252 (38%) 
129/252 (50%) 


le-35 


O0705I 

| 
i 

I 


L-allo-threonine aldolase (EC 
4.1.2.-) (L-alIo-TA)(L-alIo- 

threonine acetaldehyde- lyase) - 
Aeromonas jandaei, 338 aa. 


12..259 
4..241 


100/249(40%) 
131/249 (52%) 


3e-34 


Q8ZGE4 

| 

i 


L-ALLO-THREONINE 
ALDOLASE (EC 4.1.2.-) - 
Yersinia pestis, 339 aa. 


11. .268 
2..249 


93/260 (35%) 
136/260 (51%) 


le-31. 



PFam analysis predicts that the NOV45a protein contains the domains shown in 
the Table 45E. 



Table 45E. Domain Analysis of NOV45a 



Pfam Domain 



NOV45a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 46. 



The NOV46 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 46A. 
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Table 46A. NOV46 Sequence Analysis 




SEQ ID NO: 141 


945 bp 


NOV46a, 
j CG9658I-01 DN A Sequence 

I 

| 
j 

j _ 


TCTGCCCAGTTAAAGTGCAGGCGTTAGGCTAAAGCTGGGACAAACCTGGAGCGGCTTC 
ATTGGCTTTCCTACTTCTTTTGCATCCCAGAGATGCATAAGCTTAAATCGTCTCAGAA 
GG ACAAGGTCCG CCAGTTTATGG CGTG CACTCAGG CTGGCG AGAGAACTGCTATCT AC 
TGCTTAACGCAG AATG AGTGGAGACTAGACG AGGCCACGGACAG CTTCTTCCAAAACC 
CAGACTCGCTCCACAGGGAGTCCATGCGGAACGCTGTGGACAAGAAGAAGCTGGAGCG 
GCTGTACGGCAGGTACAAAGATCCACAAGATGAAAACAAAATTGGAGTCGATGGGATT 
CAACAGTTTTGTGATGATCTGAGCCTGGATCCTGCCAGTATCAGTGTATTGGTCATAG 
CGTGGAAGTTCAGGGCAGCAACTCAGTGTGAATTTAGCAGAAAGGAATTTCTAGATGG 
CATGACAGAACTTGGGTGTGACAGCATGGAGAAGCTAAAGGCTCTTCTGCCAAGACTG 
GAG CAGGAG CTGAAGG AC ACAG C C AAGTTT AAAG AT TTTT AT C AGTTT AC CTTCAC CT 
TCGCTAAGAACCCAGGGC^GAAAGGTTTAGACTTAGAAATGGCTGTTGCGTATTGGAA 
ATTAGTGTTATCTGGAAGGTTTAAATTTTTAGATCTCTGGAACACATTCITAATGGAA 
CATCACAAAAGATCAATTCCAAGGGACACCTGGAACCTCCTGCITGGACTTTGGAAAC^ 
TGATTGCGGATGATATGTCTAACTACGATGAAGAAGGAGCTTGGCCCGTTCTTATAGA 
TGATTTTGTAGAATATGCACGGCCAGTAGTCACAGGTGGAAAACGCAGCXrrTTTCTAG 
G C AG C AAGTTAAG CAGGAGT AAGATT ATGAAATG AT TT GT ATCC TG C AAG GAG ATTG C 
AGTCAGTTCCTGGGTGC 




ORF Start: ATG at 91 


ORF Stop: TAG at 868 


j 


SEQ ID NO: 142 


259 aa 


MWat30179.1kD 


| NOV46a, 

.CG96581-01 Protein Sequence 

i 

i — — . 1 _ 


MHKLKSSQKDKVRQFMACI^AGERTAIYCLTQNEWRLDEATDSFFQNPDSLHRESMRN 
AVDKKKLERLYGRYKDPQDENKIGVDGI QQFCDDLSLD PASI SVLVIAWKFRAATQCE 
FSRKEFLDGOTEI^CT)SMEKLKALLPRLEQEl»KDTAKFKDFYQFTFTFAKNPGQKGLD 
LEMAVAYWKLVLSGRF KFLDLWNTFLMEHHKRS I PRDTWNLLLDFGNMI ADDMSNYDE 
EG AWP VLI DDFVEYAR PWTGGKRSL F 




SEQ ID NO: 143 


808 bp 


j NOV46b, 
CG9658I-02 DNA Sequence 

j 


GAGATGCATAAGCTTAAATCGTCTCAGAAGGACAAGGTCCGCCAGTTTATGGCGTGCA 

CT CAGGCrGGCGAGAGAACTGCTATCrrACTGCTTAACGCAGAATGAGTGGAGACrrAGA 

CGAGGCC^OSGACAGCTTCTTCCAAAACCCAGACTCGCTCCACAGGGAGTCCATGCGG 

AACGCTGTGGACAAGAAGAAGCTGGAGCGGCTGTACGGCAGGTACAAAGATCCACAAG 

ATGAAAACAAAATTGGAGTCGATGGGATTCAACAGTTTTGTGATGATCTGAGCCTGGA 

TCCTGCCAGTATCAGTGTATTGGTCATAGCGTGGAAGTTaf^GCAGC^CTCAGTGT 

GAATTTAGCAGAAAGGAATTTCTAGATGGCATGACAGAACTTGGGTGTGACAGCATGG 

AGAAGCTAAAGGCTCTTCTG CCAAG ACTGG AG CAGGAG CTGAAGG ACACAG CCAAGTT 

TAAAGATTTTTATCAGTTTACCITCACCrTCGCr AAG AACCC^ G CAGAAAGGTTT A 

G ACTTAG AAATGG CTGTTGCGT ATTGG AAATTAG TGTTAT CTGGAAG GTTT AAATTTT 

TAGATCTCTGGAACACATTCTTAATGGAACATCACAAAAGATCTUV^ 

CCGGAACCTCCTGCTGGACTTTGGAAACATC^TTGCGGATGATATGTCTAACT 

G AAGAAGGAG CTTGGCC CGTTCTT AT AGATG ATTTTGT AG AATATG CACGGC CAG TAG 

TCAC^GGTGGAAAACGCAGCCTTTTCTAGGCAGCAGTTAAGCAGGAGTAAGATT 




ORF Start: ATG at 4 


ORF Stop: TAG at 781 




SEQ ID NO: 144 


259 aa 


MWat30149.1kD 


NOV46b, 
CG96581-02 Protein Sequence 


MHKLKS SQKDKVRQFMACTQ AGERTAI YCLTQNEWRLD EATDS FFQNPDSLHRESMRN 
AVD KKKLERLYGRYKD PQDE^TK I GVDG I O^FCDDLSLD PAS I SVLVIAWKFRAATQCE 
FSRKEFLDGMTELGCDSMEKLKALLPRLEQELKDTAKFKDFYQ 

LEMAVAYWKLVLSGRFKFLDLWNTFLMEHHKRS I PRDTRNLLLDFGNMI ADDMSNYDE 
EGAWPVliI DDFVEYARP WTGGKRS LF 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 46B. 
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Table 46B. Comparison of NOV46a against NOV46b. 


Protein Sequence 


NOV46a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV46b 


1..259 
I. .259 


258/259 (99%) 
258/259 (99%) 



Further analysis of the NOV46a protein yielded the following properties shown in 
Table 46C. 



Table 46C. Protein Sequence Properties NOV46a 


PSort analysis: 

i 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.1000 probability located in mitochondria! matrix space; 0.1000 
probability located in lysosome (lumen) 


| SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV46a protein against the Geneseq database, a proprietary 
5 database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 46D. 
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Table 46D. Geneseq Results for NOV46a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV46a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAM40345 


Human polypeptide SEQ ID 
NO 3490 - Homo saoiens 244 
aa. [WO200153312-Al,26- 
JUL-2001] 


16..259 
L.244 


244/244(100%) 
944/944 (\C\()0/ n \ 


e-144 


AAM42131 


Human polypeptide SEQ ID 
NO 7062 - Homo <wmipn<; 994 
aa. [WO200153312-Al,26- 
JUL-2001] 


2..204 

99 994 


200/203 (98%) 


e-115 


AAY94962 


Human secreted protein clone 
mtl24_3 protein sequence SEQ 
ID NO: 1 30 - Homo sapiens, 
244 aa. [WO200009552-A1, 
24-FEB-2000] 


16..255 
1..240 


185/240 (77%) 
218/240(90%) - 


e-1 15 


AAB93082 


Human protein sequence SEQ 
ID NO: 1 1 9 1 7 - Homo sapiens, 
186 aa. [EP 107461 7- A2, 07- 
FEB-2001] 


1..I73 
1..173 


173/173 (100%) 
173/173 (100%) 


le-98 


ABB63270 


Drosophila meianogaster 
polypeptide SEQ ID NO 16602 
- Drosophila meianogaster, 291 
aa. [WO200171042-A2, 27- 
SEP-2001] 


2..248 
5..247 


152/248 (61%) 
192/248 (77%) 


le-87 



In a BLAST search of public sequence datbases, the NOV46a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 46E. 
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Table 46E. Public BLASTP Results for NOV46a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV46a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q96GG9 


RP42 HOMOLOG - Homo 
sapiens (Human), 259 aa. 


1..255 
1..255 


201/255 (78%) 
234/255 (90%) 


e-123 


Q9QZ73 


RP42 - Mus musculus 
(Mouse), 259 aa. 


L.255 
1..255 


201/255 (78%) 
235/255 (91%) 


e-123 


Q9HCT3 


RP42 PROTEIN - Homo 
sapiens (Human), 259 aa. 


1..255 
L.255 


200/255 (78%) 
234/255 (91%) 


e-123 ' 


Q99NE7 


PUTATIVE LEUCINE- 
ZIPPER PROTEIN - Mus 
musculus domesticus (western 
European house mouse), 259 
aa. 


1 ..255 
L.255 


200/255 (78%) 
233/255 (90%) 


e-122 


AAL78673 

1 


LEUCINE ZIPPER PROTEIN 
- Homo sapiens (Human), 259 
aa. 


1 ..255 
L.255 


199/255 (78%) 
232/255 (90%) 


e-122 



PFam analysis predicts that the NO V46a protein contains the domains shown in 
the Table 46F. 



Table 46F. Domain Analysis of NOV46a 



Pfam Domain 



NOV46a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 47. 

The NO V47 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 47A. 



268 



WO 03/010327 



PCT/US02/14199 



Table 47A. NOV47 Sequence Analysis 




SEQ ID NO: 145 


1072 bp 


NOV47a, 
CG96624-01 DNA Sequence 


CCCTCCTTGTCTGTCTCCGGGATTCAGGCCTCCCTCCCTGACATGGAGAGTAACCTGT 
CTGG CCTGGTGCCTGCTG CCGGG CTGGTG CCTGCG CTGCCACCTGCTGTGACCCTGGG 
GCTG ACAG CTG CCTACACC ACCCTGTATG CCCTGCTCTTCTTCTCCGTCTATGCCCAG 
CT CTGGCTGGTG CTTCTGTATGGG CACAAGCGTCTCAGCT ATC AG ACGGTGTTCCTGG 
CC CTCTGTCTG CTCTGGG CCGCCTTG CGTACCACCCT CTTCTCCTTCTACTTCCGAGA 
TACTCCCCGCGCCAACCGCCTGGGGCCCTTGCCCTTCTGGCTTCTCTACTGCTGCCCC 
GTCTGCCTGCAGTTCTTCACCTTGACGCTTATGAACCTCTACTTTGCCCAGGTAAGGC 
TCGCTGTCCGAGGGGCCTTTGTGGGGGCCTCGCTGCTCTTTCTGCTGGTGAACGTGCT 
GTGTGCTGTGCTCTCCCATCGGCGCCGGGCACAGCCCTGGGCCCTGCTGCTTGTCCGC 
GTCCTGGTGAGCGACTCCCTGTTCGTCATCTGCGCGCTGTCrCTTGCTGCCTGCCTCT 
GCCTCGTCGCCAGGCGGGCGCCCTCCACTAGCATCTACCTGGAGGCCAAGGGGACCAG 
TGTGTGCCAGGCGGCCGCGATGGGTGGCGCCATGGTCCTGCTCTATGCCAGCCGGGCC 
TGCTACAACCTGACAGCACTGGCCTTGGCCCCCCAGAGCCGGCTGGACACCTTCGATT 
ACGACTGGTACAATGTGTCTGACCAGGCGGACCTGGTGAATGACCTGGGGAACAAAGG 
CTACCTGGTATTTGGCCTCATCCTCTTCGTGTGGGAGCTACTGCCCACCACCCTGCTG 




TCTTTGACCGG GCTGGG CACTGTGAAG ATG AGGGCTGCTCCTGG G AGCACAGCCGGGG 
TGAGAGCACC7^GGTAGGAGCCGTGGCACTGCCTCAGTACCCCTGCCCTACCCGCCCAC 
CCCGCTGGCTCCATCAAGCTATGGGGGA 




ORF Start: ATG at 43 


ORF Stop: TAG at 1000 




SEQ ID NO: 146 


3I9aa 


MW at 35534.2kD 


NOV47a, 
CG96624-01 Protein Sequence 

i 


MESNLSGLVPAAGLVPALPPAVTLGLTAAYTTLYALLFFSVYAQLWLVLLYGHKRLSY 
QTVFIALCLLWAALRTTL FS FY FRDT PRANRLGPL PFWLLYCC P VCLQFFTLTLMNLY 
FAQVRLAVRG AFVG AS LL FLLVNVLCAVL SHRRRAQPWALLLVRVLVS DS LFVI CALS 
LAACLCLVARRAPSTSIYLEAKGTSVCQAAAMGGAMVLLYASRACYNLTALAIiAPQSR 
LDTFDYDWYNVSDQADLVNDLGNKGYLVFGLILFVWELLPTTLLVGFFRVHRPPQDLF 
ASRSYFFDRAGHCEDEGCSWEHSRGESTR 


r ■ — 


SEQ ID NO: 147 




1200 bp 


NOV47b, 
CG96624-02 DNA Sequence 


GATAAGGCCTCCCTCCCTGACATGGAGAGTAACCTGTCTGGCCTGGTGCCTGCTGCCG 


GGCTGGTGCCTGCGCTGCCACCTGCTGTGACCCTGGGGCTGACAGCTGCCTACACCAC 

CCTGTATGCCCTGCTCTTCTTCTCCGTCTATGCCCAGCTCTGGCTGGTGCTTCTGTAT 

GGGCACAAGCGTCTCAGCTATCAGACGGTGTTCCTGGCCCTCTGTCTGCTCTGGGCCG 

CCTTGCGTACCACCCTCTTCTCCTTCTACTTC^ 

GGGGCCCTTGCCCTTCTGGCTTCTCTACTGCTGCCCCGTCrrcCCT 

TTGACGCTTATGAACCTCTACTTTGCCCAGX3TAAGGCTCGCTGTCCGAGG 

TGGGGGCCrCGCTGCTCTTTCTGCTGGTGAACGTGCTGTGTGCTGTGCTCTCCCATCG 

GCGCCGGGCACAGCCCTGGGCCCTGCTGCTTGTCCGCGTCCTGGTGAGCGACTCCCTG 

TTCGTCATCTGCGCGCTGTCTCTTGCTGCCrGCCTCTGCCTCGTCGCCAGGCGGGCGC 

CCTCCACTAGCATCTACCTGGAGGCCAAGGGGACCAGTGTGTGCCAGGCGGCCGCGAT 

GGGTGGCX3CCATGGTCCTGCTCTATGCCAGCCGGGCCTGCTACAACCTGACAGCACTG 

GCCTTGGCCCCCCAGAGCCGGCTGGACACCTTCGATTACGACTGGTACAATGTGTCTG 

ACCAGGTGGG CATACGCATGTCTGCCACCCAGG CGG ACCTGGTGAATGACCTGGGGAA 

CAAAGGCTACCTGGTATTTGGCCTCATCCTCTTCGTGTGGGAGCrrACTGCCCACCACC 

CTGCTGGTGGGCTTCTTCCGGGTGCACCGGCCCCCACAGGACCTGTTTGCCTCTCGGT 

CCTACTTCTTTGACCGGGCTGGGCACTGTGAAGATGAGGGCTGCTCCTGGGAGCACAG 

CCGGGGTGAGAGCACCAGTATGTCX3GGCAGTCTAGGCTCTGGGAGCTGGTATGGTGCC 

ATCGGG CGTGAGCCGGGCTGGTATGGGGG CAGCCAGACGAAGACCACTCCTCTGCTCT 

TCTCCCAGGTGCCAGGACCAGGCGGCCACCACCACAGTCTCTACTCCACCCCACAGAC 

GTGATCCCCGTCGCCTGCTGGGAAGAATACCGAGGCCCTG 




ORF Start: ATG at 22 


ORF Stop: TGAat 1162 




SEQ ID NO: 148 


380 aa 


MW at41810.1kD 


NOV47b, 
CG96624-02 Protein Sequence 


MESNLSGLVPAAGLVPALPPAVTLGLTAAYTTLYALLFFSWAQLWLVLLYGHKRLSY 
QTVFLALCLLWAALRTTLFS FYFRDTPRANRLGPLP FWLLYCCP VCLQFFTLTLMNLY 
FAQVRIAWGAFVGASLLFLLVNVLCAVLSHRRRAQPWALLLVRVLVSDSLFVICALS 
LAACLCLVARRAPSTS I YLEAKGTSVCQAAAMGGAMVLLYASRACYNLTALALAPQSR 
LDTFDYDWYNVSDQVG IRMS ATQADLVNDIX3NKGYLVFGLI LFVWELLPTTLLVGFFR 
VHRPPQDLFASRSYFFDRAGHCEDEGCSWEHSRGESTSMSGSLGSGSWYGAIGREPGW 
YGGSQTKTTPLLFSQVPGPGGHHHSLYSTPQT 
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Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 47B. 



Table 47B. Comparison of NOV47a against NOV47b. 


Protein Sequence 


NOV47a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV47b 


1..314 
1..323 


256/323 (79%) 
256/323 (79%) 



Further analysis of the NOV47a protein yielded the following properties shown in 
Table 47C. 



j . . , - . . 

| Table 47C. Protein Sequence Properties NOV47a 


PSort analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi body; 
0.1000 probability located in endoplasmic reticulum (lumen) 


; Signal P analysis: 


Cleavage site between residues 44 and 45 



5 A search of the NOV47a protein against the Geneseq database, a proprietary 

database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 47D. 
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Table 47D. Geneseq Results for NOV47a 


Geneseq 
Identifier 

j 


Prntpin/Oroanicm/I ^noth 

I i u it* ill/ \~/JgaiU5]ll/L<CllglII 

[Patent #, Date] 


iNvjv** /a tvesiaues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


■ AAEO9061 

i 


Human seven-transmembrane 
protein 19459- Homo sapiens, 
396 aa rWO700 1 S0 1 16- A? 

16-AUG-2001] 


1..318 
I..343 


317/343 (92%) 
317/343 (92%) 


e-180 


AAU04571 

i 


Human G-protein coupled 
receptor like protein, GPCR #8 
- Homo sapiens, 13 14 aa. 

rWO?001 S^4S4-A9 II IT 

% 2001] 


1..3I8 
1.343 


317/343 (92%) 
317/343 (92%) 


e-180 


AAY69992 

i 

t 

1 
t 


Human receptor-associated 
protein from Incyte clone 
786873 - Homo sapiens, 346 
aa. [WO200008155-A2, 17- 
FEB-2000] 


I ..318 
I..293 


267/343 (77%) 
267/343 (77%) 


e-142 


i AAU04572 

; 
i 

i 


Human G-protein coupled 
receptor model sequence #4 - 

Homo sapiens, 1 78 aa. 
[WO2001 53454- A2, 26-JUL- 
2001] 


56..2I8 
1..I78 


162/178 (91%) 
162/178 (91%) 


2e-85 


AAY60207 


Human endometrium tumour 
EST encoded protein 267 - 

Homo sapiens, 296 aa. 
[DE19817948-A1, 21-OCT- 
1999] 


26..289 
1..278 


159/279 (56%) 
197/279 (69%) 


2e-85 



In a BLAST search of public sequence datbases, the NOV47a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 47E. 
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Table 47E. Public BLASTP Results for NOV47a 


Protein 
Accession 
I Number 


Protein/Organism/Length 


NOV47a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Value 


! CAC69286 

| _ 


SEQUENCE 1 FROM PATENT 
WO0159U6-Homo sapiens 
(Human), 396 aa. 


1..318 
1..343 


317/343 (92%) 
317/343 (92%) 


e-180 


Q96NI9 

! 
I 

1 


CDNA FLJ3I532 FIS, CLONE 
NT2RI2000597 - Homo sapiens 
(Human), 417 aa. 


I..319 
1..344 


317/344 (92%) 
317/344(92%) - 


e-179 


: Q9NQC5 

i 

i - 


HYPOTHETICAL 23.3 ICDA 
PROTEIN PRECURSOR - 
Homo sapiens (Human), 210 aa. 


1..187 
1..201 


182/202(90%) 
182/202(90%) 


4e-96 


/~\C\ it TT-\rv 

Q9JHD9 


PUTATIVE SEVEN PASS 

TRANSMEMBRANE 
PROTEIN - Mus musculus 
(Mouse), 385 aa. 


14..310 
1 8.339 


180/323 (55%) 
221/323 (67%) 


2e-93 


060478 

: 


PUTATIVE SEVEN PASS 

TRANSMEMBRANE 
PROTEIN (SIMILAR TO 
TRANSMEMBRANE 7 
SUPERFAMILY MEMBER I) 
(UPREGULATED IN 
KIDNEY) - Homo sapiens 
(Human), 399 aa. 


14..3I0 
33..3S4 


177/323 (54%) 
220/323 (67%) 


4e-93 



PFam analysis predicts that the NOV47a protein contains the domains shown in 
the Table 47F. 



Table 47F. Domain Analysis of NOV47a 



Pfam Domain 



NOV47a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 48. — — 

The NOV48 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 48A. 
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Table 48A. NOV48 Sequence Analysis 




SEQIDNO:149 1 JH4 hn 


NOV48a, 
CG96747-01 DNA Sequence 

■ 
t 

i 

i 


ATGGCCCGGCTGGGCGCGCTGCTCCTGGCCGCCGCCCTGGGTGCACTGCTCAGCTTCG 
CGCTCCTGGCCGCCGCGGTCGCCAGCGACTACTGGTACATCCTGGAGGTGGCGGACGC 
CGGCAATGGCAGCGCCTGGCCCGGGCGCGCAGAGCTGCTCTCCTCGCACTCGGGGCTC 
TGGCGCATCTGCGGAAGACCTCCTGACATGCACCTGTGTGCACGTGTGCACACGCGAG 
GCCTATGTTTCCCAGGGCAGAACGGCTGCATCCCGCTGGTCGACCCTTTTGCCAGTGA 
GAGCCTGGACGTCTCCACCTCGGTGCAGCACCTCATCTTGCTGCACCGTGCAGTCATT 
GTGGTCCTGCCCCTGAGCCTGGTCCTTCTCGTGTGTGGCTGGATCTGCGGCCTGCTCA 
GCTCCCTGGCCCAGAGCGTGTCTCTGCTGCTTTTCACCGGCTGCTACTTCCTGCTGGG 
GGGTGTCCTGACACTGGCGGGGGTCAGCATCTACATCAGCTACTCGCACCTGGCCTTT 
GCGGAGACGGTGCAGCAGTATGGCCCGCAGCACATGCAGGGCGTCCGCGTCAGCTTCG 
GCTGGTCCATGGCCCTGGCCTGGGGCTCCTGTGCCTTGGAGGCATTCAGCGGAACCCT 
CCTGCTCTCAG CTGCCTGGACCCTCAG CCTGAG CCCCCCAATCTGTGGTCATCTGAGT 
CCCCAGCAGGGCAGGCTTGGTGGAGAAGAGGCTGATGAGAGGGCCCGAGAGCCCCTCC 

GATTTGCAGGGGTGGGGGGCAAGGAGCTGAGCGATCCAGATGTACCCCTCTGCCCCCT 
CCCTTGTTCTCAAGCCTGCTAG 




ORF Start: ATG at 1 


ORF Stop: TAG at 832 




SEQIDNO: 150 


277 aa I M W at 29226.6kD 


NOV48a, 
;CG96747-0! Protein Sequence 


MARLGALLLAAALGALLSFALIJU^VASDYl^ILEVADAGNGSAWPGRAELLSSHSGL 
WRI CGRPPDMHLCARVHTRGLCFPGQNGC I PLVDPFASES LDVSTS VQHLI LLHRAVI 
VVLPLSL VLLVCGWI CGLLS SLAQ S VS LLL FTGC YFLLGGVLTLAGVS I YI S YSHLAF 
AETVQQYG PQHMOGVR VS FX3WSMALAWG SCALEAFSGTLLLS AAWTLSLS PP I CGHLS 
PQQGRLGGEEADERAREPLRFAGVGGKELSDPDVPLCPLPCSQAC 



Further analysis of the NOV48a protein yielded the following properties shown 
Table 48B. 



1 — — . — , 

Table 48B. Protein Sequence Properties NOV48a 


PSort analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0. 1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 29 and 30 



A search of the NOV48a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 48C. 
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Table 48C Geneseq Results for NOV48a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV48a 
Residues/ 
Match Residues 


" Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


\ AAM23670 
i — 


Human EST encoded protein SEQ 
ID NO: 1 195 - Homo sapiens, 57 
aa. [WO200154477-A2, 02-AUG- 
2001] 


186..241 
] ..57 


51/57(89%) 

JJ'J / \7Z /O) 


le-22 


AAB73979 


Human stargazin-like protein 
CACNG8 - Homo sapiens, 327 
aa. [ WO200 1 2 1 79 1 -A2, 29- 
MAR-2001] 


5..168 
7 161 


49/172(28%) 

Ol / I IJL {HO/0) 


2e-09 


AAY84*?7? 

i 

i 
i 

i 


A human voltage-gated calcium 

channel designated 
CAC1MGLIKE3 - Homo sapiens, 
327 aa. [ WO2000 1 4224-A 1 , 16- 
MAR-2000] 


5..I68 
7..16I 


49/172(28%) 
81/172(46%) 


2e-09 


AAY70462 

i 


Human membrane channel 
protein- 12 (MECHP-12) - Homo 
sapiens, 323 aa. [WO20001271 1- 
A2, 09-MAR-2000] 


5..I68 
7..I58 


42/169(24%) 
79/169(45%) 


3e-09 


ABB 11 805 

i 

i 

i 

1 


Human voltage gated Ca channel 

subunit homologue, SEQ ID 
NO:2175 - Homo sapiens, 325 aa. 
[WO200157188-A2, 09-AUG- 
2001] 


5..I68 
9.. 160 


42/169 (24%) 
78/169 (45%) 


7e-09 



In a BLAST search of public sequence datbases, the NOV48a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 48D. 
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Table 48D. Public BLASTP Results for NOV48a 


Protein 
Accession 
"Number 


Protein/Organism/Length 


NOV48a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9D563 


493051 1J1 1RIK PROTEIN - Mus 
musculus (Mouse), 222 aa. 


3..224 
4..2I5 


98/231 (42%) 
133/231 (57%) 


2e-38 


; Q9JJV4 

i 


Voltage-dependent calcium channel 
gamma-4 subunit (Neuronal 
voltage- gated calcium channel 
gamma-4 subunit) - Mus musculus 
(Mouse), 327 aa. 


5.. 168 
7..161 


50/172 (29%) 
81/172(47%) 


3e-09 


Q98UH4 


IPR328-L1KE PROTEIN - Gallus 
gallus (Chicken), 3 14 aa. 


1..168 
3..157 


49/171 (28%) 
78/171 (44%) 


4e-09 


Q8WXS5 

i 
t 


VOLTAGE-DEPENDENT 
CALCIUM CHANNEL GAMMA- 
8 SUBUNIT - Homo sapiens 
(Human), 426 aa. 


5..216 
18..226 


59/232 (25%) 
98/232(41%) 


5e-09 


| CAC36506 

i 


SEQUENCE 1 FROM PATENT 
WOO 12 1791 - Homo sapiens 
(Human), 327 aa. 


5..168 
7..161 


49/172 (28%) 
81/172 (46%) 


6e-09 



PFam analysis predicts that the NOV48a protein contains the domains shown in 
the Table 48E. 



Table 48E. Domain Analysis of NOV48a 


Pfam Domain 


NOV48a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


PMP22_Claudin 


4..2I4 


48/232(21%) 
139/232 (60%) 


0.023 



Example 49. 



The NOV49 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 49A. 
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Table 49A. NOV49 Sequence Analysis 




SEQIDNO: 151 


537 bp 


NOV49a, 
CG96789-01 DN A Sequence 

i 

i 

i 

} 

\ 


CGCCTCCCGGC^CGGGTGGCCGAGGCGTAGCGCTGCGACCCCCX5CACCCCrCTGCX5AA 
CATGGCGCTGCGAGTGGTGCGGAGCGTGCGGGCCCTGCTCTGCACCCTGCGCGCGGTC 
CCGTCACCCGCCGCGCCCTGCCCGCCGAGGCCCTGGCAGCTGGGGGTGGGCGCCGTCC 
GTACGCTGCGCACTGGACCCGCTCTGCTCTCGGTGCGTAAATTTACAGAGAAACACGA 
ATGGGTAACAACAGAAAATGGCATTGGAACAGTGGGAATCAGCAATTTTGCACAGGAA 
GCGTTGGGAGATGTTGTTTATTGTAGTCTCCCTGAAGTTGGGACAAAATTGAACAAAC 
AAGGTTGGCTGATCAAGATGACACTGAGTAACCCTTCAGAACTAGATGAACTTATGAG 
TGAAGAAGCATATGAGAAATACATAAAATCTATTGAGGAGTGAAAATGGAACTCCTAA 
ATAAACTAGTATGAAATAACGCAAGCCAGCAGAGTTGTCTTAAATTAGTGGTGGATAG 
AAGACTTAGAATAGC 




ORF Start: ATG at 60 


ORF Stop: TG A at 447 




SEQIDNO: 152 


129 aa 


MW at 14212.3kD 


1 NOV49a, 
{CG96789-01 Protein Sequence 


MALRWRSVRALLCTLRAVPSPAAPCPPRPWQLGVGAVRTLRTGPALLSVRKFTEKHE 
WVTTENGIGTVGISNFAQEALGDVVYCSLPEVGTKLNKQGWLIKMTLSNPSELDELMS 
EEAYEKYIKSIEE 



Further analysis of the NOV49a protein yielded the following properties shown in 
Table 49B. 



j 

i. 


Table 49B. Protein Sequence Properties NOV49a 


i 

\ PSort analysis: 

! 


0.9200 probability located in mitochondrial matrix space; 0.6000 probability located 

in mitochondrial inner membrane; 0.6000 probability located in mitochondrial 
intermembrane space; 0.6000 probability located in mitochondrial outer membrane 


i / 

| SignalP analysis: 


Cleavage site between residues 19 and 20 



A search of the NOV49a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 49C. 
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Table 49C. Geneseq Results for NOV49a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV49a 
rvesiuucs/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABG 11725 


Novel human diagnostic protein 
#11716- Homo sapiens, 1 85 aa. 
[WO200175067-A2, ll-OCT-2001] 


1..I29 
10..185 


124/176 (70%) 
126/176 (71%) 


6e-61 


ABG 11725 


Novel human diagnostic protein 
#1 1716 - Homo sapiens, 185 aa. 
[WO200175067-A2, ll-OCT-2001] 


1..129 
10.. 185 


124/176 (70%) 
126/176 (71%) 


6e-61 


ABG 18844 


Novel human diagnostic protein 
#18835 - Homo sapiens, 184 aa. 
[WO200175067-A2, ll-OCT-2001] 


1..129 
9.. 184 


103/176 (58%) 
111/176 (62%) 


2e-46 


ABG 18844 


Novel human diagnostic protein 
#1 8835 - Homo sapiens, 1 84 aa. 
[WO200175067-A2, .1 l-OCT-2001] 


I..129 
9.. 1 84 


103/176 (58%) 
111/176 (62%) 


2e-46 


AAB58488 


Lung cancer associated polypeptide 

sequence SEQ ID 826 - Homo 
sapiens, 102 aa. [WO200055 1 80-A2, 
21-SEP-2000] 


7..60 
1..54 


52/54 (96%) 
52/54(96%) 


le-24 



In a BLAST search of public sequence datbases, the NOV49a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 49D. 
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Table 49D. Public BLASTP Results for NOV49a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV49a 
Residues/ 
ividiun txcsiuues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P23434 

i 


Glycine cleavage system H protein, 
mitochondrial precursor - Homo 
sapiens (Human), 173 aa. 


1..129 
1..I73 


128/173(73%) 
128/173(73%) 


5e-63 


Q9N12I 


Glycine cleavage system H protein, 

mitochondrial precursor - 
Oryctolagus cuniculus (Rabbit), 173 
aa. 


1.129 
1..173 . 


114/173(65%) 
119/173(67%) 


le-54 


P20821 

i 
i 


Glycine cleavage system H protein, 
mitochondrial precursor - Bos taurus 
(Bovine), 173 aa. 


1.129 
1-173 


113/173(65%) 
118/173(67%) 


le-53 


! Q96GY5 

> 

i 

! 


GLYCINE CLEAVAGE SYSTEM 
PROTEIN H (AMINOMETHYL 
CARRIER) - Homo sapiens 
(Human), 98 aa. 


1..97 
1..97 


96/97 (98%) 
96/97 (98%) 


2e-50 


! Q9QYU8 


H PROTEIN - Rattus norvegicus 
(Rat), 169 aa. 


1..126 
I..I68 


102/170(60%) 
112/170(65%) 


3e-46 



PFam analysis predicts that the NOV49a protein contains the domains shown in 
the Table 49E. 



J . " 

Table 49E. Domain Analysis of NOV49a 


Pfam Domain 


NOV49a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


GCV_H ' 


51. .96 


25/51 (49%) 
46/51 (90%) 


2.3e-22 


GCV_H 


97.. 127 


14/35(40%) 
31/35 (89%) 


2.4e-l 1 



Example 50. 



The NOV50 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 50A. 
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Table 50A. NOV50 Sequence Analysis 




SEQIDNO: 153 


440 bp 


NOV50a, 
CG97253-01 DNA Sequence 


CCTGGACTCAATCATGGCTTGTGGTCTGGTCGCCAGCAACCTGAATCTCAAACCTGGA 
GAGTGCCTTCGAGTGCGAGGCGAGGTGGCTCCTGACGCTAAGAGCTTCX3TGCTGAACC 
TGGGCAAAGACAGCAACAACCTGTGCCTGCACTTCAACCCTCGCTTCAACGCCCACGG 
CGACGCCAACACCATCGTGTGCAACAGCAAGGACGGCGGGGCCTGGGGGACCGAGCAG 
CGGGAGGCTGTCTTTCCCTTCCAGCCTGGAAGTGTTGCAGAGGTGTGCATCACCTTCG 
ACCAGGCCAACCTGACGGTGACTTCAAGATCAAATGTGTGG CCTTTGACTG AAAT CAG 
CCAGCCCATGGCCCCCAATAAAGGCAGCTGCCTCTGAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 




ORF Start: ATG at 14 


ORF Stop: TGA at 383 




SEQIDNO: 154 


123 aa 


MWat 13177.8kD 


NOVSOa, 
CG97253-01 Protein Sequence 


MACGLVASNLNLKPGECLRVRGE VAPDAKS FVLNLGKDSNNLCLHFN PRFNAHGDANT 
IVCNSKDGGAWGTEQREAVFPFQPGS VAEVCITFDQANLTVTSRSNVWPLTE I SQPMA 
PNKGSCL 



Further analysis of the NOV50a protein yielded the following properties shown in 
Table 50B. 



} *— « ■ ' 1 ■ — i ~ . 

i 

j Table 50B. Protein Sequence Properties NOV50a 


• PSort analysis: 

i 

i « 


0.6500 probability located in cytoplasm; 0.1586 probability located in lysosome 
(lumen); 0.1000 probability located in mitochondrial matrix space; 0.0000 
probability located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOVSOa protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 50C. 
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Table 50C. Geneseq Results for NOV50a 


Geneseq 
Identifier 


{ 

Protein/Organism/Length [Patent #, Date] 


NOV50a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY80675 


Human galectin-1 fusion protein, SEQ ID 
NO:9 - Homo sapiens, 142 aa. 
[WO200006724-A 1 , 1 0-FEB-2000] 


1..99 
8..106 


99/99(100%) 
99/99(100%) 


le-54 


AAW55959 


Human HL-60 lectin - Homo sapiens, 
aa. i wuyoUojJD-Ax, Ud-MAK- 
1998] 


1..99 

i no 

1..99 


99/99(100%) 
99/99 (100%) 


le-54 


AAR08153 


Human lectin gene product - Homo 
sapiens, 135 aa. [JP02262597-A, 25- 
OCT-1990] 


1..99 
1..99 


99/99(100%) 
99/99(100%) 


o le " 54 


AAR52745 


Amino acid sequence of both HL-60 and 
placenta lectins - Homo sapiens, 135 aa. 
[W0941 1497-A, 26-MAY-1994] 


I. .99 
1..99 


98/99(98%) 
99/99 (99%) 


3e-54 


AAP91964 


HL-60 and placenta lectins - Homo 
sapiens, 135 aa, [EP337799-A, 18-OCT- 
1989] 


1..99 
1..99 


98/99 (98%) 
99/99 (99%) 


3e-54 



In a BLAST search of public sequence datbases, the NOV50a protein was found to 
have homology to the proteins shown in the BLAST? data in Table 50D. 
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Table 50D. Public BLAST? Results for NOVSOa 


Protein 
Accession 
Number 


Protein/Orpan i sm/Len 0th 


NOV50a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9D0X0 

i 

1 


18 DAYS EMBRYO CDNA, RIKEN 
FULL-LENGTH ENRICHED 
LIBRARY, CLONE: 1 1 10058J24, FULL 
INSERT SEQUENCE - Mus musculus 
(Mouse), 152 aa. 


1..123 
L.149 


103/149 (69%) 
112/149 (75%) 


2e-54 


LNHUGB 


galectin 1 [validated] - human, 135 aa. 


1..99 
1..99 


99/99(100%) 
99/99(100%) 


3e-54 


E981607 


AMINO ACID SEQUENCE FOR GBP - 
vectors, 135 aa. 


1..99 
1..99 


99/99(100%) 
99/99(100%) 


3e-54 


CAA01896 

| 


BETA GALACTOSIDE BINDING 
PROTEIN - synthetic construct, 135 aa. 


1..99 
1..99 


98/99 (98%) 
98/99 (98%) 


le-53 


| P09382 

! 

! 

| 

1 


Galectin- 1 (Beta-galactoside-binding 
lectin L- 14-1) (Lactose-binding lectin 1) 
(S-Lac lectin 1) (Galaptin) (14 kDa 
lectin) (HPL) (HBL) - Homo sapiens 
(Human), 134aa. 


2..99 
1 .98 


98/98(100%) 
98/98(100%) 


le-53 



PFam analysis predicts that the NOV50a protein contains the domains shown in 
the Table 50E. 



Table 50E. Domain Analysis of NOV50a 


Pfam Domain 


NOV50a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Gal-bind_lectin 


3..113 


65/139(47%) 
100/139 (72%) 


4.6e-45 



Example 51. 



The NOV51 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 5 1 A. 
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Table 51 A. NOV51 Sequence Analysis 



SEQ ID NO: 155 



4297 bp 



NOV51a, 
CG97400-01 DNA Sequence 



GCAGGGCCCCTGAAGACTGGCGATCCCGCGCCCGACTACCTTGGTCGTCCGGATATGC 
TCCATCGCAAGGAGACGTCAGCCGGCAGCCGGTCTCACAGGCGTACCATACGGCTACA 
G AAACAGGG CGGTG ACAGCG ACAGAGAG CAAGCGGGAACTCCCTCC ACCAGCCAGCGC 
CGCGCGTCTGCCTGCCTCGAGTCCCCCGGGAGGCCGCGGGGTTTGGGGAAGTGTTTCT 
AGGAGACGG CGCTCACCGGCTGCACCTGCG CCGTTGACGCC AC CGGGG CCGG CAG ACA 
GACCCGCGGCGCTGGCTGGTGGAGGGAGTTCCCGCTTGCTCTCTGTCGCTGTCACCGC 
CCTGTTTCTGTAGCCGTATGGTACGCCTGTGAGACCGGCTGCCGGGTGACGTCTCCTT 
GCGATGGAGCATATCCGGACGACCAAGGTCGAACAAGTAAAATTACTTGACCGATTCA 
GTACCAGCAACAAGTCATTAACAGGAACACTGTATCTTACGGCTACACATCTATTATT 
TATCGACTCTCATGAAAAAGAAACCTGGATATTACACCACCATATTGCCTCAGTAGAG 
AAACTTGCTTTGACTACTTCTGGATGCCCCCTTGTGATAC^GTGCAAGAACTTCAGAA 
CTGTGCATTTCATTGTTCCCAGAGAAAGAGATTGCCATGATATTTACAACTCrrTTGCT 
ACAACTGTCAAAACAAGCAAAATATGAAGATCTCTATGCATTTTCTTATAATCCCAAA 
CAAAATGATTCAG AACGACTACAAGGCTGG CAG CTCATTGATCTCGCTGAGG AATATA 
AGAGGATGGGAGTGCCAAACTCACACTGGCAGTTGTCTGATGCCAACCGGGACTAC2\A 
G ATTTGTGAAACTT AC CCCAGAGAACTTTATGTTCCCCGG ATAGCAAG CAAACCAATA 
ATTGTTGGTAGTTCCAAGTTCCGGAGCAAGGGAAGATTCCCIAGTTCTTTCCTACTATC 
ATCAAGATAAGGAGGC!TGCCATTTGTCGATGTAGTCAGCCIACTCTCTGGATTCAGTGC 
CAGGTGCCTGGAGGATGAACATTTGCTTCAAGCCATTAGTAAAGCCAATCCAGTCAAT 
CGCTATATGTACGTCATGGATACCAGGCCAAAACTGAATGCAATGGCCAACAGAGCAG 
CTGGAAAAGGTTATGAAAATGAAGACAACTATTCCAATATTAGATTTCAGTTTGTTGG 
AATTGAAAATATTCATGTCATGAGGTCCAGCCTTCAGAAATTATTGGAAGTCAATGGC 
ACTAAAGGGCTTTCTGTCAATGATTTCTACTCCGGTTTGGAGAGCTCGGGATGGCTTC 
GCCATATCAAAGCTGTTATGGATGCTGCAGTCTTCTTGGCCAAAGCAATAACAGTTGA 
AAATGCAAGTGTGTTGGTGCATTGTTCCGATGGTTGGGATAGGACTTCCCAGGTTTGT 
TCCCTGGGTTCTCTTTTATTGGATTCCTACTACAGGACAATCAAAGGATTCATGGTTT 
TAATAGAAAAGGATTGGATCTCTTTTGGACATAAATTTTCAGAGAGGTGTGGCCAGTT 
GGATGGTGACCCJIAAGGAAGTCTCACCAGTGTTTACTCAGTTCTTGGAATGTGTGTGG 
CATTTG^CCGAACAGTTTCCACAAGCCTTTGAATTCAGTGAAGCATTT(TTTCTTCAGA 
TCCATGAGCATATTCATTCATGCCAGTTTGGAAACTTCCTTGGAAATTGTCCCAAGGA 
AAGAGAAGAGCTCAAGTTGAAGGAGAAGACTTATTCCCTGTGGCCATTTCITTTGGAA 
GACCAAAAGAAGTACTTAAATCCTCTCTACAGTTCCGAATCTCACAGATTTACAGTTT 
TGGAG CCAAATAC AGTATCTTT CAATTTTAAGTTT TGGAGG AA CATGTACCATCAGTT 
TGATCGAACACTGCATCCTAGGCAGTCTGTATTTAATATAATTATGAATATGAATGAG 
CAAAATAAACAATTAGAGAAAGATATTAAAGACCTAGAATCTAAAATTAAACAACGCA 
AAAATAAGCAAAC^GATGGCATCCTCACCAAGGAATTGTTACATTC^ 

GATGCTCTTCGAACTATAGAGGGCAGCAGCCCGGCAGATAATCGTTATAGTGAATATG 

CAGAAGAGTTTTCTAAATCAGAACCTGCTGTGGTCAGCTTAGAGTATGGTGTGGCAAG 

AATG ACTTGTTAGACTCAT AG AGTTTTTTCTGCAATG ATTG CAGTACAAGAAAAG GAT 

TATTGTGAGGATCGTCTGTAAGC^TAACC^AAAGGAATTTGTCTAATAAC^ 

GGTTTAACAGTAGGCTAATAGTTGAAGGAAGGATAATAACTACCCTTGTGAGAGAAAT 

ATGTCATT TTAATTG CATTT CCAGCAAGGAATG ACATTCAGTT CTGTAAGAAATGAGT 

GGTATTTGATGTATTTACTCAAAAC^C^TTTGCACTGTACACTAGTGAATTGACGTT 

TATGATTTATGrrAATTCAGCCAAACATAAATAACCTTCCTTAAGTACAATT^ 

CAAGAAAACAAAATTTGACAACATAGTTTCTTAATAAATGATATGGC^TGTACTT^ 

ATT ATGTAG CTTTGTAACTATGAATATTTACATATTTTG CCTT TTAGTG ATATTT AAT 

GTTAAAGTG CCATG AAAAAT ATTTCT AAGAAAG CCTTAAATTC CCAGTGGATTCTTTA 

CCCITAAGTTTTACAGCCTACAACAAGATlTTriXjTTTT G l^^ 

TTGTTTTGTTTGTAAAGAATTGTGCTCTC!ATTACrGCTGGGGGTG 

TTCTATATAAACACrrTGTAGAAGTACACrGTTCACGTTTAGCCTCCCCCACT^ 

TTCAAAATTAATGAAACTGAAGGTTTATTCTGATCATAATTTGT^ 

GATAATTTATTATTT ACAG CTTAGAATAT TG ATTT CTTGAATACGTAT AAG CACATTT 

GACrGTCTTTTATATATGGATTACTGCATTCCATTGATTOT 

TATTT CrTTCACAACTGTGT AAG TTTAAAGAGCTAAAG CnXrTAAAACrGTTCTGAGAA 
ACAATGAATAGTACATGTATATGTATATTTTTAAACTGCCITATTGCrC^TGAGTTG 
CTGTTC CTGG AGT ATCT AACTTT CAGTTTACTCAAAAACTTTTGGGTAAATAAAAAAG 
GGAAGTACAG ATAGTTTAGGTTG TG (2ATGGTTG <ZATG AATTTTGAAGTCATTTCTATG 
TGGAG CATT ATTTTTCITCTTGTTAAAT AT AG AAAAAAAAAGAGATT CTAGTACAAAG 
TTACTGTTTAACAAAAG CIAACATAJU^CTCZTGGG AAAG ATTTCATTTTG C CATGTTATA 
TTTACTGTTTATTCTGTGTACTAGTACATATCTTTAAAT^^ 

AAACATAAAAACCCCAAAACTAT C ACT TGG AATTAGCAATATCACCCAAC TGG CTTTA 
AAATTG AAAATTTAAATAACATGGTGGCATGAACTAC^TTCGAGTATTAGG CAATTT 
G CATAGTGTTCCTCATG CTTACTTTCTGTTACAC CTCTATTATATTAGTTT^ 
AACAT CITT1TCAG ACCAAAAAAAACTTTATTGTATGAGAG CTT ATCTTATCCTGTTT 
ATTTlTCCAATGCrrTrTCTGTAATA(^TTATGT 

CAG CAACAGAAATG CACTATAAAATATAGTATGTGATTAACCTU^TCCTG CTTCCATAT 
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TTAAGCACTGGGAATGGAAACTTAATCTCTGTGACTAQVAA 

TGCTGTTCCAGTCACTGATTGTGGTTTTAGAATC^ 

AACTCTGGTTTATTATCTTAATCTTC^CTTCAGACATACATTT 

TCACAGGTGGACTGAGAAATCAGTTACATCTTAAGTGACCTACAGGGTATATGTTGGC 

AAAAGCAGACTGTGTATATGTCTTATAAAGTTGAATTTATGTTCAGTGTGTTTGGAAG 

TGTATAGCATGTAAATTATTTCATATATGATTTAAAGGTAATTAAATGTTCACATTTT 

ACTTTGAATGTTTTTCTTCTGGATAAAACAAATGGAATCAAAAAAAAAAAAAAAAAAA 

AAATT 




ORF Start: ATG at 410 


ORF Stop: TAG at 2273 




SEQ ID NO: 156 


621 aa MW at 71922.1kD 


NOV51a, 
CG97400-01 Protein Sequence 


MEHIRTTKVEQVKLLDRFSTSNKSLTGTLYLTATHLLFIDSHQKETWILHHHIASVEK 
LAItTTSGCPLVIQCKNFRTVHFIVPRERDCHDIYNSLLQLSKQAKYEDLYAFSYNPKQ 
NDSERLG^WQLIDLAEEYKRMGVPNSHWQLSDANRDYKICETYPRELYVPRIASKPII 
VGSSKFRSKGRFPVIjSYYHQDKEAAICRCSQPLSGFSARCLEDEHLLQAISKANPVNR 
YKVVMDTRPKLNAMANRAAGKGYENEDNYSNIRFQFVGIENIHVMRSSLQKLLEVNGT 
KGLSVNDFYSGLESSGWLRHIKAVMDAAVFIAK^ITVENASVLVHCSDGWDRTSQVCS 
LGSLLLDSYYRTIKGFMVLIEKDWISFGHKFSERCGQLDGDPKEVSPVFTQFLECVWH 
LTEQFPQAFEFSEAFLLQIHEHIHSCQFGNFLGNCPKEREELKLKEKTYSLWPFLLED 
QKKYLNPL YS SESHRFTVLE PNTVS FNFKFWRNMYHQFDRTLHPRQS VFNI I MNMNEQ 
NKQliEKDIKDLESKIKQRKNKQTDGILTKELLHSVHPESPNLKTSLCFKEQTLLPVND 
ALRTI EGSS PADNRYS EYAEEFS KS E PAWSLEYGVARMTC 



Further analysis of the NOV51a protein yielded the following properties shown in 
Table 51B. 



Table 51B. Protein Sequence Properties NOV51a 


PSort analysis: 


0.3600 probability located in mitochondrial matrix space; 0.1723 probability 
located in microbody (peroxisome); 0.1000 probability located in lysosome 
(lumen); 0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV5 1 a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 5 1 C. 
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Table 51C. Geneseq Results for NOVSla 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, Date] 


NOV51a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


; AAB73229 

i 
j 


Human phosphatase MTMR7 h - Homo 
sapiens, 629 aa. [WO2001 12819^A2, 22- 
FEB-2001] 


8..S34 
82..613 


338/533 (63%) 
430/533 (80%) 


0.0 


, 

5 ABR66R18 

AUUUUO 1 O 


L/rubupniid meianogasicr poiypepiiue 

SEQ ID NO 27246 - Drosophila 
melanogaster, 676 aa. [WO20017I042- 
A2, 27-SEP-2001] 


1 A 

I4..JHZ 

1..525 


386/530 (72%) 


e-176 


ABB59964 

1 

! 

! 
i 


Drosophila melanogaster polypeptide 

SEQ ID NO 6684 - Drosophila 
melanogaster, 676 aa. [ WO200 1 7 1 042- 
A2, 27-SEP-2001] 


14..542 
1..525 


282/530 (53%) 
386/530 (72%) 


e-176 


| ABB65465 

j 

i 

i ' 

1 


Drosophila melanogaster polypeptide 

SEQ ID NO 23 1 87 - Drosophila 
melanogaster, 639 aa. [WO200171042- 
A2, 27-SEP-2001] 


■ 52..542 
2..4S8 


260/491 (52%) 
355/491 (71%) 


e-163 


; AAW36451. 

i 


Human tyrosine phosphatase related 
protein - Homo sapiens, 621 aa. 
[WO97350I5-A2, 25-SEP-1997] 


22..510 
74..567 


197/501 (39%) 
297/501 (58%) 


4e-99 



In a BLAST search of public sequence datbases, the NOV51a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 5 ID. 
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Table 51D. Public BLASTP Results for NOVSla 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV51a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


! Q96P80 

1 


MYOTUBULARIN RELATED 
PROTEIN 6 - Homo sapiens 
(Human), 621 aa. 


1..621 
1..621 


620/621 (99%) 
620/621 (99%) 


0.0 


Q8VEI1 


HYPOTHETICAL 70.9 KDA 
PROTEIN - Mus musculus 
(Mouse), 617 aa. 


1..621 
1 ..617 


558/621 (89%) 
588/621 (9^%) 


0.0 


Q9Y217 


Myotubularin related protein 6 (EC 
3.1.3.48) - Homo sapiens (Human), 
465 aa (fragment). 


157..621 
1..465 


464/465 (99%) 
464/465 (99%) * 


0.0 


Q96EF0 


SIMILAR TO HYPOTHETICAL 
PROTEIN FLJ20126 - Homo 
sapiens (Human), 704 aa. 


1..540 
1..541 


332/543 (61%) 
430/543 (79%) 


0.0 


; Q9W1Q6 


BCDNA:GH04637 PROTEIN - 
Drosophiia melanogaster (Fruit 
fly), 676 aa. 


14..542 
1..525 


282/530(53%) 
386/530 (72%) 


e-175 



PFam analysis predicts that the NOV5 la protein contains the domains shown in 
the Table 5 IE. 



Table 51E. Domain Analysis of NOVSla 






Identities/ 




Pfam Domain 


NOV51a Match Region 


Similarities 


Expect Value 




for the Matched Region 




No Significant Matches Found 



Example 52. 



The NOV52 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 52A. 
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Table 52A. NOV52 Sequence Analysis 




SEQIDNO: 157 


807 bp 


NOV52a, 
CG97462-01 DNA Sequence 

i 

! 

i 


ATGGCTGCCAAAGTGTTTGAGTTCATCAGCAAGTTTGGCCTGGCCTTAGCTGTTGCAG 
GAGG CCTGAATGAACTCTGCCTTAATGTGG ATG CTGGGCACAGAG CTGTCATCTTTG A 
CCTATTCCGTGGAGTACAGGACATTGTGGTAGGGGAAAGGACTCACTTTCTCATTCCA 
TGGGTACAGAAACCAATTAT CTTTGACTGCCCTTCTCG ACCACGT AATGTGCCAG CCA 
TCACTGGTAGCAAAGATTTACAGAATGTCAAC^TCACACTGCTCATCCTCTTCTGGCC 
TGTCACTAGCCAGTTTCCTTGCATCTTCACCAGCATCAGAGAGGACTATGATGAGCAG 
GTGCTGCCATCCGTCACGACCAAGATCCTCAAGTCCGTGGTGGCTAGCTTTGATGCTG 
GAGAACTAATCAC CCAGAG AGAG CTGGTCTCCAGGGAGCAATTACTTACAGAGCGAGC 
AGCCACCTTTGGGCTCATCCTGGATGACGTGTCCTTGACACATCTGACCTTCGGGAAG 
GAGTTCACAGAAGCGGTGGAAGCCAAACAGGTAGCTCAGCAGGAAGCAAGGGCCAGAT 
TTGTGGTGGAAAAGG CTGAGCAGCAGAAAAAGGTGG CCATCATCTCTGCTGAGGGCTA 
CTCCAAGGCAGCTGAGCTGATTGCCAACTCACTGGCCACCGCAAGGGACCGCCTGATG 
GAGCTCTGCAAG CTGG AAGCTGCGGAGGACATCGCGTACCAG CTCTCACGCTCTCGG A 
ACATCACCTATCCGCCGGCTGGGCAGTCCGTGCTCCTCCAGCTGCCCCAGTGA 


I ■ : 


ORF Start: ATG at 1 


ORF Stop: TGA at 805 


! 


SEQIDNO: 158 


268 aa 


MWat29706.9kD 


; NOV52a, 
CG97462-0I Protein Sequence 

i 
i 

1 


MAAKVFEF I S KFG LALAVAGG LNE LCLNVDAGHRAV I FDLFRGVQD I WGERTHFLI P 
WVQKPI I FDCPSRPRNVPAITG S KDLQNVNI TLLI LFWPVTSQFPC I FTS IREDYDEQ 
VLPSVTTKILKS WASFDAGEL I TQRELVS REQLLTERAATFGLI LDDVSLTHLTFGK 
EFTEAVEAKQVAQQEARARFWEKAEQQKKVAI I S AEGYS KAAEL I ANS LATARDRLM 
ELCKLEAAEDIAYQLSRSRNITYPPAGQSVLLQLPQ 



Further analysis of the NOV52a protein yielded the following properties shown in 
Table 52B. 



i — — — — — — — — ' , — , 

j Table 52B. Protein Sequence Properties NOV52a 


j PSert analysis: 

1 


0.5500 probability located in endoplasmic reticulum (membrane); 0.2632 probability 
located in lysosome (lumen); 0.1000 probability located in endoplasmic reticulum 
(lumen); 0. 1 000 probability located in outside 


SignalP analysis: 


Cleavage site between residues 22 and 23 



A search of the NOV52a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 52C 
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Table 52C. Geneseq Results for NOV52a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV52a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG73845 

i 
1 


Human colon cancer antigen protein 
SEQ ID NO:4609 - Homo sapiens, 279 
aa. [WO200I22920-A2, 05-APR- 
2001] 


I..268 
8..279 


237/272 (87%) 
243/272 (89%) 


e-120 


AAB43874 


Human cancer associated protein 
sequence ID NU: I j 1 9 - Homo 
sapiens, 279 aa. [WO200055350-A1, 
21-SEP-2000] 


1..268 

o 0*70 


237/272 (87%) 
24^/272 (89%) 


e-120 


AAW54352 

i 
i 

i 

! 

i 


Heat shock 27 kD protein and 
prohibitin (admixture) - Homo sapiens, 
471 aa. [WO9810291-A1, 12-MAR- 
1998] 


L.268 
200.:471 


237/272 (87%) 
243/272 (89%) 


e-120 


AAR42215 

1 


Human prohibitin - Homo sapiens, 272 
aa. [JP0527I294-A, 19-OCT-1993] 


1..268 
1..272 


237/272 (87%) 
243/272 (89%) 


e-120 


i AAR 13466 


Prohibitin - Rattus rattus, 272 aa. 
[USN7612674-N, 16-JUL-1991] 


1 ,.268 
I..272 


236/272 (86%) 
243/272 (88%) 


e-120 



In a BLAST search of public sequence datbases, the NOV52a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 52D. 



Table 52D. Public BLASTP Results for NOV52a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV52a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P35232 


Prohibitin - Homo sapiens 
(Human), 272 aa. 


1..268 
1..272 


237/272 (87%) 
243/272 (89%) 


e-120 


P24142 


Prohibitin (B-cell receptor 
associated protein 32) (BAP 32) - 
Mus musculus (Mouse), and, 272 
aa. 


L.268 
L.272 


236/272 (86%) 
243/272 (88%) 


e-120 


Q9VIZ4 


LETHAL (2) 37CC PROTEIN - 
Drosophila melanogaster (Fruit 
fly), 276 aa. 


L.267 
1..271 


178/271 (65%) 
212/27! (77%) 


4e-91 


Q9BKU4 


HYPOTHETICAL 30.0 KDA 
PROTEIN - Caenorhabditis 
eiegans, 275 aa. 


2..266 
5..273 


157/269 (58%) 
203/269 (75%) 


2e-78 


001720 


$6-MI£^~ Like Molecule 


2.267 


155/270(57%; 


le-76 
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TC-PRO-1 - Toxocara canis, 274 
aa. 


4..273 


201/270(74%) 





PFam analysis predicts that the NOV52a protein contains the domains shown in 
the Table 52E. 



Table 52E. Domain Analysis of NOV52a 



] 

Pfam Domain 


NOV52a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Band_7 


12..203 


49/206 (24%) 
169/206 (82%) 


4.6e-66 



Example 53. * 

The NOV53 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 53 A. 



j Table 53 A. NOV53 Sequence Analysis 




SEQ ID NO: 159 


1527 bp 


NOV53a, 
CG97472-01 DNA Sequence 


ATTGAGGCCATGGGGACAAAGAAGGTCACCCCATCTCTGATCTTCGCCATCACAATTC5 
CTACAATCGGCTCTTTCCAATTTGGCTACAACACTGGGGTCATCAATGCTCCTGAGAT 
GATCATAAGGGAATTTATCAACACTTTC 
ATGTTGTG<nTGTCCCTrcGGTCCTTGTCTA^ 

TTAGCTCCITTTCTGTTGGACTGTTTCT 

TACTGTCAACCTGTTGGCTGCC3VCTGGTCGCTGCCTTATGGGACTGTGTAAAGTAGCT 

GAGTTGGTTGAAATACTGATCCrGGCCTACTTGGT^AT/TGGCCT 

GCA.CAGGTTTTGTGCCCACGTACATCGGAGACATCTCACCrATTGGCCTGCAAGGTGT 

CTTTGGC^CTCTCAACC^CTGGGCATTGTTGTTGGAATTCTGGTGGCCCAGATCTTT 

GATCTGGAAT^C^TCCTGGGGTCTGAAGACCTATGGCCTGTGCTAT^AGGCT^CCC^ 

TCTCTCCTGCTATGCTACAAAGTGCIAGCCCTTCCTTTTTGCCCTGAAAGTCCCAGATT 

CTTGCTCATTAACAGAAAAGAAGAGGAGAATGCTAAGGAGATCCTCCAGTGGTTGTGG 

GG CAC CCAGG ATGT ATCCCAAGACATCC AGGAG ATGAAAGATG AGAGTG CAAGGATGG 

CA CAAGAAAAGCAAGTCACTGTGCTGGAG CTCTTTAG AGT ATCCAG CTACCGACAG CC 

CATCATCATTTCCATCATG CTCCAG CTCTCTCAGCAG CTCTCTGTAATCAATACTGTG 

TTCTAT^ACTCAACAGGAATCTTTAAGGGTGC^GGTCT^ 

CCATCGGTGTGGGTGTGGT^AATACTATCTTCACTAT^ 

AAGGGCAGTAAGAAGGACTCTACATATGATAGGCCTCGGAGGGAT^ 




ATCCTCATGTCTGTTTGTTTGTTATTGAAGGATGAGTGTAATGGGATAAGC^TTTGTCT 

GTATTGGGGCTATCTTGGTCTT^GTGGTCTTCTT^GAAATTGG<3CCA 

CTGGT^ATTGTGGCTGAACTCTTCAGCCAGGGCCCTGGCCCAGCTGTGATGGCAGTG 

GCCGGCTGCTCCACCTGGACCTCC^CTTCCrrAGT(^GATTGCTTT^CC 

CTCACTATTTAGGAGCCrACGTTTTTATT^^ 

GATCTTTACCTTCTTGAAAGTCCCTGAGATGTC 

AAGGCCTTTGAGGGGCAGGCACACGATGCAAATAGATCTGGAAGGGACTGTGTCATGG 
AGATGAACAGTATCCAG CCTG CTAAGGAG ACCACCAC CAATGTCT AAATCATGCCTCC 
TTCAACCTCCCTCCCAGCA 




ORF Start: ATGat 10 


ORF Stop: TAA at 1495 




SEQ ID NO: 160 


495 aa MWat54631.8kD 


NOV53a, 
CG97472-01 Protein Sequence 


MGTKKVTPSLIFAITIATIGSFQFGYOTGfVTNAPEMIIREFINT^ 
LSLWCLSMAI FSIGCMISSFSVGLFVNSFDRRNSMLTVNLLAATX3GCI^GL<2KVAELV 
EI L I lAYLVIGLFCRLCTG FVPTYI GDI S PIGLQGVFGTLNQLGIWGIIjVAQI FDLE 
FI LGS EDLWPVLLG F P IS P AMIjQS AAL PFCPES PRFLL INRKE EENAKE I LQWLWGTQ 
DVSQDI QEMKDESARMAQEKQVTVLELFRVSSYRQPI I ISIMLQLSQQLSVINTVFYY 
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STGI FKGAGVQEPI CVTIGVGVVNTI FT IVSLFLVERAVRRTLHMIGLGGMAFCS I IjM 
S VCLLLKDECNG I S FVC I GAI LVFWFFE IGPDHI PWFIVAELFSQGPGPAVMAVAGC 
STfH'SNFLVRIiFPSAAYYLGAYVFI I FTD FLITFLI FTFFKVPEMCYRTFKD I TKAF 
EG QAHDANRS GRDCVMEMN S IQPAKETTTNV 



Further analysis of the NOV53a protein yielded the following properties shown in 
Table 53B. 



Table 53B. Protein Sequence Properties NOV53a 


PSort analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi body; 
0. 1 000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 22 and 23 



A search of the NOV53a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 53C 



Table 53C Geneseq Results for NOV53a 



1 

j Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV53a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


! 

Expect 
Value 


AAY27289 


Glucose transporter protein GLUT3 - 
Homo sapiens, 494 aa. [US5942398-A, 
24-AUG-1999] 


1..491 
1..492 


375/492 (76%) 
420/492 (85%) 


0.0 


AAR11360 


Glucose Transporter Protein from CHO 
cells - Cricetulus sp, 492 aa. 
[WO9103554-A, 21-MAR-1991] 


3..474 
5..478 


276/474 (58%) 
349/474 (73%) 


e-154 


ABB57244 


Mouse ischaemic condition related 
protein sequence SEQ ID NO:652 - 

Mus musculus, 492 aa. 
[WO200188188-A2, 22-NOV-2001] 


3..474 
5..478 


276/474 (58%) 
347/474(72%) 


e-154 


AAW17835 


Human glucose transporter GLUT-1 - 
Homo sapiens, 492 aa. [W097 15668- 
A2,01-MAY-1997] 


3..474 
5..478 


275/474 (58%) 
347/474 (73%) 


e-153 


AAB30522 


Amino acid sequence of a consensus 
GLUT polypeptide - Synthetic, 493 aa. 
[US6136547-A, 24-OCT-2000] 


6..487 
10..490 


272/482 (56%) 
341/482(70%) 


e-150 



In a BLAST search of public sequence datbases, the NOV53a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 53D. 
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Table 53D. Public BLASTP Results for NOV53a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV53a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


PI 1169 

1 
I 

i 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 

Homo sanipn*; fHnman^ dQf\ 


1..495 
1..496 


418/496 (84%) 
444/496 (89%) 


0.0 


AAL89710 


GLUCOSE TRANSPORTER 14 
LONG FORM - Homo sapiens 
(Human), 520 aa. 


3..495 
27..520 


408/494 (82%) 
436/494 (87%) 


0.0 


AAL89709 


GLUCOSE TRANSPORTER 14 
SHORT FORM - Homo sapiens 
(Human), 497 aa. 


4..495 
5. .497 


407/493 (82%) 
436/493 (87%) 


0.0 


P47842 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Canis familiaris (Dog), 495 aa. 


1..493 
1..494 


390/494 (78%) 
431/494 (86%) 


0.0 


P47843 

i 

i 
I 


Solute carrier family 2, facilitated 

glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Ovis aries (Sheep), 494 aa. 


1..491 
I ..492 


375/492 (76%) 
420/492 (85%) 


0.0 



PFam analysis predicts that the NOV53a protein contains the domains shown in 
the Table 53E. 



Table 53E. Domain Analysis of NOV53a 


Pfam Domain 


NOV53a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


sugar_tr 


12..464 


173/489 (35%) 
383/489 (78%) 


3.5e-143 



Example 54. 



The NOV54 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 54A. 
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Table 54A. NOV54 Sequence Analysis 




SEQIDNO: 161 


2049 bp 


NOV54a, 
CG97528-0I DNA Sequence 


TACACATTTTCCCTCATCTCTCTCACTGAC&C^CACAAA^ 

AATT CTG TG ACT CT AT C CT CTAATTG CAAAAG ACTGGAAATTCT AATTCATTCCAACT 
CTTCCCTTGCAGACAAGAGAACAATGCCCTGGACATGGCTCCAGAGATCCACATGACA 
GGCCCAATGTGCCTCATTGAGAACACTAATGGGGAACTGGTGGCGAATCCAGAAGCTC 
TGAAAATCCTGTCTGCCATTACACAGCCTGTGGTGGTGGTGGCAATTGTGGGCCTCTA 
CCGCACAGGAAAATCCTACCTGATGAACAAGCTAGCTGGGAAGAATAAGGGCTTCTCT 




CC^AAAAGCCAGAACACACCTTAGTCCTGCTTGACACTGAGGGCCTGGGAGATGTAAA 
GAAGGGTG ACAACCAG AATGACTCCTGG ATCTTCACCCTG G CCGT CCTCCTGAGCAGC 
ACTCTCGTGTACAATAGCATGGGAACCATCAACCAGCAGGCTATGGACCAACTGCAGT 
ATGTGACAGAGCTGACACATCGAATCCGATCAAAATCCTCACCTGATGAGAATGAGAA 
TGAGGATTCAGCTGACT/rTGTGAGCITCTTCCCAGATTTTGTGTGGACACTGAGAGAT 
TTCTCCCTGGACTTGGAAGCAGATGGACAACCCCTCACACCAGATGAGTACCTGGAGT 
ATTCCCTGAAGCTAACGCAAGGTACCAGTCAAAAAGATAAAAATTTTAATCTGCCCCG 
ACTCTGTATCCGGAAGTTCTTCCCAAAGAAAAAATGTTTC 

CACCG CAGG AAGCTTG C C C AG CTTG AG AAACTAC AAG ATG AAG AG CTG G ACCCTGAAT 

TTGTGCAACAAGTAGCAGACTTCTGTTCCTACATCTTTAGCAATTCCAAAACTAAAAC 

TCTTTCAGGAGGCATCAAGGTCAATGGGCCTCGTCTAGAGAGCCTAGTGCTGACCTAT 
ATCAATGCTATCAGCAGAGGnGATrTnr , r , r'Tnr7i r rT:r:2if:n nrT , r , ^r , 'rr , r ,r rr'r t r*r^ r r r rr'r^ 

CCCAGATAGAGAACrCAGCCGCAGTGCAAAAGGCTATTGCCCACTATGACCAGCAGAT 

GGGCCAGAAGGTGCAGCTGCCCGCAGAAACCCTCCAGGAGCTGCTGGACCTGCACAGG 

GTTAGTGAGAGGGAGGCCACTGAAGTCTATATGAAGAACTCTTTCAAGGATGTGGACC 

ATCTGTTTCAAAAGAAATTAAAGGCCCAGCTAGACAAAAAGCGGGATGACTTTTGTAA 

ACAGAATCAAGAAGCATCATCAGATCGTTGCTCAGCTTTACTTCAGGTCATTTTCAGT 

CCTCTAGAAGAAGAAGTGAAGGCGCGAATTTATTCGAAACCAGGGGGCTATTGTCTCTl 

TTATTCAGAAGCTACAAGACCTGGAGAAAAAGTACTATGAGGAACCAAGGAAGGGTCC 

TAAGGCTGAAGAGATTCTGCAGACATACTTGAAATCCAAGGAGTCTGTGACCGATGCA 

ATTCTACAGACAGACC^GATTCTCACAGAAAAGGAAAAGGAGATTGAAGGTGTGGAAT 

GTGTAAAAGCTGAATCTGCACAGGCrrrCAGCAAAAATGGTGGAGGAAATGCAAATAAA 

GTATCAGCAGATGATGGAAGAGAAAGAGAAGAGTTATCAAGAACATGTGAAACAATTG 

ACTGAGAAGATGGAGAGGGAGAGGGCCCAGTTGCTGGAAGAGCAAGAGAAGACCCTCA 

CTAGTAAACTTCAGGAACAGGCCCGAGTACTAAAGGAGAGATGCCAAGGTGAAAGTAC 

CCAACTTCAAAATGAGATACAAAAGCTACAGAAGACCCTGAAAAAAAAAACCAAGAGA 

TATATGTCGCATAAGCTAAAGATCTAAACAACAGAGCTTTTCTGTCATC 

GG CATAACTGAAACAATTTTAG AATTTG GAAC AAGTG TC ACT AT ATTTG AT AATAATT 

AGATCTTG CATCATAACAC 




ORF Start: ATG at 151 j 


ORF Stop: TAA at 1939 




SEQIDNO: 162 | 


596 aa 1 MW at 68176.6kD 


NOV54a, 
CG97528-01 Protein Sequence 


MAP E IHMTGPMCIil ENTNG ELVANPE ALKI LSAITQPVVVVAI VGLYRTGKSYLMNKL 
AGKNKGFSLGSTVKSHTKXSIWMWC^PHPKKPEHTLVLLDTEGLGDVKKGDNQNDSWIF 
TLAVIiLSSTLVyNSMGTINQQAMDQLQYVTELTHRIRSKSSPDENENEDSADFVSFFP 
DFVWTLRDFSLDLEADGQPLTPDEYLEYSLKLTQGTSQKDKNFNLPRLCIRKFFPKKK 
CFVFDLPIHRRKLAQLEKLQDEELDPEFVQQVADFCSYIFSNSKTKTLSGGIKVNGPR 
LESLVLTYINAI SRGDLPCMENAVLALAQI ENSAAVQKAIAHYDQQMGQKVQLPAETL 
QELLDLHRVS EREATE VYMKNS FKDVDHLFQKKLKAQLDKKRDDFCKQNQ EAS SDRCS 
ALLQVI FS PIiEEEVKAGI YSKPGGYCLFIQKLQDLEKKYYEEPRKGPKAEEILQTYLK 
SKESVTDAILG/TDQILTEKEKEIEGVECVKAESAQASAKMVEEMQIKYQQMMEEKEKS 
YQEHVKQLTEKM ERERAQLLEEQE KTLTS KLQEQARVLKERCOGESTQLQNE I QKLQK 
TLKKKTKRYMSHKLKI 
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Further analysis of the NOV54a protein yielded the following properties shown in 
Table 54B. 



Table 54B. Protein Sequence Properties NOV54a 


PSort analysis: 


0.8000 probability located in nucleus; 0.6000 probability located in endoplasmic 
reticulum (membrane); 0.3000 probability located in microbody (peroxisome); 
0. 1 000 probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV54a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 54C. 



Table 54C. Geneseq Results for NOV54a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, Date] 


NOV54a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM41334 


Human polypeptide SEQ ID NO 6265 - 
Homo sapiens, 605 aa. [WO200153312- 
A1,26-JUL-2001] 


I. .596 

II. .605 


591/596 (99%) 
592/596 (99%) 


0.0 

i 


AAM41333 


Human polypeptide SEQ ID NO 6264 - 
Homo sapiens, 605 aa. [WO200153312- 
A1,26-JUL-2001] 


I. .596. 

II. .605 


591/596 (99%) 
592/596 (99%) 


0.0 


ABB 12327 


Human guanylate binding protein 
homologue, SEQ IDMO:2697 - Homo 
sapiens, 605 aa. [WO200157188-A2, 09- 
AUG-2001] 


I. .596 

II. .605 


591/596 (99%) 
592/596 (99%) 


0.0 


AAU25444 


Human mddt protein from clone 
LG:444850.9:2000MAY19 - Homo 
sapiens, 570 aa. [WO200162922-A2, 30- 
AUG-2001] 


I. .554 

II. .563 


547/554 (98%) 
549/554 (98%) 


0.0 


AAM39547 


Human polypeptide SEQ ID NO 2692 - 
Homo sapiens, 592 aa. [WO200153312- 
A1,26-JUL-2001] 


1..585 
1..586 


514/587 (87%) 
550/587 (93%) 


0.0 



In a BLAST search of public sequence datbases, the NOV54a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 54D. 
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Table 54D. Public BLASTP Results for NOV54a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV54a 
Residues/ 
Match Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


P32455 


Interferon-induced guanylate-binding 
protein 1 (Guanine nucleotide- binding 
protein 1) - Homo sapiens (Human), 592 
aa. 


L.585 
1..586 


508/587 
(86%) 

547/587 
(92%) 


0.0 


P32456 


Interferon-induced guanylate-binding 
protein 2 (Guanine nucleotide- binding 
protein 2) - Homo sapiens (Human), 59 i 
aa. 


1..583 
1..582 


441/583 
(75%) 

508/583 
(86%) 


0.0 


Q9NV33 

i 


CDNA FU10961 FIS, CLONE 
PLACE1000588, HIGHLY SIMILAR 
TO INTERFERON-INDUCED 
GUANYLATE-BINDING PROTEIN 1 - 
Homo sapiens (Human), 447 aa 
(fragment). 


144..582 
. 10..447 


434/439 
(98%) 

435/439 
(98%) 


0.0 


Q01514 


Interferon-induced guanylate-binding 
protein 1 (Guanine nucleotide- binding 
protein 1) (Interferon-gamma inducible 
protein MAG-I) - Mus musculus 
(Mouse), 589 aa. 


1..579 
I..578 


392/579 
(67%) 

481/579 
(82%) 


0.0 


Q96PP8 


GUANYLATE BINDING PROTEIN 5 - 
Homo sapiens (Human), 586 aa. 


1..582 
1..577 


394/582 
(67%) 

469/582 
(79%) 


0.0 



PFam analysis predicts that the NOV54a protein contains the domains shown in 
the Table 54E. 



Table 54E. Domain Analysis of NOV54a 


Pfam Domain 


NOV54a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


GBP 


6..280 


196/284(69%) 
256/284(90%) 


1.6e-192 


GBP_C 


282..579 


169/305 (55%) 
255/305 (84%) 


2.5e-I44 
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Example 55. 

The NOV55 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 5 5 A. 



Table 55A. NOV55 Sequence Analysis 


| SEQIDNO:163 j 11 84 bp 


NOV55a, 
CG97629-01 DNA Sequence 


TATCTAGAGATGCTTTAAATTCCTGAGGGGTTGGAGGACTTAGCTTACGTGCTAAGAT 
GCTAGAGTCTGGTTTCTCACTCTCATGorTRnfiRaPrtTZi a nsTrTm^^^^^ 


ATGCAAAACTGGACTTCTTCGGACAGGGACATTTTGCTACAGCCTATAAGGCCAGAGA 
TAAGAACACCAACCAAATCGTCACCATTAAGAAAATCAAACTTGGACZACAGGTCAGAA 
G C T AAAG GTGGTATAAATAG AACAG C CTTAAG G G AG AT AAAG CT CTTACAG AAACT AA 

GTCATCCGAACACAATTGGTCTCC^GACACATTTGGACATAAGTCTAACATTAGCCT 
TGTCTTGGATGTTATGGAAACTGACCTAGAGGTTATAATAAAGGATAACAGTCTTGTG 
CTGACACCATCCCACATTCAAGCCTACATGTTGATGGCTTTTCAGGGCTTAGAATATT 
TGCATCAGCATTGGATTCTACACAGGGATCTGAAACCAAAGGACTTCTTGTTAGATAA 
GAATGGAGTTCTGAAACTGGCAGACTTTGTCCTGGTCAAATCATTTGGGAGCCCCAGC 
AGGACTTACACACATCAAGTTGTGACCAGATGGTATTGGGCCCCTGTGGAGTTATTGT 
TTGGAACTAGGATGTATGATGTGGGAATGGACATGTGGGCTGTTTGTTGTTTATTAGC 
AGAATTGCTTCTAAGGGTTCCTTTTTTGCCTGGAGAGTCAGACCTTGATCAGCTAACA 
AGAATATTTGAAACTTTGGGTACACCAACTGAAGAACAGTGGCCTGACATGTGTAGCC 
TTCC^GATTCTGTGACATTTAAGAGTTTCCCTGGGGTCCCACTACAACACATCTTCAC 
TGCAGCTGGGGACGACTTGCTGGATCTCATCCAAGGCTTGTTCTTCTTTAACCCGTGT 
ACTAGGATCACAGCCTCACAGGCATTGAAGACCAAGGACTTCAGAAACTGTCCAGGGC 
CAACACCTGGATGCCAGCTTCCAAGACCAAACTGTTCAGTGGAAGCATTAAAGGAACC 
GTCCAATCCAACCATGGCAACAAAACAGAAAAAGACAGAGGCCTTAGAACAAGGAATA 
TTGCCCAAGAAGCTATGTTTTTAGTTCCAGTGAACAATGGACAGTTCCACTGCTGAAA 
G AAATG ATCC AAAGG CAAATAATA 




ORF Start: ATG at 83 ORF Stop: TAG at 1 1 24 


t 


SEQ ID NO: 1 64 j 347 aa ] M W at 39095.0kD 


NOV55a, 
CG97629-01 Protein Sequence 


MAGDVKSQANRYAKLDFFGQGHPATAYKARDKNTNQIVTIKKIKLGHRSEAKGGINRT 
AIJIEIKLLQKLSHPNTIGLLDTFGHKSNISLVLDVMETDLEVIIKDNSLVLTPSHIQA 
YMLMAFCK3LEY1jHQHWILHRI>L^ 

TRWYWAPVELLFGTRMYDVGMDMWAVCCLLAELLLRVPFLPGESDLDQLTRIFETLGT 
PTEEQWPDMCSLPDSVTFKSFPGVPLQHIFTAAGDDLLDLIQGLFFFNPCTRITASQA 
LKTKDFRNCPGPTPGCQLPRPNCSVEALKEPSNPTMATKQKKTEALEQGILPKKLCF 



Further analysis of the NOV55a protein yielded the following properties shown ii 
5 Table 55B. 



Table 55B. Protein Sequence Properties NOV55a 


PSort analysis: 


0.7000 probability located in plasma membrane; 0.5582 probability located in 
microbody (peroxisome); 0.2000 probability located in endoplasmic reticulum 
(membrane); 0.1000 probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV55a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 55C. 
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Table 55C. Geneseq Results for NOV55a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV55a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB85505 


Human protein kinase SGK149 - 
Homo sapiens, 332 aa. 
[WO200155356-A2, 02-AUG-2001] 


23..347 
11. .332 


268/325 (82%) 
285/325 (87%) 


e-153 


AAB53379 


Human colon cancer antigen protein 
sequence SEQ ID NO:919 - Homo 
sapiens, 260 aa. [WO200055351-A1, 
21-SEP-2000] 


94..347 
8..260 


214/254 (84%) 
227/254(89%) 


e-124 


AAG75087 


Human colon cancer antigen protein 
SEQ ID NO:585 1 - Homo sapiens, 260 
aa. [WO2001 22920- A2, 05-APR-200I] 


94..347 
8..260 


214/254 (84%) 
227/254 (89%) 


e-124 


ABB59809 


Drosophila melanogaster polypeptide 

SEQ ID NO 6219 - Drosophila 
melanogaster, 353 aa. [WO200171042- 
A2, 27-SEP-2001] 


1..339 
1..342 


201/344 (58%) 
252/344 (72%) 


e-113 


AAG 14009 ; 


Arabidopsis thaliana protein fragment 

SEQ ID NO: 13712 - Arabidopsis 
thaliana, 348 aa. [EP 103 3405 -A2, 06- 
SEP-2000] 


10..319 
11..316 


151/310(48%) 
209/310(66%) 


2e-85 



In a BLAST Search of public sequence datbases, the NOV55a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 55D. 
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Table 55D. Public BLASTP Results for NOV55a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV55a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q03147 

i 


Cell division protein kinase 7 (EC 
2.7.1.-) (CDK-activating kinase) (CAK) 
(TFIIH basal transcription factor 
complex kinase subunit) (39 kDa 
protein kinase) (P39 Mol5)(Protein- 
tyrosine kinase MPK-7) (CR4 protein 
kinase) (CRK4) - Mus musculus 
(Mouse), 346 aa. 


1..347 
I. .346 


298/347 (85%) 
313/347 (89%) 


e-173 


P50613 

\ 

1 


Cell division protein kinase 7 (EC 
2.7. 1.-) (CDK-activating kinase) (CAK) 
(TFIIH basal transcription factor 
complex kinase subunit) (39 kDa 
protein kinase) (P39 Mo 15) (STK1) 
(CAK1) - Homo sapiens (Human), 346 
aa. 


1..347 
1..346 


292/347 (84%) 
308/347 (88%) 


e-169 


! 

i 


cell division cyde-2-related protein 
kinase (EC 2.7. 1 .-) CRK4 - mouse, 346 
aa. 


1..347 
I. .346 


292/347 (84%) 
309/347 (88%) 


e-169 


1 P51952 


Cell division protein kinase 7 (EC 
2.7.1 .-) (CDK-activating kinase) (CAK) 
(TFIIH basal transcription factor 
complex kinase subunit) (39 protein 
kinase) (P39 Mo 15) - Rattus norvegicus 
(Rat), 329 aa (fragment). 


9..338 
1.329 


281/330 (85%) 
295/330 (89%) 


e-163 


S51085 


CdK-activating kinase Cdk7 - rat, 312 
aa (fragment). 


9..320 
1..31 1 


269/312(86%) 
281/312(89%) 


e-156 



PFam analysis predicts that the NOV55a protein contains the domains shown in 
the Table 55E. 



Table 55E. Domain Analysis of NOV55a 


Pfam Domain 


NOV55a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


pkinase 


12..296 


99/301 (33%) 
217/301 (72%) 


I.4e-61 
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Example 56. 

The NOV56 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 56A. 



1 — — — , 

| Table 56A. NOV56 Sequence Analysis 




SEQ ID NO: 165 


1821 bp 


NOV56a, 
CG97648-0! DNA Sequence 

< 

% 

\ 

i 

\ 
f 

! 
j 

i 

i 

* 

1 


CGGGAAGGGAAAGCAGCCAGCAGCCCTCCAGCCCTCTTGTGCTTTCCCTGGGAGTGCG 

CCCCGTG CTCAGCCATGGTGG ACATGGGGGCCCTGGACAAC CTGATCGCCAAC ACCRf* 

CTACCTG CAGG CCCGG AAG CCCTCGG ACTGCG ACAG CAAAGAGCTGCAGCGGCGGCGG 

CGTAGCCTGGCCCTGCCCGGGCTGCAGGGCTGCGCGGAGCTCCGCCAGAAGCTGTCCC 

TGAACTTCCJVCAGCCTGTGTGAGCAGCAGCCCATCGGTCGCCX5CCTCTTCCGTGACTT 

CCTAGC CACAGTG CCCACGTTCCGCAAGGCGGC AACCTTCCTAG AGGACGTGCAG AAC 

TGGGAGCTGGCCGAGGAGGGACCCACCAAAGACAGCGCGCTGCAGGGGCTGGTGGCCA 

CTTGTGCGAGTGCCCCTGCCCCGGGGAACCCGCAACCCTTCCTCAGCCAGGCCGTGGC 

CACCJ^GTGCCAAGCAGCCACCACTGAGGAAGAGCGAGTGGCTGCAGTGACGCTGGCC 

AAGGCTGAGGCCATGGCTTTCTTGC1AAGAGCAGCCCTTTAAGGATTTCGTGACCAGCG 

CCTTCTACGACAAGTTTCTGCAGTGGAAACTCTTCGAGATGCAACCAGTGTCAGACAA 

GTACTTC^CTGAGTTCAGAGTGCTGGGGAAAGGTGGTTTTGGfGGAGGTATGTGCCGTC 

CAGGTGAAAAACACTGGG AAGATGTATG CCTGTAAG AAACTGGACAAGAAGCGG CTG A 

AGAAGAAAGGTGGCGAGAAGATGGCTCTCTTGGAAAAGGAAATCTTGGAGAAGGTCAG 

CAGCCCTTTCATTGTCTCTCTGGC CTATGCCTTTGAG AG CAAGACCC ATCTCTG CCTT 

GTCATGAGCCTGATGAATGGGGGAGACCTCAAGTTCCACATCTACAACGTGGGCACGC 

GTGGCCTGGACATGAGCCGGGTGATCTTTTACTCGGCCCAGATAGCCTGTGGGATGCT 

GCACCTCCATGAACTCGG CATCGTCTATCGGGACATGAAGCCTGAGAATGTG CTTCTG 

GATGACCTCGGCAACTGCAGGTTATCTGACCTGGGGCTGGCCGTGGAGATGAAGGGTG 

GCAAGCCCATCACCCAGAGGGCTGGAACCAATGGTTACATGGCTCCTGAGATCCTAAT 

GGAAAAGGTAAGTTATTCCTATCCTGTGGACTGGTTTGCCATGGGATGCAGCATTTAT 

GAAATGGTTGCTGGACGAACACCATTCAAAGATTACAAGGAAAAGGTCAGTAAAGAGG 

ATCTGAAGCAAAGAACTCTGCAAGACGAGGTCAAATTCCAGCATGATAACTTCACAGA 

GGAAGCAAAAGATATTTGCAGGCTCTTCTTGGCTAAGAAACCAGAGCAACGCTrA 

AGCAGAGAAAAGTCTGATGATCCC^GGAAAC^TCATTTCTTTAAAACGATCAACTTTC 

CTCGCCTGGAAGCTGGCCTAATTGAACCCCCATTTGTGCCAGACCCTTCAGTGGTTTA 

TGCGAAAGA(^TCGCTGAAATTGATGATTTCTCTGAGGTTCGGGGGGTGGAATTTGAT 

GAOW^GATAAGCAGTTCTTCAAAAACTTTGCGAC^GGTGCTGTTCCTATAGCATGGC 

AGGAAGAAATTATAGAAACGGGACTGTTTGAGGAACTGAATGACCCCAACAGACCTAC 


CrCTTTACCAGACAGGCAGCAGGAGTCTCGGCTGACATAATCCTCGAATGTTCCACAC 
G TGGAAAT CTGTGGAATGAGGGC 




ORF Start: ATG at 73 


ORF Stop: TA A at 1732 




SEQ ID NO: 166 


553 aa 


MW at 62210.8kD 


NOV56a, 
CG97648-01 Protein Sequence 


MVDMGALDNLIANTAYLQARKPSDCDSKELQRRRRSLALPGLQGCAELRQKLSLNFHS 
LCEQQPIGRRLFRDFLATVPTFRKAATFLEDVQNWELAEEGPTKDSALQGLVATCASA 
PAPGNPQPFLSQAVATKCQAATTEEERVAAVTLAKAEAMAFLQEQPFKDFVTSAFYDK 
FLQWKLFEMQPVSDKYFTEFRVLGKGGFGEVCAVQVKNT^ 

EKMALLEKEILEKVSSPFIVSLAYAFESKTHLCLVMSLMNGGDLKFHIYNVGTRGLDM 
SRVI FYS AQI ACXjMLHLHEIjG I VYRDMKPENVLLDDLGNCRLSDLGIiAVEMKGGKPIT 
QRAGTNGYMAPE I LMEKVS YS YPVDWFAMGCS I YEMVAGRTPFKDYKEKVS KEDLKQR 
TLQDEVKFQHDNFTEEAKDICRLFLAKKPEQRLGSREKSDDPRKHHFFKTINFPRLEA 
GLIEPPFVPDPSWYAKDI AE IDDFSEVRGVEFDDKDKQFFKNFATGAVPIAWQEEI I 
ETGLFEELNDPNRPTGCEEGNS S KSGVCLLL 



Further analysis of the NOV56a protein yielded the following properties shown in 
Table 56B. 
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Table 56B. Protein Sequence Properties NOV56a 


PSort analysis: 


0.9685 probability located in nucleus; 0.1 101 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV56a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 56C. 



Table 56C. Geneseq Results for NO V56a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#,Date] 


NOV56a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAE 16273 


Human kinase PKIN-19 protein - 
Homo sapiens, 553 aa. 
[ WO200 1 96547-A2, 20-DEC- 
2001] 


1..553 
1..553 


553/553(100%) 
553/553(100%) 


0.0 


AAG77815 

i 


Human G-protein coupled receptor 
kinase 1 protein - Homo sapiens, 
553 aa. [WO200168869-A2, 20- 
SEP-2001] 


1..553 
1..553 


553/553 (100%) 
553/553 (100%) 


0.0 


AAU03502 


Human protein kinase #2 - Homo 
sapiens, 553 aa. [WO200 138503- 
A2,31-MAY-2001] 


1..553 
1..553 


551/553 (99%) 
551/553 (99%) 


0.0 


AAG77816 


Human G-protein coupled receptor 
kinase 2 protein - Homo sapiens, 
353 aa. [WO200168869-A2, 20- 
SEP-2001] 


1.350 
1..350 


350/350(100%) 
350/350(100%) 


0.0 


AAY24423 


GRJK4 polymorphism GRK4-alpha 
protein sequence - Homo sapiens, 
578 aa. [W09935279-A1, 15-JUL- 
1999] 


7..531 
3..526 


253/528 (47%) 
348/528 (64%) 


e-140 



In a BLAST search of public sequence datbases, the NOV56a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 56D. 
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Table 56D. Public BLASTP Results for NOV56a 


Protein 
Accession 
Number 


Prote in/Organism/Length 


INUVJOa tVCMUUCb/ 

Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q8WTQ7 

! 


G-PROTEIN-COUPLED 
RECEPTOR KINASE 7 (G 
PROTEIN-COUPLED * 
RECEPTOR KINASE 7) - Homo 
sapiens (Human), 553 aa. 


1 ..553 
1..553 


553/553 (100%) 
553/553 (100%) 


0.0 


Q8WMV0 


RETINA G PROTEIN- 
COUPLED RECEPTOR 
KINASE 7 - Bos taurus (Bovine), 
552 aa. 


1..553 
1..552 


475/553 (85%) 
508/553 (90%) 


0.0 


Q8WPI5 


G PROTEIN-COUPLED 
RECEPTOR KINASE 7 - Sus 
scrofa (Pig), 553 aa. 


1 ..553 
1..553 


470/553 (84%) 
509/553 (91%) 


0.0 


Q9Z2G7 


G PROTEIN-COUPLED 
RECEPTOR KINASE GRK7 - 
Spermophilus tridecemlineatus 
(Thirteen-Iined ground squirrel), 
548 aa. 


2..553 
1..548 


468/552 (84%) 
507/552 (91%) 


0.0 


073658 


OLGRK-C - Oryzias latipes 
(Medaka fish), 557 aa. 


1..553 
I ..557 


330/563 (58%) 
416/563 (73%) 


0.0 



PFam analysis predicts that the NOV56a protein contains the domains shown in 
the Table 56E. 



Table 56E. Domain Analysis of NOV56a 


Pfam Domain 


NOV56a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


RGS 


55..78 


13/24 (54%) 
22/24(92%) 


5.6e-07 


pkinase 


191. .454 


94/294 (32%) 
202/294 (69%) 


5.2e-70 


pkinase_C 


455..471 


8/17(47%) 
13/17(76%) 


0.3 



Example 57. 



The NOV57 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 57A. 
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Table 57A. NOV57 Sequence Analysis 




SEQIDNO: 167 


1722 bp 


NOV57a, 
CG97658-01 DMA Sequence 

■ 


GCATGGTGGCACCATCTACCACAGGTCTCCCCCAGCCGATAGTGATGGAGGCACTGGA 
CGAGGCTGAAGGGCTCCAGGACTC^CAGAGAGAGATGCCGCCACCCCCTCCTCCCTCG 
CCGCCCTCAGATCCAGCTCAGAAGCCACCACCTCGAGGCGCTGGGAGCCACrCCCTCA 
CTGTC^GGAGCAGCCTGTGCCTGTTCGCTGCCTC^CAGTTCCTGCTTGCCTGTGGGGT 
GCTCrGGTTCAGCGGTTATGGCCACATGTGGTCACAGAACGCCACAAACCTCGTCTCC 
TCTTTGCTGACGCTCCTGAAACAGCTGGAACCCACGTCCTGGCTTGACTCTGGGACGT 
GGGGAGTCCCCGGTCTGCTGCTGGTCTTTCTGTCCGTGGGCCTGGTCCTCGTTACCAC 
CCTGGTGTGGCACCTCCTGAGGACACCCCCAGAGCCACCCACCCCACTGCCCCCTGAG 
GACAGGCGCCAGTCAGTGAGCCGCCAGCCCTCCTTCACCTACTCAGAGTGGATGGAGG 
AGAAGATCGAGGATGACTTCCTGGACCrCGACCCGGTGCCCGAGACTCCTGTGTTTGA 
TTGTGTGATGGACATCAAGCCTGAGGCTGACCCCACCTCACTCACCGTCAAGTCCATG 
GGTCTGCAGGAGAGGAGGGGTTCCAATGTCTCGCTGACCCTGGACATGTGCACTCCGG 
GCTGCAACGAGGAGGGCTTTGGCTATCTCATGTCCCCACGTGAGGAGTCCGCCCGCGA 
GTACCTGCTCAGCGCCTCCCGTGTCCTCCAAGCAGAAGAGCTTCATGAAAAGGCCCTG 
GACCCTTTCCTGCTGCAGGCGGAATTCrTTGAAATCCCCATGAACTTTGTGGTACCGA 
AAGAGTACGACATCCCTCGGAGGTGTCGGAAGAACCGGTACAAAACC^TACTTCCCAA 
CeCTCACAGCAGAGTGTGTCTGACCTCACCAGACCCTGACGACCCTCTGAGTTCCTAC 
ATCAATGCCAACTACATCCGGGGCTATGGTGGGGAGGAGAAGGTGTACATCGCCACTC 
AGGG ACCCATCGTC AGCACGGTCGCCGACTTCTGG CG CATGGTGTGG CAGGAGCACAC 
GCCCATCATTGTCATGATCACCAACATCGAGGAGATGAACGAGAAATGCACCGAGTAT 
TGGCCGGAGGAGCAGGTGGCGTACGACGGTGTTGAGATCACTGTGCAGAAAGTCATTC 
ACACGGAG<3ATTACCX3GCTGCGACTCATCTCCCTCAAGAGTGGGACTGAGGAGCGAGG 
CCTGAAGCATTACTGGTTCACATCCTGGCCCGACCAGAAGACCCCAGACCGGGCCCCC 
CCACTCCTGCACCTGGTGCGGGAGGTGGAGGAGGCAGCCCAGCAGGAGGGTCCCCACT 
GTGCCCCCATCATCGTCCACTGCAGTGCAGGGATTGGGAGGACCGGCTGCTTCATTGC 
CACCAGCAT CTG CTGCCAG CAGCTGCGGCAGG AGGGTGTAGTGG ACATCCTG AAGACC 
ACGTGCCAG CTCCGTCAGGACAGGGGCGGCATGATCCAG CACTGCG AGCAGT ACCAGT 
TTGTGCACCACGTCATGAGCCTCTACGAAAAGCAGCTGTCCCACCAGTCCCCAGAATG 
ACTG CG CTT CTC CTACAAGGT TCTCTGGG CACTG CCAGC CTG AG TCTCGGCCCTCACC 
CAGGGCCCTG CCTCGGGTCGGAATTCGATATCAAG CTTAT 




ORF Start: ATG at 3 


ORF Stop: TGA at 1623 




SEQIDNO: 168 


540 aa 


MWat 60799.6kD 


MOV57a, 
CG97658-01 Protein Sequence 


MVAPSTTGLPQPIVMEALDEAEGLQDSQREMPPPPPPSPPSDPAQKPPPRGAGSHSLT 
VRSS LCLFAASQFLLACGVLWFS G YGHMWS QNATNLVS S LLTLLKQLEPTS WLDSGTW 
GVPGLLLWLSVGLVLVTTLVWHLLRTPPEPPTPLPPEDRRQSVSRQPSPTYSEWMEE 
KI ED DFLDLD PVPETPVFDCVMD I KPE ADPTS LTVKSMGLQERRGSNVSLTLDMCTPG 
CNEEGFGYIjM s prees areylls ASRVLQAEELHEKALDPFLLQAE FFE I PMNFWPK 
EYDIPGRCRKNRYKTILPNPHSRVCLTSPDPDDPLSSYINANYIRGYGGEEKVYIATQ 
GPIVSTVADFWRMVWQEHTPIIVMITNIEEMNEKCTEYWPEEQVAYDGVEITVQKVIH 
TEDYRLRLISLKSGTEERGLKHYWFTSWPDQKTPDRAPPLLHLVREVEEAAQQEGPHC 
API IVHCSAGIGRTGCFIATS ICCQQLRQEGWDILKTTCQLRQDRGGMIQHCEQYQF 
VHHVMSLYEKQLSHQSPE 



Further analysis of the NOV57a protein yielded the following properties shown in 
Table 57B. 



Table 57B. Protein Sequence Properties NOV57a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.1000 
probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV57a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 57C. 



Table 57C. Geneseq Results for NOV57a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#,Date] 


NOV57a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAG67630 


Amino acid sequence of a human 
protein - Homo sapiens, 537 aa. 
[WO200 1 093 1 6-A J , 08-FEB- 
2001] 


8..540 
5..537 


532/533 (99%) 
532/533 (99%) 


0.0 


AAG67451 


Amino acid sequence of a human 
polypeptide - Homo sapiens, 537 
aa. [WO200109345-A1, 08-FEB- 
2001] 


8..540 
5..537 


532/533 (99%) 
532/533 (99%) 


0.0 


AAM39481 


Human polypeptide SEQ ID NO 
2626 - Homo sapiens, 565 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


8..540 
33..565 


527/533 (98%) 
529/533 (98%) 


0.0 


AAM41267 

< 


Human polypeptide SEQ ID NO 
6198 - Homo sapiens, 600 aa. 
[WO2001533l2-Al,26-JUL- 
2001] 


8..540 
68..600 


526/533 (98%) 
528/533 (98%) 


0.0 


AAE 14457 


Human protein phosphatase-7 - 

Homo sapiens, 541 aa. 
[WO200196546-A2, 20-DEC- 
2001] 


8..540 
9..541 


526/533 (98%) 
528/533 (98%) 


0.0 



In a BLAST search of public sequence datbases, the NOV57a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 57D. 
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Table 57D. Public BLASTP Results for NOV57a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV57a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


P54829 


Protein-tyrosine phosphatase, non- 
receptor type 5 (EC 3. 1 .3.48) 
(Protein-tyrosine phosphatase 

striatum-enriched) (STEP) (Neural- 
specific protein-tyrosine 
phosphatase) - Homo sapiens 
(Human), 537 aa (fragment). 


8..540 
5..537 


532/533 (99%) 
532/533 (99%) 


0.0 


P54830 


Protein-tyrosine phosphatase, non- 
receptor type 5 (EC 3.1 .3.48) 
(Protein-tyrosine phosphatase 
striatum-enriched) (STEP) (Neural- 
specific protein-tyrosine 
phosphatase) - Mus rnusculus 
(Mouse), 541 aa. 


8..540 
9..541 


474/533 (88%) 
495/533 (91%) 


0.0 


P35234 


Protein-tyrosine phosphatase, non- 
receptor type 5 (EC 3. 1 .3.48) 
(Protein-tyrosine phosphatase 
striatum-enriched) (STEP) (Neural- 
specific protein-tyrosine 
phosphatase) - Rattus norvegicus 
(Rat), 369 aa. 


172..540 
1 .369 


346/369 (93%) 
354/369 (95%) 


0.0 


Q9BE09 


HYPOTHETICAL 42.1 KDA 
PROTEIN - Macaca fascicularis 

(Crab eating macaque) 
(Cynomplgus monkey), 370 aa. 


128..440 
1..313 


304/313 (97%) 
306/313(97%) 


0.0 


JC4263 


protein-tyrosine-phosphatase (EC 
3.1.3.48), receptor type PCPTP1 
precursor - rat, 656 aa. 


110..538 
222..654 


220/442 (49%) 
286/442 (63%) 


e-114 



PFam analysis predicts that the NOV57a protein contains the domains shown in 
the Table 57& 



Table 57E. Domain Analysis of NO V57a 


Pfam Domain 


NOV57a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Y_phosphatase 


297..529 


105/278(38%) 
197/278 (71%) 


2.3e-103 
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Example 58. 

The NOV58 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 58A. 



Table 58A. NOV58 Sequence Analysis 



SEQ ID NO: 169 



2937 bp 



NOV58a, 
CG97842-01 DNA Sequence 



AT^GTTCTCCACTATCCTTCTGTTTTTCTAGGTAACTATAACTACCCAATATTGCAG 
C^TGGAGTCCJVTGCTTAATAAATTGAAGAGTACTGTTACAAAAGTAACAGCTGATGT 
CACTAGTGCTGTAATGGGAAATCCTGTCACTAGAGAATTTGATGTTGGTCGACACATT 
G CCAGTGGTGGCAATGGG CTAG CTTGG AAGATTTTTAATGGCACAAAAAAGT CAACAA 
AGCAGGAAGTGGCAGTTTTTGTCTTTGATAAAAAACTGATTGACAAGTATCAAAAATT 
TGAAAAGGATCAAATCATTGATTCTCTAAAACGAGGAGTCCAACAGTTAACTCGGCTT 
CGACACCCTCGACTTCTTACTGTCCAGC^TCCTTTAGAAGAATCCAGGGATTGCTTGG 
C^TTTTGTACAGAACCAGTTTTTGCCAGTTTAGCCAATGTTCTTGGTAACTGGGAAAA 
TCTACCTTCCCCTATATCTCCAGACATTAAGGATTATAAACTTTATGATGTAGAAACC 
AAATATGGTTTGCTTCAGGTTTCrGAAGGATTGTCATTCTTGCATAGCAGTGTGAAAA 
TGGTGCATGGAAATATCACTCCTGAAAATATAATTTTGAATAAAAGTGGAGCCTGGAA 
AATAATGGGTTTTGATTTTTGTGTATCATCAACCM^ 

ATTTGGCTCCTGAATACATACTTTCTGTGAGCTGTGAAACAGCCAGTGATATGTATTC 
TTTAGGAACTGTTATGTATGCTGTATTTAATAAAGGGAAACCTATATTTGAAGTCAAC 
AAGCMGATATTTACAAGAGTrrCAGTAGGCAGTTGGATCAGTTGAGTCGTTTAGGAT 
CTAGTTCACTTACAAATATACCTGAGGAAGTTCGTGAACATGTAAAGCTACTGTTAAA 
TGTAACTCCGACTGTAAGACC^GATGCAGATCAAATGACAAAGATTCCCTTCTTTGAT 
GATGTTGGTGCAGTAACACTGCAATATTTTGATACCTTATTCCAAAGAGATAATCTTC 
AGAAATCACAGTTTTTCJ^GGACTGCCAAAGGTTCTACCAAAACTGCCCAAGCGTGT 
CATTGTGCAGAGAATTTTGCCTTGTTrGACTTCAGAATTTGTAAACCCTGACATGGTA 
CCTTTTGTTTTGCCCAATGTTCTAC1TATTGCTGAGGAATGCACCAAAGAAGAATATG 
T C AAATTAATT CTTC CTG AACTTG G C C CTGTG TTT AAG C AG C AGGAG CC AAT C CAG AT 
TTTGTTAATTTTCCTACAAAAAATGGATTTG CTACT AACCAAAACCC CTCCTGATGAG 
ATAAAGAACAGTGTTCTACCCATGGTTTACAGAGCACTAGAAGCTCCTTCCATTCAGA 
TCCaGGAGCTCTGTCTAAACATCATTCCAACCTTTGCAAATCTTATAGACTACCCATC 
CATGAAAAACGCTTTGATACCAAGAATTAAAAATGCTTGTCTACAAACATCTTCCCTT 
G CGGTT CGTGT AAATTCATTAGTGTG CTTAGGAAAG ATTTTGGAAT ACTTGGAT AAGT 
GGTTTGT ACTTGATG ATAT CCTACCCTTCTTAC1AACAAATTC CATCCAAGG AAC CTGC 
GGTCCTGATGGGAATTTTAGGTATTTACAAATGTAC^ CAT AAGAAGTTGGGA 
AT CACCAAAGAG CAG CTGGCCGGAAAAGTG TTG CCTCAT CTT ATTC CCCTGAGTATTG 
AAAACAATCTTAATCTTAATCAGTTCAATT CTTT CATTTCCGTCAT AAAAGAAATGCT 
T AATAG ATTGGAGTCTG AACATAAG ACTAAACTG G AG CAACTTCAT ATAATG CAAGAA 
CAGCAGAAAT CTTTGGAT ATAGG AAATCAAATG AATGTTT CTG AGG AG ATGAAAGTTA 
CAAATATTGGGAATCAGCAAATTGACAAAGTTTTTAACAACATTGGAGCAGACCTTCT 
GACTGG CAGTGAGTCCGAAAATAAAG AGG ACG GGTT ACAGAATAAACAT AAAAG AG CA 
TGACTTACIACTTGAAGAAAAACAAAAATT AG CAAAAGAAC AAGAG CAGG CACAGAAG C 
TGAAAAGCCAGCAGCCTCTTAAACCCCAAGTC 

GACTAAGG ACTTGACAG ACACACTG ATGGATAATATGT (^TC CTTG ACCAGC OT CT 
GTTAGTACCCCT\AAATCTTCTGCTTCAAGTACTTTCACTTCT 

TTGGTATGATGTITTCTA^CCAACTGATAATACAAAGAGAAATTTGACAAATGGCCT 
AAATGCC^TATGGGCTTTCAGACTTC^G^ATTCAAC^TGCCCGTTAATACAAACCAG 
AACTTCTACAGTAGTCCAAGCACAGTTGGAGTGACCAAGATGACTCTGGGAACAC 
CCACTTTGCCAAACITCAATGCTTTGAGTGTTCCTCCT 
ACAAAG ACCCACAGATATG TCTG C CCTTAATAATCTCTTTGG CCCTCAGAAACCCAAA 
GTTAG CATG AACCAGTTATCACAAC7VG AAACCZAAAT(ZAGTGG CTT AATCAGTTTOT 
CTCCTCAAGGTTCTCCAACTATGGGCAGTTC^ 

AGGACAATCTGLrri ttGGTATG CAGGGTAATCCTTTCl u l"l'AACCCACAGAACTTTG CA 
CAGCC^CCAACTACTATGACCAATAGCAGTTC^GCrAGCAAT 
TTGGGTGAGGTGTCITACTTCTATTTTGAAGGATTATT^ 
GCTGATTTACATCTTTATATAGTTGG CTTGG AGG AAG 
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ORF Start: ATG at 61 


ORF Stop: TGA at 2848 




SEQIDNO: 170 


929 aa 


MW at!03707.8kD 


NOV58a, 
CG97842-01 Protein Sequence 


MESMLNKLKSTVTKVTADVTSAVMGNPVTRE FDVGRHIASGGNGLAWKI FNGTKKSTK 
QEVAVFVFDKKLIDKYQKFBKDQIIDSLKRGVQQLTRLRHPRLLTVQHPLEESRDCIA 
FCTE P VFAS LANVLGNWENL PS PIS PD I KD YKLYD VETKYGLLQVS EGLS FLHS S VKM 

VHGNITPENIILNKSGAWKIMGFDFCVSSTNPSEQEPKFPCKEWDPNLPSLCLPNPEY 
IAPEYILSVSCETASDMYSLGTVMYAVFNKGKPIFEVNKQDIYKSFSRQLDQLSRLGS 
SSLTNIPEEVREHVKLLLNVTPTVRPDADQKTKIPFFDDVGAVTLQYFDTLFQRDNLQ 
KSQFFKGLPKVLPKLPKRVIVQRILPCLTSEFWPDMVPFVLPNVLLIAEECTKEEYV 
KL I L P ELG PVFKQQEPI Q I LL I FLQKM0LLLTKT P PDEI KNS VL PMVYRALE AP S I QI 
QELCLNIIPTFANLIDYPSMKNALIPRIKNACLQTSSIAVRVNSLVCLGKILEYLDKW 
FVLDDILPFLQQIPSKEPAVIiMGIIjGIYKCrrFTHKKLGITKEQLAGKVLPHLIPLSIE 
NNLNLNQFNS FI S VI KEMLNRLESEHKTKLEQLH IMQEQQKS LD I GNQMNVSE EMKVT 
NIGNQQIDKVFNNIGADLLTGSESENKEDGLQNKHKRASLTliEEKQKLAKEQEQAQKL 
KSQQPLKPQVHTPVATVKQTKDLTDTLMDNMSSLTSLSVSTPKSSASSTFTSVPSMGI 
GMMFSTPTDNTKimiTNGLNANMGFQTSGE^PVNTNQNFYSSPSTVGVTKMTLGTPP 
TLPNFNALSVPPAGAKQTQQRPTDMSAIiNNLFGPQKPKVSMNQLSQQKPNQWLNQFVP 

PQGSPTMGSSVMGTQMNVIGQSAFGMQGNPFFNPQNFAQPPTTMTNSSSASNDLKDLF 
G 



Further analysis of the NOV58a protein yielded the following properties shown ii . 
Table 58B. 



1 

1 Table 58B. Protein Sequence Properties NOV58a 

f 


PSort analysis: 


0.4564 probability located in mitochondrial matrix space; 0.3000 probability located 
in microbody (peroxisome); 0.3000 probability located in nucleus; 0.1507 
probability located in mitochondrial inner membrane 


| SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV58a protein against the Geneseq database, a proprietary 
5 database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 58C. 
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Table 58C. Geneseq Results for NOV58a 


Geneseq 
Identifier 

i 

i 


Protein/Organism/Length [Patent #, 
Date] 


NOV58a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


\ AAB85782 


Human kinase PKIN-I - Homo 
sapiens, 792 aa. [WO20016099I-A2, 
23-AUG-2001] 


153. .929 
12..792 


771/781 (98%) 
772/78! (98%) 


0.0 


AAG67427 


Amino acid sequence of a human 
protein kinase/protein phosphatase - 

Homo sapiens, 756 aa. 
[WO200109345-A1, 08-FEB-2001] 


174..929 
I. .756 


755/756 (99%) 
755/756 (99%) 


0.0 


AAB95177 


_ Human protein sequence SEQ ID 
NO: 17239 - Homo sapiens, 756 aa. 
[EP 1 0746 1 7- A2, 07-FEB-200 1 ] 


I74..929 
1..756 


755/756 (99%) 
755/756 (99%) 


0.0 


, AAB95772 

i 

! 


Human protein sequence SEQ ID 
NO: 1871 1 - Homo sapiens, 735 aa. 
[EP 1 0746 1 7-A2, 07-FEB-200 1 ] 


195..929 
I. .735 


732/735 (99%) « 
734/735(99%) 


0.0 


AAB65678 


Novel protein kinase, SEQ ID NO: 

206 - Homo sapiens, 505 aa. 
[WO200073469-A2, 07-DEC-2000] 


1..496 
1..496 


494/496 (99%) 
494/496 (99%) 


0.0 



In a BLAST search of public sequence datbases, the NOV58a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 58D. 
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Table 58D. Public BLASTP Results for NOV58a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV58a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9P2I7 


KIAA1360 PROTEIN - Homo sapiens 
(Human), 796 aa (fragment). 


138..929 
I. .796 

• 


788/796 (98%) 
789/796 (98%) 


0.0 


Q96ST4 


CDNA FU 14645 FIS, CLONE 
NT2RP2001839, WEAKLY SIMILAR 
TO SCY1 PROTEIN - Homo sapiens 
(Human), 756 aa. 


174..929 
L.756 


755/756 (99%) 
755/756 (99%) 


0 0 


Q9H7V5 


CDNA FLJ14212 FIS, CLONE 
NT2RP3003500, WEAKLY SIMILAR 
TO SCY! PROTEIN - Homo sapiens 
(Human), 735 aa. 


I95..929 
1..735 


732/735 (99%) 
734/735 (99%) 


0.0 


Q9NVH3 j 


CDNA FLJI0735 FIS, CLONE 
NT2RP3001407, WEAKLY SIMILAR 
TO SCY1 PROTEIN - Homo sapiens 
(Human), 446 aa. 


484..929 
1..446 


445/446 (99%) 
445/446 (99%) 


0.0 


Q96EF4 J 

i 


UNKNOWN (PROTEIN FOR 
MGC:8814) - Homo sapiens (Human), 
446 aa. 


484..929 
I. .446 


444/446 (99%) 
444/446 (99%) 


0.0 



PFam analysis predicts that the NOV58a protein contains the domains shown i 
the Table 58E. 



Table 58E. Domain Analysis of NOV58a 


Pfam Domain 


NOV58a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


pkinase 


159..327 


44/203 (22%) 
116/203(57%) 


3e-07 



Example 59. 



The NOV59 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 59A. 
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Table 59A. NOV59 Sequence Analysis 



SEQIDNO: 171 



2933 bp 



NOV59a, 
CG98021-OI DN A Sequence 



TGTTCTCTGCTGGATTCGCAGCAAC GAGCACAG CACGTAGGG CGTAGTTGGTGCTGGA 
TGGATGTTTGTTGAATGAATGAATGATGAATGGCTGGCACCTTGTCTGCTCATCCCTA 
ACTCCTGTTCCTTCATCTGTGCAGCCCTAATCTTTGTTTCCTCATCTGTCCATCCCTT 
TATTTGTG^ATCCTCATTCTTAGCCCCTTCACTGCCCTTCTCCATCTCTTCCTCCTTG 
TTCATTTGTCCCTGTTCTCTGTCCTCTACTCCACTCATGCCCATCTCTGTCCCCTTGA 
CTTAC CCAGTC CCTGCTACTATCT CCATCCCT AATTTCTGCCCTCTTGTCTGTCTACT 
CCTAATTCCTTTTCCTTGTCCATCCCTAATACCTGTCACCTTGTCCTTCTTCCTCGAA 
TCTCCATCCCTAATCCATCTGCCCCTAATCTCTGTCCCCTTTGCCCATCCrrTCCTTTT 
CTCGGTCTCTCTTTCCACCCTTATCTCGACACCTGCCCACCCTGCACTCCCATTCTGT 
TTCCCATCTGCACCCTTGCCCCATCCCTCCCACACACAGGACCAGACGGCCACCATGT 
CAGGAGACTACGAGGATGACCTCTGCCGGCGGGCACTCATCCTGGTCTCGGACCTCTG 
TGCGCGGGTCCGAGATGCTGACACCAACGACAGGTGCCAGGAGTTCAATGACCGAATC 
CGAGGCTATCCCCGGGGTCCAGATGCAGACATCTCCGTGAGCCTGCTGTCGGTCATCG 
TGACATTCTGTGGCATTGTCCTTCTGGGTGTCTCTCrCTTCGTGTCCTGGAAGTTGTG 
CTGGGTGCCCTGGCGGGACAAGGGAGG CTCGGCAGTGGGCGGTGG CCCCCTGCGCAAA 
GACCTAGGCCCTGGTGTCX3GGCTGGCAGGCCTGGTAGGCGGAGGCGGGCACCACCTGG 
CGGCTGGCCTGGGTGGCCATCCTCTGCTGGGCGGCCCACACCACCATGCCCATGCCGC 
CCACCATCCACCCTTTGCTGAGCrrGCrGGAGCCAGGCAGCCTGGGGGGTTCTGACACC 
CCTGAGCCCTCCTACTTGGACATGGACTCGTATCCAGAGGCTGCAGCAGCAGCAGTGG 
CCG CTGGGGTCAAACCGAG CCAAACATCCCCTGAGCTGCCCTCTGAGGGGGGAGCAGG 
CTCTGGGTTGCTCCTGCTGCCCCCCAGTGGTGGGGGCTTGCCCAGTGCCCAGTCACAT 
CAG CAGGTCACAAGCCTGG CACCCACTACCAGGTACCCAGCCCTGCCCCGACCCCTCA 
CCCAGCAGACTCTGACCTCCCAGCCGGACCCCAGCAGTGAGGAGCGCCCACCTGCCCT 
GCCCTTACCCCTGCCTGGAGGCGAGGAAAAAGCCAAACTCATTGGGCAGATTAAGCCA 
GAGCTGTACCAGGGGACTGGCCCTGGTGGCCGGCGGAGCGGTGGGGGCCCAGGCTCTG 

TGGCTCGGACCAGCTGGTGGTGAGGATCCTGCAGGCCCTGGACCTCCCTGCCAAGGAC 

TCCAACGGCITCrrCAGACCCCTACGTCAAGATCrrACCTGCTGCCTGACCGCAAGAAAA 

AGTTTCAGACCAAGGTGCACAGGAAGACCCTGAACCCCGTCrrCAATGAGACGTTTCA 

ATTCTCGGTGCCCCTGGCCGAGCTGGCCCAACGCAAACTGCACTTCAGCGTCTATGAC 

TTTGACCGCTTCTCGCGGCACGACCTCATCGGCCAGGTGGTGCTGGACAACCTCCTGG 

AG CTGG CCGAG CAGCCCCCTGACCGCCCGCTCTGGAGGG ACATCGTGGAGGGCGGCTC 

GGAAAAAGCAGATCTTGGGGAGCTCAACTTCTCACTCTGCTACCTCCCCACGGCCGGG 

CGCCTCACCGTGACCATCATCAAAGCCTCTAACCTCAAAGCGATGGACCTCACTGGCT 

TCTCAGACCCCTACGTGAAGGCCTCCCTGATCAGCGAGGGGCGGCGTCTGAAGAAGCG 

GAAAACCTCCATCAAGAAGAACACGCTGAACCCCACCTATAATGAGGCGCTGGTGTTC 

GACGTGGCCCCCGAGAGCGTGGAGAACGTGGGGCTCAGCATCGCCGTGGTGGACTACG 

ACTGCATCGGGCACAACGAGGTGATCGGCGTGTGCCGTGTGGGCCCCGACGCTGCCGA 

CCCGCACGGCCGCGAGCACTGGGCAGAGATGCTGGCCAATCCCCGCAAGCCC^TGGAG 

CACTGGCATCAGCTAGTGGAGGAAAAGACTGTGACCAGCTTCACAAAA^ 

GACTATCAGAGAAAGAGAACTCCGAGTGAGGGGTCTGGCCTAGGCCCGGGATCGGACC 

AGGCTCCCTCAGGACCCCATCCTTTCCTGCCCGGACCGTGAATTCATCTCCTTGAAGC 

CATAACGTCCGAGCTGCTGGTGCGGGGCAGCCCTGGCCCTAGGCTTCCTAACCCTGGA 

AGCGAGAGGATGAGAGGAGGCCGGCCCAGCTCCTTCTTTCZAGGGTGGGGGTCATTCAG 

CCTCCACTGTGTCTGTCTTTTCTTCCCTGGGGCTCCCCCTCGAGGCGAGGGGCCATGC 

TGTCCAAGACTCAGTGTCCCGACCCTTGTTCTCGCCGTGAATGTCAATGGGCCAATCC 
TCTGTCCTTTCAGACACACACACACCTGTGTCCACCCCTTCTGTTCGCCACACCCT 
GCGTCTGGCCGGTCCCCCCACTG CTG CTGCTATCAACGCCAGAATAAACACACTCTGT 
GGGTCTCACTCCAAAAAAAAAAAAAAAAAAAGC 





ORF Start: ATG at 635 


ORF Stop: TG A at 2405 




SEQIDNO: 172 


590 aa 


MW at 63303.0WD 


NOV59a, 
CG98021-01 Protein Sequence 


MSGDYEDDLCRRALILVSDLCARVRDADTNDRCQE FNDRI RGYPRGPDAD I SVSLLS V 
IVTFCGIVLLGVSLFVSWKLC^PWRDKGGSAVGGGPLRKDIXSPGVGLAGLVGGGGHH 
LAAGLGGHPLLGGPHHHAHAAHHPPFAELLEPGSLGGSDTPEPSYLDMDSYPEAAAAA 
VAAGVKPSQTSPELPSEGGAGSGLLLLPPSGGGLPSAQSHQQVTSIAPTTRYPALPRP 
LTQQTLTSQPDPSS EERP PALPLPLPGGEEKAKIi IGQI KPELYQGTGPGGRJISGGGPG 
SGEAGTGAPCGRISFALRYLYGSDQLWRILQALDLPAKDSNGFSDPYVKIYLLPDRK 
KKFQTKVHRKTI^PVFNETFQFSVPIiAEIAQ^ 

LELAEQPPDRPLWRDI VEGGSEKADLGELNFSLCYLPTAGRLTVTI I KASNLKAMDIiT 
G FS D P YVKAS L I SEGRRL KKRKTS I KKNTLNPTYNEALVFD VAP E S VENVGL S I AWD 
YDCIGHNEVIGVCKVGPDAADPHGREHWAEMLANPRKPVEHWHQLVEEKTVTSFTKGS 
KGLSEKENSE 
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Further analysis of the NOV59a protein yielded the following properties shown 
Table 59B. 



in 





Table 59B. Protein Sequence Properties NOV59a 


PSort analysis: 


0.6000 probability located in endoplasmic reticulum (membrane); 0 4600 
probability located in plasma membrane; 0.3128 probability located in microbody 
(peroxisome); 0.3000 probability located in nucleus 


SignalP analysis: 


Cleavage site between residues 28 and 29 



A search of the NOV59a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 59C. 
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Table 59C. Geneseq Results for NOV59a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV59a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU197I4 


Human novel extracellular matrix 
protein, Seq ID No 364 - Homo sapiens, 
295 aa. [WO200155368-A1, 02-AUG- 
2001] 


291..571 
1..281 


278/281 (98%) 
278/281 (98%) 


e-161 


AAU19715 


Human novel extracellular matrix 
protein, Seq ID No 365 - Homo sapiens, 
461 aa. [WO200155368-AI, 02-AUG- 
2001] 


77..583 
33..461 


235/509 (46%) 
304/509 (59%) 


e-116 


AAU 19847 


Human novel extracellular matrix 
protein, Seq ID No 497 - Homo sapiens, 
208 aa. [WO200I55368-A1, 02-AUG- 
2001] 


309..479 
19.. 1 89 


161/171 (94%) 
162/171 (94%) 


9e-88 


AAR97722 


Mouse inositol polyphosphate binding 
protein IP4-BP - Mus musculus, 422 aa. 
[JP08092290-A, 09-APR-1996] 


288..581 
130..422 


148/297(49%) 
203/297 (67%) 


4e-76 


ABB59660 


Drosophila melanogaster polypeptide 

SEQ ID NO 5772 - Drosophila 
melanogaster, 474 aa. [WO200171042- 
. A2,27-SEP-2001] 


301..581 
194..474 


148/287 (51%) 
200/287 (69%) 


le-75 



In a BLAST search of public sequence datbases, the NOV59a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 59D. 
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Table 59D. Public BLASTP Results for NOV59a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV59a 
Residues/ 
Match Residues 


Identities/ 

SimiljiritipQ fnr 

the Matched 
Portion 


Expect 
Value 


Q9BQG1 


Synaptotagmin III (SytlH) - Homo 
sapiens (Human), 590 aa. 


1..590 
1..590 


590/590 
(100%) 
590/590 

/ 1 f\(\0/\ 

\iv\)7o) 


0.0 


Q925B7 


SYNAPTOTAGMIN 3 - Rattus 
norvegicus (Rat), 588 aa. 


1..590 
1..588 


566/592 (95%) 
576/592 (96%) 


0.0 


P40748 


Synaptotagmin 111 (SytlH) - Rattus 
norvegicus (Rat), 588 aa. 


1..590 
1..588 


564/592 (95%) 
575/592 (96%) 


0.0 


035681 


Synaptotagmin III (SytlH) - Mus 
musculus (Mouse), 587 aa. 


1..590 
1..587 


561/592(94%) 
572/592(95%) 


0.0 


P24507 


Synaptotagmin C (Synaptic vesicle 
protein 0-P65-C) - Discopyge 
ommata (Electric ray), 537 aa. 


1..576 
1..512 


342/578(59%) 1 
405/578 (69%) 


0.0 



PFam analysis predicts that the NOV59a protein contains the domains shown in 
the Table 59E. 



Table 59E. Domain Analysis of NOV59a 


Pfam Domain 


NOV59a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


C2 


316..402 


56/97 (58%) 
82/97 (85%) 


1.4e-44 


C2 


448..536 


4.1/97(42%) 
79/97 (81%) 


1.5e~38 



Example 60. 



The NOV60 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 60A. 
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Table 60A. NOV60 Sequence Analysis 




SEQIDNO:173 


2133 bp 


NOV60a, 
CG9803O-01 DNA Sequence 

j 
1 

t 
\ 
t 

i 

I 
I 

I 

i 
1 

i 
1 


CGTTTACGGCATGTCATTCCTGGACACATGGC^TGTTCCATGGCGTGTGGrGnTnoa^ 
CTTGCAAGTATGAGAACCCAGCCCGCTGGAGTGAGCAGGAGCAAGCCATTAAGGGGGT 
TTACTC^TCCTGGGTC^CTGATAATATACTGGCC^^ 

CTGGAGAAGTACCACATCATTGATCAGTTCCTCAGCCATGGCATAAAAACAATAATCA 
ACCTCCAGCGCCCIX3GTGAGCATGCTAGCTGTGGGAACCCTCTGGAACAAGAAAGTGG 
CTTCACATACCTTCCTGAGGCTTTCATGGAGGCTGGCATTTACTTCTACAATTTCGGA 
TGGAAGGATTATGGTGTAGCGTCTCTTACTACTATCCTAGATATGGTGAAGGTGATGA 
CATTTGCCTTACAGGAAGGAAAAGTAGCTATCCATTGTCATGCAGGGCTTGGTCGAAC 
AGGTGTTTTAATAGCCTGTTACTTAGTTTTTGCAACGAGAATGACTGCTGACCAAGCA 
ATTATATTTGTGCGGGCAAAGCGACCCAATTCCATACAAACCAAGAGACAGCTCCTCT 
GTGTAAGGGAATTTACTCAGTTTCTAACTCCTCTCCGCAATATATTCTCTTGCTGTGA 
TCCCAAAGCACATG CTGTCACCTTACCTCAATATCTAATTCG CCAGCGTCATCTGCTT 
CATG GTTATGAGG CACGACTTCTG AAACACG TG CCAAAAATT AT CCAC CTAG TTTG CA 
AATTGCTGCTGGACTTAGCGGAGAACAGGCCAGTGATGATGAAGGATGTGTCCGAAGG 
ACCTGGTCTCTCTG CTG AAATAGAAAAGACAATGTCTGAGATGGTCACCATG CAGCTG 
GATAAAGAGTTACTGAGGCATGACAGTGATGTGTCCAACCCGCCTAACCCCACTGCAG 
TGGCAGCAGATTTTGACAATCGAGGCATGATTTTCTCCAATGAGCAACAGTTTGACCC 
TCTTTGGAAAAGGCGGAATGTTGAGTGCCTTCAACCCCTGACTCATCTGAAAAGGCGG 
CTCAGCTACAGTGACTCAGATTTAAAGAGGGCCGAGAACCTCCTGGAGCAAGGGGAGA 
CTCCACAGACAGTGCCTGCC(^GATCTTGGTTGGCCACAAGCCCAGGCAGCAGAAGCT 
CATAAGCCATTGTTACATCCCACAGTCTCCAGAACCAGACTTACACAAGGAAGCCTTG 
GTTCGCAGCACACTTTCTTTCTGGAGTC^GTCAAAGTTTGGAGGCCTGGAAGGACTCA 
AAGATAATGGGTCACCAATTTTCCATGGAAGGATCATTCCAAAGGAAGCACAGCAGAG 
TGGAGCTTTCTCTGCAGATGTTTCAGGCTCACACA^ 

AGCTTTGCAAATGTCCATAAGGATCCAAACCCTGCTCACCAGCAAGTGTCTCACTGTC 
AGTGTAAAACTCATGGTGTTGGGAGCCCTGGCTCTGTCAGGCAGAACAGCAGGACACC 
CCGAAGCCCTCTGGACTGTGGCTCCAGTCCCAAAGCACAGTTCTTGGTTGAACATGAA 
ACCCAGGACAGTAAAGATCTGTCTGAAGCAGCTTCACACTCTGCATTACAGTCTGAAT 
TGAGTGCTG AGGCAAGAAGAATACTGGCGG CCAAAGCCCTAG CAAATTTAAATGAATC 
TGTAGAAAAGGAGGAACTAAAAAGGAAGAAAGAG(nTAATTCCCGAGATGGAGCTTGG 
GAAAGAATATGTGGCGAGAGGGACCCTTTCATCCTATGCAGCTTGATGTGGTCTTGGG 
TGGAGCAACTGAAGGAGCCTGTAATCACCAAAGAGGATGTGGACATGTTGGTTGACAG 
GCGAGC^GATGCCGCAGAAGCACTTTTTTTATTAGAGAAGGGACAGCACCAGACTATT 
CTCTGCGTGTTGCACTGCATAGTGAACCTGCAGACAATTCCCGTGGATGTGGAGGAAG 
CTTTCCTTGCCCATGCCATTAAGGCATTCACTAAGGTTAATTTTGATTCTGAAAATGG 
ACCAACAGTTTACAACACCCTGAAGAAAATATTTAAGCACACGCTGGAAGAAAAAAGA 
AAAATGACAAAAGATGG CCCTAAGCCTGG CCTCT AGCTTTCACTC 




ORF Start: ATG at 28 


ORF Stop: TAG at 2122 




SEQ ID NO: 174 


698 aa 


MW at 78224.7kD 


NOV60a, 
CG98030-01 Protein Sequence 


MACSMACGGRAC^CYENPARWSEQEQAIKGVYSSWVTDNILAMARPSSELLEKYHIIDQ 
FLSHGIKTIINLQRPGEHASCGNPLEQESGFTYLPEAFMEAGIYFYNFGWKDYGVASL 
TTILDMVKVMTFALQEGKVAIHCHAGLGRTGVLI ACYLVFATRMTADQAI I FVRAKRP 
NSIQTIOIQLLCVREFTQFLTPLRNIFSCCDPKAHAV^ 

HVPKI IHLVCKLIjLDLAENRPA/MMKDVS EG PGLS AE I EKTMS EMVTMQLDKELLRHDS 
DVSNPPNPTAVAAD FDNRGM I FSNEQQFDPLWKRRNVECLQPLTHLKRRLS YSDSDLK 
RAENLLEG^ETPG^PAQILVGHKPRQQKLISHCYIPQSPEPDI^KEALVR5TLSFWS 
QSKFGGLEGLKDNGSPIFHGRIIPKEAQQSGAFSADVSGSHSPGEPVSPSFANVHKDP 
NPAHQQVSHCQCKTHGVGS PGSVRQNSRTPRS PLOCGS S PKAQFLVEHETQDSKDLS E 
AASHS ALQS ELS AEARRI LAAKALANLNES VE KEELKRKKELNSRDGAWERI CGERDP 
F I LCSLMWS WVEQLKEPVI TKEDVDMLVDRRADAAEALFLLEKGQHQTI LCVLHCI VN 
LQTIPVDVEEAFLAHAI KAFTKVNFDSENGPTVYNTLKKI FKHTLEEKRKMTKDGPKP 
SL 
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SEQIDNO:175 | 2147bp 


NOV60b, 
CG98030-02 DNA Sequence 

i 
| 

} 

! 


CGTTTACGGCATGTCIATTCCTG<3AC!AC!AT(3flr RTftTTnr'a TnnrT>^r^^ rrr riTjir AC 

CTTG CAAGTATGAGAACCCAGCC CGCTG GAGTG AG CAGG AG CAAG CCATTAAGGGGGT 

TTACTCATCCTGGGTCACTGATAATATACTGGCCATGGCCCGCCCATCCTCTGAGCTC 

CTGGAGAAGTACCACATCATTGATCAGTTCCTCAGCCATGGCATAAAAACAATAATCA 

ACCTCCAGCGCCCTGGTGAGCATGCTAGCTGTGGGAACCCTCTGGAACAAGAAAGTGG 

CTT CACATACCTTCCTGAGGCTTTCATGGAGG CTGG CATTTACTTCTACAATTTCGGA 

TGGAAGGATTATGGTGTAGCGTCTCTTACTACTATCCTAGATATGGTGAAGGTGATGA 

CATTTGCCTTACAGGAAGGAAAAGTAGCTATCCATTGTCATGCAGGGCTTGGTCGAAC 

AGGTGTTTTAATAGCCTGTTACTTAGTTTTTGCAACGAGAATGACTGCTGACCAAGCA 

ATTATATTTGTGCGGGCAAAGCGACCCAATTCCATACAAACCAGAGGACAGCTCCTCT 

GTGTAAGGGAATTTACTCAGTTTCTAACTCCTCTCCGCAATATATTCTCTTGCTGTGA 

TCCCAAAGCACATGCTGTCACCTTACCTCAATATCTAATTCGCCAGCGTCATCTGCTT 

CATGGTTATGAGGCACGACTTCTGAAACACGTGCCAAAAATTATCCACCTAGTTTGCA 

AATTGCTGCTGGACTTAGCGGAGAACAGGCCAGTGATGATGAAGGATGTGTCCGAAGG 

ACCTGGTCTCTCTGCTGAAATAGAAAAGACAATGTCTGAGATGGTCACCATGCAGCTG 

GATAAAGAGTT ACTGAGGCATGACAGTGATGTGTCCAACCCG CCTAACCCCACTG CAG 

TGGCAGCAGATTTTGACAATCGAGGCATGATTTTCTCCAATGAGCAACAGTTTGACCC 

TCTTTGGAAAAGGCGGAATGTTGAGTGCCTTCAACCCCTGACTCATCTGAAAAGGCGG 

CTCAGCTACAGTGACTCAGATTTAAAGAGGGCCGAGAACCTCCTGGAGCAAGGGGAGA 

CTCCACAGACAGTGCCTGCCCAGATCTTGGTTGGCCACAAGCCCAGGCAGCAGAAGCT 

CATAAGCCATTGTTACATCCCACAGTCTCCAGAACCAGACTTACACAAGGAAGCCTTG 

GTTCGCAGCACACTTTCTTTCTGGAGTCAGTCAAAGTTTGGAGGCCTGGAAGGACTCA 

AAGATAATGGGTCIACCAATTTTCCATGGAAGGATCATTCCAAAGGAAGCACAGCAGAG 

TGGAGCTTTCTCTGCAGATGTTTCAGGCTCACACAGCCCTGGGGAGCCAGTTTCACCC 

AGCTTTGCAAATGTCCATAAGGATCCAAACCCTGCTCACCAGCAAGTGTCTCACTGTC 

AGTGTAAAACTCATGGTGTTGC^AGCCCTGGCTCTGTCAGGCAGAACAGCAGGACACC 

CCGAAGCCCTCTGGACTGTGGCTCCAGTCCCAAAGCACAGTTCTTGGTTGAACATGAA 

ACCCAGGACAGTAAAGATCTGTCTGAAGCAGCTTCACACTCTGCATTACAGTCTGAAT 

TGAGTGCTGAGGCAAGAAGAATACTGGCGGCCAAAGCCCTAGCAAATTTAAATGAATC 

TGTAGAAAAGGAGGAACTAAAAAGGAAGGTAGAAATGTGGCAGAAAGAGCTTAATTCC 

CGAG ATGG AGCTTGGGAAAG AATATGTGG CG AGAGGGACCCTTTCATCCT ATG CAGCT 

TGATGTGGTCTTGGGTGGAGCAACTGAAGGAGCCTGTAATCACCAAAGAGGATGTGGA 

CATGTTGGTTGACAGGCGAGCAGATGCCGCAGAAGCACTTT1TTTATTAGAGAAGGGA 

CAGCACCAGACT ATTCTCTG CGTGTTGCACTG CAT AG TG AACCTG CAGACAAT TC CCG 

TGGATGTGGAGGAAGCTTTCCTTGCCCATGCCATTAAGGCATTCACTAAGGTTAGTTT 
TGATTCTGAAAATGGACCAACAGTTTACAACACCCTGAAGAAAATATTTAAGCACA 
CTGG AAGAAAAAAGAAAAATGACAAAAG ATG G CCCT AAG C CTGG CCTCTAG CTTT CAC 




ORF Start: ATG at 28 


ORF Stop: TAG at 2137 




SEQIDNO: 176 


703 aa 1 MW at 78800.4kD 


NOV60b, 
CG98030-02 Protein Sequence 

j 

. 

i 


MACSMACGGRACKYENPARWSEQEQAI KGVYSS WVTDNI LAMARPSS ELLEKVH 1 1 D Q 
FLSHGIKTIINLQRPGEHASCGNPLEQESGFTYLPEAFMEAGIYFYNFGWKDYGVASL 
TT I LDMVKVMT FALQEGKVAIHCHAGLGRTGVL I ACYLVFATRMTADQAI I PVRAKRP 
NS IQTRGQLLCVREFTQFLTPLRNI FSCCDPKAHAVTXPQYLIRQRHLLHGYEARI*L»K 
HVPKI IHLVCKLIjLDLAENRPVMMKDVSEGPGLS AE IEKTMSEMVTMQLDKELLRHDS 
DVSNPPNPTAVAADFDNRGMIFSNEQ^FDPLWKRRNVECLQPLTHLKRRLSYSDSDLK 
RAENLLEQGETPQTVPAQ I LVGHKPRQQKL I SHCY I PQS PEPDLHKEALVRSTLS FWS 

3SKFGGLEGLKDNGSPIFHGRIIPKEAQQSGAFSADVSGSHSPGEPVSPSFANVHKDP 

MPAHQQVSHCQCKTHGVGSPGSVRQNSRTPRSPLDCGSSPKAQFLVEHETQDSKDLSE 

W^HSALQSELSAEARRIIAAKALANLNESVEKEELKRKVEMWQKEI^SRDGAWERIC 

SERDPFILCSI^SWVEQLKEPVITKEDVDMLVDRJ^AAEALFLLEKGQHQTILCVL 

^CIVNLQTIPVDVEEAFLAHAIKAFTKVSFDSENGPTVYNTLKKIFKHTLEEKRKMTK 
X3PKPGL 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 60B. 
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Table 60B. Comparison of NOV60a against NOV60b. 


Protein Sequence 


NOV60a Residues/ J Identities/ 
Match Residues I Similarities for the Matched Region 


NOV60b 


1 ..698 695/703(98%) 
1..703 j 697/703(98%) 



Further analysis of the NOV60a protein yielded the following properties shown in 
Table 60C. 



Table 60C Protein Sequence Properties NOV60a 


PSort analysis: 


0.9400 probability located in nucleus; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV60a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 60D. 



Table 60D. Geneseq Results for NOV60a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV60a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABG08740 


Novel human diagnostic protein 
#8731 - Homo sapiens, 476 aa. 
[WO200 1 75067-A2, 1 1 -OCT-200 1 ] 


432..662 
227..462 


229/236 (97%) 
231/236(97%) 


e-129 


ABG08740 


Novel human diagnostic protein 
#873 1 - Homo sapiens, 476 aa. 
[WO200175067-A2, 1 1-OCT-2001] 


432..662 
227..462 


229/236 (97%) 
231/236(97%) 


e-129 


AAG67632 


Amino acid sequence of a human 
protein - Homo sapiens, 447 aa. 
[WO200109316-A1, 08-FEB-2001] 


16..190 
171.335 


53/183(28%) 
79/183(42%) 


6e-10 


AAG67453 


Amino acid sequence of a human 
polypeptide - Homo sapiens, 447 aa. 
[WO200109345-A1, 08-FEB-2001] 


16.. 190 
171..335 


53/183(28%) 
79/183 (42%) 


6e-10 


AAU75364 


Human dual specificity phosphatase 
CDC14A L572P variant - Homo 
sapiens, 594 aa. [US6331614-B1, 
18-DEC-2001] 


91..177 
232..316 


32/87(36%) 
46/87(52%) 


3e-07 
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In a BLAST search of public sequence datbases, the NOV60a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 60E. 



Table 60E. Public BLASTP Results for NOV60a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV60a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
tne Matcnea 
Portion 


Expect 
value 


Q8WX19 


BA490F3.2.1 (NOVEL PROTEIN 
ISOFORM 1) - Homo sapiens 
(Human), 698 aa. 


L.698 
1..698 


696/698 (99%) 
697/698 (99%) 


0.0 


Q9N091 


UNNAMED PROTEIN PRODUCT 
- Macaca fascicularis (Crab eating 
macaque) (Cynomolgus monkey), 
683 aa. 


I. .682 
1..682 


654/682 (95%) 
663/682 (96%) 


0.0 


Q8WX18 


BA490F3.2.2 (NOVEL PROTEIN 
ISOFORM 2) - Homo sapiens 
(Human), 112 aa. 


587..698 
1..112 


112/112(100%) 
112/112(100%) 


le-59 


Q9UAX0 


T12B3.1 PROTEIN - 
Caenorhabditis elegans, 446 aa. 


3..246 
42..281 


92/244 (37%) 
147/244 (59%) 


2e-44 


060730 

I 


CDC14B3 PHOSPHATASE - 
Homo sapiens (Human), 471 aa. 


16.. 190 
201. .365 


54/183 (29%) 
81/183 (43%) 


le-10 



PFam analysis predicts that the NOV60a protein contains the domains shown in 
the Table 60F. 



Table 60F. Domain Analysis of NOV60a 


Pfam Domain 


NOV60a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


DSPc 


38.. 192 


34/181 (19%) 
98/181 (54%) 


0.001 1 


Y_phosphatase 


25..192 


34/290(12%) 
105/290(36%) 


0.0046 



Example 61. 



The NOV61 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 61 A. 
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Table 61A. NOV61 Sequence Analysis 




SEQIDNO: 177 


4172 bp 


NOV61a, 
CG98061-01 DN A Sequence 

i 

; 

! 
1 

j 

1 

1 

i • 
i 

i 

i 
<; 

\ 

| 

j 
1 

2 


ATGG CGGCGGCGGGAG CCCTGG AACGGAGCTTCGTGGAGCTAAGCGGAGCTGAGCGCG 

AAAGGCCGAGGCACTTTCGGGAATTCACAGTCTGCAGCATTGGGACTGCAAATGCCGT 

GGCTGGCGCCGTAAAATACAGTGAAAGCGCGGGAGGCTTTTACTACGTGGAGAGTGGC 

AAGTTGTTCTCCGTAACCAGAAACAGGTTCATTCATTGGAAGACCTCTGGAGATACAT 

TGGAGCTGATGGAGGAGTCACTGGACATAAATCTGTTGAATAATGCCATTCGCCTAAA 

ATTCCAAAATTGCAGTGTTTTACCTGGAGGGGTTTATGTCTCTGAGACTCAGAATCGT 

GTGATAATCTTGATGTTAACCAATCAAACAGTGCACAGGTTACTTTTACCACACCCCT 

CCCGGATGTATAGGAGTGAGTTGGTAGTTGACAGTCAGATGCAGTCAATATTCACTGA 

CATTGGAAAAGTTGATTTCACAGATCCTTGCAACTATCAGTTAATTCCAGCAGTACCT 

GGAATATCTCCTAATTCCACCGCCTCTACAGCCTGGCTCAGCAGTGATGGGGAGGCCC 

TGTTTGCCTTACCATGTGCTTCTGGGGGAATCTTTGTTCTTAAGCTACCTCCTTATGA 

CATACCTGGTATGGTGTCAGTCGTGGAACTGAAACAGAGTTCAGTAATGCAACGATTG 

CTTACAGG CTGGATGCCAACAGCTATCAGGGGTGACCAGTCG CCTTCAGATCGTCCCC 

TCAGTCTTGCTGTTCATTGTGTGGAGCATGATGCCTTCATCTTTGCTTTGTGTCAGGA 

TCATAAACTACGAATGTGGTCTTACAAGGAGCAAATGTGCCTAATGGTAGCTGACATG 

CTGGAGTATGTCCCTGTGAAGAAAGACCTTCGGCTTACTGCTGGAACTGGACACAAAT 

TACGGCTTGCTTATTCCCCCACCATGGGACTCTACCTGGGGATATACATGCATGCACC 

AAAACGAGGAC^GTTCTGCATTTTCCAGTTGGTGAGCACTGAGAGTAATCGCTATAGT 

CTCGATCATATTTCTTCACTGTTCACTTCTCAGGAGACACTGATTGACTTTGCCTTAA 

CTTCCACGGATATCTGGGCCCTGTGGCATGATGCTGAGAACCAAACAGTAGTGAAATA 

CATCAACTTTGAACATAATGTTGCAGGTCAGTGGAATCCAGTTTTTATGCAGCCTCTG 

CCAGAGGAAGAGATTGTCATCAGAGATGATCAAGACCCCAGAGAGATGTATCTGCAAA 

GTCTTTTTACACCAGGACAATTCACAAATGAAGCTTTATGTAAGGCTTTACAGATTTT 

CTGCCGAGGAACTGAGAGGAATTTGGATCTTTCCTGGAGTGAACTGAAGAAAGAAGTT 

ACTTTAGCTGTTGAAAATGAGCTTCAAGGAAGTGTAACAGAGTATGAATTCTCCCAGG 

AGGAGTTTCGAAATTTACAACAAGAATTCTGGTGCAAGTTCTATGCCTGTTGTCTTCA 

GT ATCAAG AAG CC CT CT CT CACC C T CTTG CC CTAC ATTTGAATCC ACACACAAACATG 

GTGTGCCTGCTGAAAAAAGGGTACCTGTCTTTCCTTATTCCCTCATCCTTAGTGGATC 

ATTTGTATCTCCTGCCTTATGAGAACCTTTTGACAGAAGATGAGACAACCATATCTGA 

TGATGTGGATATCGCTCGGGATGTCATATGTCTTATAAAATGCCTCCGGCTGATTGAA 

GAGTCAGTAACTGTGGATATGTCAGTTATAATGGAAATGAGTTGTTATAACCTACAGT 

CTCCGGAAAAGGCTGCAGAGCAGATTCTGGAAGATATGATCACTATTGATGTAGAAAA 

TGTGATGGAGGATATTTGTAGTAAACTGCAAGAGATTAGGAACCCAATCCATGCAATT 

GGACTACTTATACGGGAAATGGATTATGAAACAGAAGTGGAAATGGAAAAGGGATTCA 

AT C CAG CTCAGCCTTTG AATATTCG AATGAAT CTTACCCAG CTCTATGGTAG T AACAC 

AGCAGGGTATATTGTGTGCTIGAGGGGTGC^TAAAATCGCCAGTACTCGTTTCCTGATC 

TGCAGAG ATCTTTTG ATCTTACAG CAGCTGTTAATG AGGCTTGG AGATGCTG TGATTT 

GGGGAACTGGTCAGCTCTTTCAAGCTCAGCAAGACCTACTACATCGAACAGCrrCCCCT 

ACTCTTATCTTATTACCTCATTAAATGGGGAAGTG AGTGCTTGG CAACTGATGTT CCA 

CTTGACACACTGGAGTCTAATCTCCAACACrrTATCAGTACTCG^ 

GTG CTTTAATGG CAAATAGGTTTGT ATCTAGT CCT C AGACTATTG TGG AGTTATT CTT 

CCAAG AAGTTG CAAGAAAACACATT ATAT CT CAC CTCTTCTCTCAG C CAAAGGCACCT 

CTGAGCCAAACTGGATTGAATTGGCCTGAAATGATTACTGCAATTACCAGTTATTTAT 

TGCAGCJTTTTATGGCCTAGCAATCCTGGTTGTCn'CTTTCTAGAATGT 

TTGCCAATATGTACAATTGCAGGATTATATTCAA 

AATGTTGGTTCCTGTCGATTTATGCTGGGAAGGTGTTACCrrAGTTACAGGAGAAGGAC 
AGAAGGCTCTGGAATGTTTTTGTCAGGCAGC&TCTGAAGTA^ 

GGATCGCTTGATTCGCTC^GAGGATGGGGAGATCGTGTCTACCCCCAGGCTGCAGTAT 

TATGACAAGGTTTTACGACTACTAGATGTCATTGGTTTGCCTGAACTGGTTATTCAGT 

TGGCTACATCAGCCATAACTGAAGC^GTGATGACTGGAAAAGTCAGGCTACT 

G ACATGTATTTTC AAACATCATTTG G ATTTGGGTCACAATAG CCAAGCATATG AAGCC 

TT AAC CCAAATT CCTGATTCCAG CAGGCAATTAGATTGTTTACGG CAGTTGGTGGTAG 

TTCTTTGTGAACG CTCACAG CTACAGG ATCTTGTAG AGTTT CCCTATGTG AATCTGCA 

TAATGAGGTTGTGGGAATAAITGAGTCACGTGCTAGAGCTGTGGACCTTATGACTCAC 

AATTACTATGAACTTCTGTATGCOT 

GCACAGTGATGTTTGAGTATGGAATGCGGCTTt^C^GAGAAGTTCGAACTCTCCGGGG 
ACTTG AGAAACAAGGCAACTGTTATCTGG CTGCTCTCAATTGTTTACGACTTATT CGT 
CCAGAATATGCGTG GATTGTG CAG CCAGTGTCTGGTGCAGTGTATG ATCG CCCTGGAG 
CATCCCCTAAGAGG AATCATGATGGAG AATG CACAGCTG CC CCCACAAATCG AC AAAT 
TOAAATCCTGGAACTGGAAGATCTGGAGAAAGAGTGTTCCTTGGCTCGCATCC^ 
ACTTTGGCTCAGCATG ATCCATCAG CGGTTGCAGTTG CTGG AAGTTCATCAG CAGAGG 
&AATGGTCACTCTCTTCGTTCAGGCGGGC 

SACTTTTAAGCTTCCCrTAACGCCAGTCTTTGAAGGGOTGCCrT 
rTGCAATTTGG AGGAGAGGCAG CACAAG CAGAAG C CTGGGCCTGG CT AGCAGCCAATC 
\GCT CTCATCTGTCATCACTACTAAGGAGTCTAGTG CTACAG ATG AAG CATGG CG ACT 
^TTATCCACTTACCTGGAGAGGTACAAAGTCCAGAA 

^T(^AAC^^GCTCTTGTCTCATGGAGTGC CTCTGCCTAATTGG CTTATAAACAGTTACA 
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AGAAGGTTGATGCTGCTCAATTGOT 

AGCTGTC^TTTGGTGTCAGAATATGTGGATGCTGTATTGGGAAAAGGACATCMTAC 
TTCGGAATTGAGGTAGGCATATACAAATGGTCCCCAATTGGGAGAGACAGTGCCAACA 
GT(^CAACATCGCACTGTCCCAGAAAATACTTGACAAATTGGAGGACTACCAGCAAAA 
AGTTGATAAGGCAACACGGGATTTATTATATCGTCGGACCTTGTGATTTGGATT 




ORF Start: ATG at 1 ORF Stop: TGA at 4162 




SEQIDNO: 178 1387 aa MW at 157181.6kD 


f NOV61a, 
ICG9806I-01 Protein Sequence 

1 

i 


MAAAGALERS FVELS GAERERPRHFRE FTVCS IGTANAVAG AVKYS ES AGGFYYVES G 
KLFSVTRNRFIHWKTSGDTLELMEESLDINLLNNAIRLKFQNCSVLPGGVYVSETQNR 
VIILMLTNQTVHRLLLPHPSRMYRSELVVDSQMQSIFTDIGKVDFTDPCNYQLIPAVP 
GISPNSTASTAWLSSDGEALFALPCASGGIFVLKLPPYDIPGMVSVVELKQSSVMQRL 
LTGWMPTAIRGDQS PSDRPLSLAVHCVEHDAFI FALCQDHKLRMWSYKEQMCLMVADM 
LE YVPVKKDLRLTAGTGHKLRLAYS PTMGL YLG I YMHAPKRGQ FC I FQLVSTE SNRYS 
LDHI SSLFTSQETLIDFALTSTDIWALWHDAENQTWKYI NFEHNVAGQWN PVFMQPL 
PEEEIVIRDDQDPREMYLQSLFTPGQFTNEALCKALQIFCRGTERNLDLSWSELKKEV 
TLAVF^NELQGSVTEYEFSQEEFRNLQQEFWCKFYACCLQYQEALSHPljALHIiNPHTNM 
VCLLKKGYLSFLI PSSLVDHLYLLPYENLLTEDETTISDDVDIARDVICLI KCLRLIE 
ESVTVDMSVIMEMSCYNLQSPEKAAEQILEDMITIDVENVMEDICSKLQEIRNPIHAI 
GLLIREMDYETE VEME KGFN PAQ PLN I RMNLTQLYG SNTAGYI VCRGVHKI ASTRFLI 
CRDLL I LQQLLMRLGD AVI WGTGQLFQAQQDLLHRTAPLLIjS YYL I KWGS ECLATD VP 
LDTLESNLQHLSVLELTDSGALMANRFVSSPQTIVELFFQEVARKHIISHLFSQPKAP 
LSQTGLNWPEMITAITSYLLQLLWPSNPGCLFLECXMGNCQYVQLQDYIQLLHPWCQV 
NVGSCRFMLGRCYLVTGEGQKALECFCQAASEVGKEEFLDRLIRSPnrjPTVCTtJDT nv 
YD KVLRLLDV I GLPE LVI QLATS A I TEASDDWKS QATLRTCI FKHHLDLGHNS QAYEA 

LTQIPDSSRQLDCLRQLVVVLCERSQLQDLVEFPYWLHNFAA^GIIESRARAVDLMTH 
NYYELLYAFHIYRHNYRKAGTVMFEYGMRLGREVRTL^ 

PEYAWIVQPVSGAVYDRPGASPKRNHDGECTAAPTNRQIEILELEDLEKECSLARIRL 
TLAQHDPSAVAVAGSS SAEEMVTLLVQAGLFDTAI SLCQTFKLPLTPVFEGLAFKC I K 
LQFGGEAAQAEAWAWLAANQLSSVITTKESSATDEAWRLLSTYLERYKVQNNLYHHCV 
INKliLSHGVPLPNWLINSYKKVDAAELLRLYLNYDLLEEAVDLVSEYVDAVIjGKGHQY 
FGIEVGIYKWSPIGRDSANSHNIALSQKILDKLEDYQQKVDKATRDLLYRRTL 




SEQIDNO: 179 5264 bp 


NOV6Ib, 
CG98061-02 DNA Sequence 

< 
( 
< 
< 

« 
< 
( 
i 

i 

c 


GCTCTTCrGGATTCCTCGCTGCCCACCCTCATTCTCCCGGTGGAAGCCTCGCAACGAA 

GGGCCGGAGCCGCCTTTCTGCTCCCGGAATGCTTCACCTGTCCGCAGCTCCGCCCGCC 

CCACCCCCGGAAGTGACGGCGACCGCGCGGCCTGCCTTTGTTCCGTTGGGCGTCGCGG 
CGACGGCGGGAAGATGGCGGCC^rGGnanrrrT^ nA /- nn ^ rTTCCTCCACCT ^ cc 

GGAGCTGAGCGCGAAAGGCCGAGGCACTTTCGGGAATTCACAGTCTGCAGCATTGGGA 

CTGCAAATGCCGTGGCTGGCGCCGTAAAATACAGTGAAAGCGCGGGAGGCTTTTACTA 

CGTGGAGAGTGGCAAGTTGTTCrCan-AACCAGAAACAGGTTCATTCATTGGAAGACC 

TCTGGAGATACATTGGAGCTGATGGAGGAGTCACTGGACATAAATCTGTTGAATAATG 

CCATTCGCCTAAAATTCCAAAATTGCAGTGTTTTACCTGGAGGGGTTTATGTCTCTGA 

GACTC^GAATCGTGTGATAATCTTGATGTTAACCAATCAAACAGTGCACAGGTTACTT 

TTACCACACCCCTCCCGGATGTATAGGAGTGAGTTGGTAGTTGACAGTCAGATGCAGT 

CAATATTCACTG ACATTGGAAAAG TTGATTTCACAG AT CCTTG CAACT ATCAGTTAAT 

TCCAGCAGTACCTGGAATATCTCCTAATTCCACCGCCTCTACAGCCTGGCTCAGCAGT 

GATGGGGAGGCCCTGTTTGCCTTACCATGTGCTTCTGGGGGAATCTTTGTTCTTAAGC 

TACCTCCTTATGACATACCTGGTATGGTGTCAGTCGTGGAACTGAAACAGAG^ 

AATGCAACGATTGCTTACAGGCTGGATGCC^ACAGCTATCAGGGGTGACCAGTCGCCT 

TCAGATCGTCCCCTCAGTCTTGCTGTTCATTGTGTGGAGCATGATGCCTTCATCTTTG 

CTTTGTGTCAGGATCATAAACTACGAATGTGGTCTTACAAGGAGCAAATGTGCCTAAT 

GGTAGCTGACATGCTGGAGTATGTCCCTGTGAAGAAAGACCTTCGGCTTACTGCTGGA 

ACTGGAC^CAAATTACGGCTTGCTTATTCCCCCACCJITGGGACTCTACCTGGGGATAT 

ACATGC^TGCACCAAAACGAGGACAGTTCTGCATTl^CCAGTTGGTGAGCACT 

T AATCG CTATAGTCTCGAT CAT ATTTCTT CT^CTGTTCACTT CTCAGGAG ACACTG A^ 

GACTTTGCCTTAACTTCCACGGA^ 

CT^GTAGTGAAATACATCAACTTTGAACATAATGTTGCAGGTCAGTGGAATCCAGTT^ 

rATGCAGCCTCTGCXIAGAGGAAGAGATTGTC^TCAGAGATCATCAAGACCCC^ 

^TGTATCTG CAAAGTCTTTTTACAC CAGG ACAATTCACAAATGAAGCTTTATGTAAGG 

ZTTTACAGATTTTCTGCCGAGGAACTGAGAGGAATTTGGATCTTTCCTGG^ 

3AAGAAAGAAG1TACTTTAGCTGTTGAAAATGAGCTTCAAGGAAGTCT 

JMA1 iK - 1 e UAO^ A(jG AGTTT CG AAATTT ACAACAAGAATT CTGGTG CAAGTTCTATG 

rCTGTTGTCTTC^GTATCAAGAAGCCCTCTCTCACCCTCTTGCCCT 

\CACACAAACATGGTGTGCCTGCTGAAAAAAGGGTACCTGTCTTTCCTTATTCCCT 

rCCTTAGTGGATCATTTGTATCTCCTGCCTTATGAGAACCTTTTGACAGAAGATGAGA 

^CCATATCTGATGATGTGGATATCGCTCGGGATGTC^TATGTCTTATAAAATGCCT 

XGGCTGATTGAAGAGTCAGTAACTGTGGATATGTCAGTTATAATGGAAATGAGTTGT 

rATAACCTACAGTCTCCGGAAAAGGCTGCAGAGCAGATTCTGGAAGATATGATCACTA 

ITGATGTAGAAAATGTGATG GAGGATATTTGTAGTAAACTG CAAGAGATTAGGAACCC 

iATC(^TGCAATTGGACTACTTATACGGGAAATGGATTATGAAACAG^ 

5AAAAGGGATTCAATCCAGCTO^CCITrGAA 
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ATGGTAGTAACACAGCAGGGTATATTGTGTGCAGAGGGGTGCATAAAATCGCCAGTAC 
TCXSTTrCCTGATCTGCAGAGATCrriTGATCTTACAG CAG CTGTT AATGAGGCTTGGA 
GATGCTGTG ATTTGGGGAACTGGTCAG CTCTTTCAAG CTCAG CAAGACCT ACTACATC 
GAACAGCTCCCCTACTCTTATCTTATTACCTCATTAAATGGGGAAGTGAGTGCTTGGC 
AACTGATGTTCCACTTGACACACTGGAGTCTAATCTCCAACACTTATCAGTACTGGAA 
TT AACAG ACTCTGGTGCTTTAATGG CAAATAGGTTTG TAT CTAGTCCTCAGACTATTG 
TGGAGTTATTCTTCCAAGAAGTTGCAAGAAAACACATTATATCTCACCTCTTCTCTCA 
GCCAAAGGCACCTCTGAGCCAAACTGGATTGAATTGGCCTGAAATGATTACTGCAATT 
ACCAGTTATTTATTGCAGCTTTTATGGCCTAGCAATCCTGGT^GTCTCTTTCTAGAAT 
GTTTGATGGGAAATTGCCAATATGTACAATTGCAGGATTATATTCAACTGCTACATCC 
CTGGTGTCAAGTCAATGTTGGTTCCTGTCGATTTATGCTGGGAAGGTGTTACCTAGTT 
AGAGGAGAAGGACAGAAGGCTCTGGAATGTTTTTGTCAGGCAGCATCTGAAGTAGGCA 
AAGAGGAATTCTTGGATCGCTTGATTCGCTCAGAGGATGGGGAGATCGTGTCTACCCC 
CAGGCTGCAGTATTATGACAAGGTTTTACGACTACTAGATGTCATTGGTTTGCCTGAA 
CTGGTTATTCAGTTGGCTACATCAGCCATAACTGAAGCAAGTGATGACTGGAAAAGTC 
AGGCTACTCTAAGGACATGTATTTTCAAACATCATTTGGATTTGGGTCACAATAGCCA 
AGCATATGAAGCCTTAACCCAAATTCCTGATTCCAGCAGGCAATTAGATTGTTTACGG 
CAGTTGGTGGTAGTTCTTTGTGAACGCTCACAGCTACAGGATCTrGTAGAGTTTCCCT 
ATGTGAATCTGCATAATGAGGTTGTGGGAATAATTGAGTCACGTGCTAGAGCTGTGGA 
CCTTATGACTCACAATTACTATGAACTTCTGTATGCCTTTCACATCTATCGCCACAAT 
TACCGCAAGGCTGGCACAGTGATGTTTGAGTATGGAATGCGGCTTGGCAGAGAAGTTC 
GAACTCTCCGGGGACTTGAGAAACAAGGCAACTGTTATCTGGCTGCTCTCAATTGTTT 
j ACGACTTATTCGTCCAGAATATGCGTGGATTGTGCAGCCAGTGTCTGGTGCAGTGTAT 
j GATCGCCCTGG AG CATCCCCTAAGAGG AATCATGATGGAGAATGCACAG CTGCCCCCA 

! CIAAATCGACAAATTGAAATCCTGGAACTGGAAGATCTGGAGAAAGAGTGTTCCTTGGC 
;! TCGCATCCGCCTCACTTTGGCTCAGCATGATCCATCAGCGGTTGCAGTTGCTGGAAGT 
j TCATCAGCAGAGGAAATGGTCACTCTCTTGGTTCAGGCGGGCCTCTTTGACACTGCCA 
j TATCACTCTGTCAGACTTTTAAGCTTCCCTTAACGCCAGTCTTTGAAGGGCTTGCCTT 
j CAAATG CATCAAATTGCAATTTGGAGGAGAGGCAGC ACAAG CAG AAGCCTGGGCCTGG 

■ CTAGCAGCCAATCAGCTCTCATCTGTCATCACTACTAAGGAGTCTAGTGGTACAGATG 
I AAG CATGGCGACTATTAT CCACTTACCTGGAGAGGTACAAAGTCCAGAATAACTTGTA 

I TCACCACTGTGTAATCAACAAGCTCTTGTCTCATGGAGTGCCTCTGCCTAATTGGCTT 
j ATAAACAGTTACAAGAAGGTTGATGCTGCTGAATTGCTTCGTTTATACTTAAACTATG 

ACCTTTTAGAAGAAGCTGTGGATTTGGTGTCAGAATATGTGGATGCTGTATTGGGAAA 
! AGGACATCAATACTTCGGAATTGAGTTTCCACTGTCCGCAACAGCCCCAATGGTGTGG 

CTTCCATAGTCCTCTATTGATCAGCTTCTCCAAGCTCTGGGAGAGAACAGTGCCAACA 
! GTCACAACATCGCACTGTCCCAGAAAATACTTGACAAATTGGAGGACTACCAGCAAAA 
) AGTTGATAAGGCAACACGGGATTTATTATATCGTCGGACCTTGTG ATTTGGATTGTCA 
, CCTAGCCTTTGTAACCGCTTGGTGCCTCTTAGGACTTAAGACTACCCTACAGGAACCC 

TGTACTC^GGCCGATTTTTGTAACTGTAAATGATGTGTACAACATTCAAGTCTGCAT 
TCTGCACAAG ATAGGAGGG CGGAAGAGTCAG AGGACCCTGTGCTTGCTGGTGGTG CTA 
AC ACAATTTCTGGTGTT C AACCTTGGT CTCAAATAGCTG CTTTTGTAT ATGATTCACG 
AGCT TTTTT AG AGTTT AT ATTTTTTT AAACTACCGAAG ACATT CATTAT CTGCAAATT 
AAGACTCACCTTCACTTTCCAAAATAGCTGAGGGTrGTTGGCTTGTTGTAGCTGACCA 
CC AAAAG CAG T CACTG C AAAT CTTTTAATT CTTCCCTATCACCTTTTGTATTTT AATG 
CAATTAT TTTGGT C CAG AACTG ACCTGTATTTTCTGTATTGT ACACZAAAAGCTAATAA 
TTTTGTGTACTTTTTATTTATTTTGGAGGTTTTA^ 

TAATTTGCCTAGATTAAGCCTAAAATGATGACCAGCTAATTAAAGAAGATATTTTGA^ 

TCTGGTTCnSAGCTAAAGTTGAGTAAATTCTTAGCTAAGAAAAAATTGGAAATCCATC 

ATCTATATTAGCAACAGATTCTCAGAGTAAATTGTTAACTTCTATGATTTATGATAAT 

C!AAG CTGG ACTTG ATC!ATACAAGTT AGTCTGATAATGTATTGGAC 

TC^TTGGTCAGATTTAGAAGGATTCATGCTGACAAGTTT 

AAAATCATCTTGGATTTTATTCTGTATATTAAAATTTATCTTTT 
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ORF Start: ATG at 188 


ORF Stop: TGA at 4394 




SEQIDNO: 180 


1402 aa 


MWat 158753.4kD 


NOV61b, 
CG98061-02 Protein Sequence 

i 
i 

> 


MAAAGALERSFVELSGAERERPRHFREFTVCSIGTANAVAGAVKYSESAGGFYYVESG 
KLFSVTRNRFIHWKTSGDTLELMEESLDINLLNNAIRLKFQNCSVLPGGVYVSETQNR 
VIILMLTNQTVHRLLLPHPSRMYRSELVVDSQMQSIFTDIGKVDFTDPCNYQLIPAVP 
GIS PNSTASTAWLS SDGEALFALPCASGG I FVLKLP P YD I PGMVS WELKQS SVMQRL 
LTGWMPTAIRGDQS PSDRPLSLAVHCVEHDAFI FALCQDHKLRMWSYKEQMCLMVADM 
LEYVP VKKDLRLTAGTGHKLRLAYS PTMGLYLG I YMHAPKRGQFC I FQLVSTESNRYS 
LDHISSLFTSQETLIDFALTSTDIWALWHDAENQTWKYINFEHNVAGQWNPVFMQPL 
P EE E I V I RDDQDPR EMYLQSLFT PGQFTNEALCKALQ I FCRGTERNLDLS WS ELKKEV 
TLAVENELQGSVTEYEFSQEEFRNLQQEFWCKF^ACCLQYQEALSHPLALHLNPHTNM 
v\.ubftAbiiJoe JjXirooijViJnl»xijJjFiENLiljTEDEi I ISDDVDIARDVICLIKCIjRLIE 

e s vtvdms vi mems cynl qs pekaaeqi ledm i t i dvenvmed ics klqe i rnp i ha i 
glliremdyetevemekgfnpaqplnirmnltqlygsntagyivcrgvhkiastrfli 

CRDLLILQQLLMRLGDAVIWGTGQLFQAQQDLLHRTAPLLLSYYLIKWGSECLATDVP 

LDTLESNLQHLS VL ELTDS GALMANRFVS S PQT I VELFFQE VARKH 1 1 SHLFSQPKAP 

LSQTGLNWPEMITAITSYLLQLLWPSNPGCLFLECLMGNCQYVQLQDYIQLLHPWCQV 

NVGSCRFMLGRCYLVTGEGQKALECFCQAASEVGKEEFLDRLIRSEDGEIVSTPRLQY 

YDKVLRLLDVI GLPELVIQLATS AITEASDDWKSQATLRTCI FKHHLDLGHNSQAYEA 

LTQIPDSSRQLDCLRQLVWLCERSQLQDLVEFPYVNLHNEWGIIESRARAVDLMTH 

m^ELLYAFHIYRHNYRKAGTVMFEYGMRLGREVRTIJlGLEKQGNCYIJ^ 

PEYAWIVQPVSGAVYDRPGASPKRNHDGECTAAPTNRQIEILELEDLEKECSIjARIRL 

TIiAQHD PSAVAVAGS S S AE EMVTLLVQAGL FDTAI S LCQTFKLPLT PVFEGL AFKC 1 K 

LQFGGFJ^QAEAWAWLAANQLSSVITTKESSATDEAWRLLSTYLERYKVQNNLYHHCV 

INKLLSHGVPLPNWLINSYKKVDAAELLRLYLNYDLLEEAVDLVSEYVDAVLGKGHQY 

FGIEFPLSATAPMVWLPYSSIDQLLQALGEl^SANSHNIALSQKILDKLEDYQQKVDKA 

TRDLLYRRTL 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 6 IB. 



Table 61B. Comparison of NOV61a against NOV61b- 


Protein Sequence 


NOV6 la Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV61b 


1..1387 
1..1402 


1308/1402(93%) 
1312/1402 (93%) 



Further analysis of the NOV61a protein yielded the following properties shown in 
5 Table 61G. 



Table 61C. Protein Sequence Properties NOV61a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in Iysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV61a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 61 D. 



Table 61D. Geneseq Results for NOV61a 



| Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV61a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 




Human polypeptide SEQ ID NO 
5825 - Homo sapiens, 1401 aa. 
[WO2001 533 1 2-A 1 , 26-JUL-2001 ] 


1..1314 
75..1389 


1308/1315 
(99%) 

1311/1315 
(99%) 


0.0 


AAM39I08 


Human nnlvnpntirlp ^FO IH NO 

2253 - Homo sapiens, 13 16 aa. 
[WO200153312-A1, 26-JUL-2001] 


t .. 1 jU / 

1..1312 


(99%) . 
1301/1312 
(99%) 


A A 


j ABB60919 

! ■ 

j 


Drosophila melanogaster polypeptide 
SEQ ID NO 9549 - Drosophila 
melanogaster, 141 1 aa. 
[WO200171042-A2, 27-SEP-2001] 


43..1335 
41..1333 


389/1342 (28%) 
638/1342 (46%) 


e-143 


! AAB38261 


Gene 13 human secreted protein 
homologous amino acid sequence 
#1 17 - Homo sapiens, 58 aa. 
[WO200058469-A1, 05-OCT-2000] 


471. .528 
1.-58 


58/58 (100%) 
58/58 (100%) 


9e-29 


AAB38262 


Human secreted protein sequence 
encoded by gene 13 SEQ ID NO: 1 1 8 

- Homo sapiens, 58 aa. 
[WO200058469-A1, 05-OCT-2000] 


471. .528 
1..58 


57/58 (98%) 
58/58 (99%) 


2e-28 



In a BLAST search of public sequence datbases, the NOV61a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 6 IE. 
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Table 61E. Public BLASTP Results for NOV61a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV61a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q 12769 

i 
i 


Protein KIAAO 197 - Homo sapiens 
(Human), 1314 aa (fragment). 


3..1307 
1..1310 


1298/1310 
(99%) 

1^00/1310 
(99%) 


0.0 


Q9Z0W3 

i 


Protein KIAA0197 (GTL-13) - 
Mus musculus (Mouse), 1402 aa. 


1 -.1387 
1..1402 


1974/ 1 4A9 
(90%) 

1332/1402 
(94%) 




| Q9VKJ3 


Protein KIAAO 1 97 homolog - 
Drosophila melanogaster (Fruit 
fly), 1411 aa. 


43..1335 
41..1333 


389/1342(28%) 
638/1342(46%) 


e-143 


| Q9CZD9 


281001 1M03RIK PROTEIN - Mus 
musculus (Mouse), 215 aa. 


1181..1380 
1..215 


180/215(83%) 
187/215(86%) 


le-96 


Q96GB3 


KIAAO 197 PROTEIN - Homo 
sapiens (Human), 190 aa. 


I ..141 
1..141 


141/141 (100%) 
141/141 (100%) 


5e-75 



PFam analysis predicts that the NOV6 la protein contains the domains shown in 
the Table 61R 



Table 61F. Domain Analysis of NOV61a 



Pfam Domain 



NOV61a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 62. 

The NOV62 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 62A. 
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Table 62A. NOV62 Sequence Analysis 



SEQ ID NO: 18! 



5544 bp 



NOV62a, 
CG98071-01 DNA Sequence 



GCGGGACTC^GAGTAGCCTTCCTCGAGGACCTGCCTTTCCCATTTGCTGCCTGAAGT 
TAATGTTTCTTGCTGGCCAAATCAGGGACATGCCGGCATTAGCGGGATGAGTGGGTGT 
TCCGGg^GGGATGTGGTCATTGACGGCCAGTGAGGGCGAGAGTACCACGGCCCACTTC 



TTCCTTGG AGCTGG AGATGAGGGGCTGGG CACCCGTGG AATAGGCATGAGG CCAGAAG 
AGAGTGACAGCGAGCTCCTTGAGGATGAGGAGGATGAAGTGCCTCCTGAACCTCAGAT 
CATTGTTGGCATCTGTGCCATGACCAAGAAATCCAAGTCCAAG CCAATGACTCAAAT C 
CTAGAGCGACTCTGCAGATTTGACTACCTGACTGTTGTCATTCTGGGAGAAGATGTAA 
TCCTTAATGAACCTGTGGAAAACTGGCCATCCTGCCACTG CCT CATCTCTTTCCACTC 
CAAAGGCTTTCCTCTGGACAAAGCTGTTGCTTACTCCAAGCTTCGAAACCCCTTTCTT 
ATCAATGATCTGGCCATGCAGTATTACATCCAAGATAGGAGGGAGGTGTACCGGATCC 
TGCAGGAAGAGGGTATTGATCTGCCTCGATATGCTGTGCTCAACCGTGATCCTGCCCG 
GCCTGAGGAATGCAACCTGATAGAAGGTGAAGACCAAGTAGAGGTCAATGGAGCTGTC 
TTTCCCAAGCCCTTTGTGGAGAAGCCAGTGAGTGCAGAAGACCACAATGTTTACATCT 
ACTACCCCAGCTCAGCTGGAGGAGGAAGCCAGCGTCTCTTTCGTAAGATTGGCAGCCG 
AAGCAGTGTTTACTCTCCTGAGAGCAGCGTCCGAAAGACGGGGTCGTACATCTATGAG 
GAGTTTATGCCAACAGATGGCACAGATGTCAAGGTGTATACAGTGGGGCCAGATTATG 
CCCATG CTGAAG CTAGAAAATCTCCAGCTTTGGATGGGAAGGTTG AACG AGACAGTG A 
GGGGAAAGAGATTCGATATCCAGTCATGCTGACTGCCATGGAAAAGCTGGTGGCCAGG 
AAAG T CTG CGT AG CTTT C AAG CAAAC AGT TTGTGG ATTTG AC CTT CTTCGTG CCAATG 
GTCATTCCTTTG TGTGTG ATGTCAATGG CTTTAGTTTTGTCAAG AACTCGATG AAATA 
CTACGATG ACTGTG CCAAGATT CTGGGGAACACCATAATG CGGG AGCTTGCCCCACAG 
TTCCAGATTCCATGGTCCATCCCCACGGAGGCTGAGGACATTCCCATTGTTCCCACCA 
CATCTGGCACTATGATGGAACTTCGTTGTGTCATTGCAATTATTCGTCATGGGGATCG 
TACTCCCAAGCAGAAGATGAAGATGGAAGTGAAACACCCAAGGTTTTTTGCTCTGTTT 
GAAAAACATGGTGGCTACAAGACAGGGAAATTAAAACTCAAGCGACCTGAGCAGCTCC 

TGAGATCGAGGAGAAGACTGGAAAACTAGAGCAGCTGAAGTCTGTACTGGAGATGTAT 

GGTCACTTCTCAGGTATAAACCGGAAGGTACAATTGACTTACTACCCTCATGGAGTAA 

AAG CTTCTAATG AGGGG CAAG ATC CACAG AGGGAAACTCTGG CC CCATCTCTGTTGCT 

GGTACTGAAGTGGGGTGGAGAACTGACTCCTGCTGGCCGTGTTCAGGCTGAGGAGCTG 

GGGCGAGCTTTTCGCTGCATGTACCCTGGAGGACAGGGTGACTATGCTGGCTTCCCTG 

GTTGTGGGCTGCTTCGTCTCCATAGCACTTTCCGCCACGATCTCAAGATCTATGCCTC 

TGATGAGGGTCGTGTTCAGATGACTGCTGCTGCCTTCGCCAAGGGCCTTCTGGCTCTA 

GAAGGGGAGCTGACACCCATTTTGGTGCAAATGGTGAAGAGTGCCAACATGAATGGGC 

TACTGGACAGCGATGGGGATTCCTTGAGCAGCTGCCAGCACCGGGTGAAGGCTCGGCT 

G CACCATATTCT ACAGCAGGATGCACC CT TTGG CCCTGAGGACTACGATCAG CTGG CT 

CCCACC^GAAGTACTTCCCTGCTCAACTCCATGACTATCATCCAGAATCCTGTGAAGG 

TCTGTGATCAGGTATTTGCCCTGATCGAAAACCTCACCCACCAGATCCGGGAACGAAT 

GCAGGACCCCAGGTCTGTAGACCTGCAGCTCTACCACAGTGAGACACTAGAGCTAATG 

CTACAGCGTTGGAGCAAGCTGGAGCGTGACTTTCGACAGAAGAGTGGGCGCTATGATA 

TCAGTAAGATCCCTGACATCTATGACTGTGTCAAGTATGATGTGCAGCACAATGGGAG 

TCTGGGACTTCAAGGCACAGCAGAGTTGCTCCGTCTCTCTAAGGCACTGGCTGATGTG 

GTCATTCCCCAGGAGTACGGGATCAGTCGGGAGGAGAAACTGGAAATTGCTGTGGGCT 

TCTGTCTTCCACTGTTGCGGAAGATACTACTTGACCTGCAGAGAACCCACGAGGATGA 

GTCTGTCAACAAGCTGCATCCCCTGTACTCCCGAGGCGTGCTCTCCCCAGGTCGCCAC 

GTTCGAACGCGTCTCTATTTCACCAGTGAGAGCCATGTCCACTCCCTGCTCAGTGTCT 

TCCGTTATGGAGGACITCTTGATGAGACCCAGGATGGACAATGGCAGCGAGCTTTGGA 

TTATCTTAGTGCC^TCTC^GAGCTTAACrACATGACCCAGATTGTCATCATGCTTTAT 

GAGGACAACACACAGGATCCCTTATCAGAGGAACGGTTCCATGTGGAGCTAC^CTTCA 

GCCCCGGAGTGAAAGGTGTTGAGGAAGAAGGCAGTGCCCCGGCTGGCTGTGGATTCCG 

TCCAGCCTCTTCTGAGAATGAGGAGATGAAAACCAACCAAGGCAGTATGGAGAACCTG 

TGTCCAGGAAAGGCATC^GATGAACCAGACCGGGCATTGCAGACrTCACCCCAGCCTC 

CTGAGGGCCCTGGCCTTCCGAGGAGATCACCCCTCATTCGTAACCGAAAAGCTGGTTC 

CATGGAGGTACTTTCTGAGACTTCATCCTCGAGGCCTGGTGGCTACCGGCTCTTTTCA 

TCTTC^CGGCC^CCCACAGAAATGAAGCAGAGTGGCCTAGGGTTTGAAGGGTGTTCCA 

TGGTGCCTACCATCTACCCTCTGGAAACACTGCATAATGCCCTTTCCCTACGTCAAGT 

GAGTO AATTCTTGAGTAG AGTCTG C CAGCG CCACACTG ATG CCCAGGCACAGG CATCT 

GC31GCCCTCITTGATTCCATGCACAGCAGCCAGGCCT 

CACGTACrCTTCATTCACCTCCCCTGCAACTCCAGCAGCGCTCTGAGAAGCCCCCTTG 



GATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCACCC 

AGCCAGACAGCAGTGGCCCTTCTAGCACTGTGTCCAGTGCTGGTCCTTCTTCCCCTAC 

TACAGTAGATGGTAACTCCCAATTTGGCTTCAGTGATCMCCCTCCCTAAATTCACAC 

GTGGCTGAAGAACATCAAGGCCTTGGGCTGCrCCAGGAGACCCCTGGGAG 

AAGAG CTCTCCATAG AAGGGGAG CAAGAG CTTTTTGAACCAAATCAGTCCC CACAGGT 

GCCACCTATGGAAACCAG CCAG CCATACGAGGAGGTCAG CCAGCCATGTCAGGAGGTC 

CCIXSACATCAGCCAGCCATGCCAGGACATTTCTGAGGCGCTCAGCCAGCCATGTCAGA 
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AGGTCCCTGACATCAGC CAG CAATG CCAGGAGAACCATG ACAATGGTAACCACACATG 
CC^GGAGGTCCCTC^CATC7^CCAGCCATGCCAGAAGTCCAGCCAACTGTGCCAGAAA 
GTCTCTGAGGAAGTTTGCCAG CT ATGTCTGGAGAACTCCG AGG AGGTCAG CCAGCCAT 
GCCAGGGGGTCTCTGTGGAGGTTGGCAAGCTGGTCCATAAGTTCCATGTAGGGGTTGG 
TAGCTTGGTCCAGGAAACCCTTGTAGAAGTTGGCAGCCCAGCTGAAGAGATCCCTGAG 
GAGGTC^TCCAGCCATACCAGGAGTTCTCTGTGGAGGTTGGCAGGCTGGCCCAGGAGA 
CTTCTGCGATCAATCTGTTATCTCAGGGCATCCCTGAGATTGATAAACCATCCCAGGA 
GTTCCCTGAGGAGATTGATCTGCAGGCCCAGGAGGTCCCTGAGGAGATAAATTAGAAG 
TCCTGGGTGGTCCCTGAAGTGATTGATCAGCTGCCTGGAGAGGGTATTCCTCAAGCCC 
AGCATC CATCTGGTGATC CAAACCCTCAG AGCCAGTCTCTAGCCCATGACCAG CACTC 
ACCCCTTCCACCAGCAACATGTGATTAATTTTCTCATTAGTGGTATCACACTATACCA 
GCCATTTGAGCCAGCAACCTTTTCTGTTGGCTAACTCACTGGCCAGCTCTCACCAGCG 
GTGTCTGGGGAAGTAGTTCTCTTTGTATGAAGCATACCTGTGCCAGAGCTGTGGTTGA 
GGAGGAGCCAGTTTTAGGTTCGAAGAAGCCATTGGCTCCTCACTTAGCCATTAGACTT 
GAATAGGATTTCCTTGGGGTGGGTTGGTCTGTCATTACCCAGCCTTCTCTGAGCATCC 
TAGGAAATCACAGATTGTTAAAGGAAATGCCGCTTCACTGCTGAAGACACCATCTGGC 
GAC7VGCAAATGCAAAAGAGGGGACTCTAGGGTCTTCACTTTTCTGGGGAAATGTTCAC 
GACTTCTCAAGGTACGCTTAGACCATATGTGCATTCAGGGGACTCTTGTCTTTGCCAG 
CACTCCTCAGGTGAGCCCGGGATGGCTATCTGAAATGTCTGGAAAATAGAGTCCCTTC 
CCCAAACATCTGACATGGCCACTAAATCTTTAGA 

CATCCCCLAAGTGCCCCTTCTGCCTCAGTCCTGTTCCTCTGGGCAATAGCTCTAGGGAA 
AAGTAGGTATTAGACTGCTGTGCAAAATTCAGAGCAACTACTTAAAATGCTCTAGAGG 
ATTCTTAGCCAGTCTGTGAAAGTGGGCCTCCTCTGTGGTGGGGCTGTTTGGCTTAGGA 
GATGCTGTAGTAGTAGAGATAATCAGGTTCTATTTTAAGCAATCAGCAGAGATCAATA 
TTGTACAGATACAAGCAAAGGTTTAATAAATATATATATATATATTTATTTCTGTAGT 
TGTGCCAGGGACAGACTGGCCAAAGACCAAACACTGCAGGGTCCCCAAGAAATTTGTC 
CTTATATCATTGTGTCTGAGGGCCAGATATGATTATGAAGCTTTTTCCAAAGATCCAG 
GGATGGGAATGGGAGTGGGTAGGAGGGGTAATGCGGTCATTGGAGTCGGGGGCTGGAA 
CATTATGAGTGCTCAATAAATATAAACTAATGAG 




ORF Start: ATG at 127 


ORF Stop: TAG at 4345 


| ^ 


SEQ ID NO: 182 


1406 aa 


MWat 156318.6kD 


! NOV62a, 
jCG98071-0i Protein Sequence 


MWSLTASEGESTTAHFFLGAGDEGLGTRGIGMRPEESDSELLEDEEDEVPPEPQIIVG 

ICAMTKKSKSKPMTQILERLCRFDYLTWILGEDVILNEPVENWPSCHCLISFHSKGF 

PLDKAVAYS KLRNP FLI ND LAMQYY I QDRREVYR I LQE EG I DLPRYAVLNRDPARPEE 

CNLI EGEDQVEVNGAVFPKP FVEKPVS AEDHNVYI YYPSSAGGGSQRLFRKIGSRSSV 

YS PES S VRKTGS YI YEEFMPTDGTDVKVYTVGPDYAHAEARKSPALDGKVERDSEGKE 

I RYPVMLTAMEKLVARKVCVAFKQTVCG FDLLRANGHS FVCDVNGFSFVKNSMKYYDD 

CAKI LGNT IMRELAPQFQ I PWS I PTEAEDI PIVPTTSGTMMELRCVIAI IRHGDRTPK 

QKMKMEVKHPRFFALFEKHGGYKTGKLKLKRPEQLQEVLDITRLLLAELEKEPGGEIE 

EKTGKLEQLKSVLEMYGHFSGINRKVQLTYYPHGVKASNEGQDPQRETLAPSLLLVLK 

WGGELTPAGRVQAEELGRAFRCMYPGGQGDYAGFPGCGLLRLHSTFRHDLKIYASDEG 

RVQMTAAAFAKGLLALEGELTPILVG1WKSANMNGLLDSDGDSLSSCQHRVKA 

LQQDAPFG PEDYDQLAPTRSTS LLNSMT 1 1 QNP VKVCDQVFAL I ENLTHQIRERMQDP 

RS VDLQLYHSETLEU4LQRWSKLERDFRQKSGRYD I SRI PD I YDCVKYDVQHNGSLGL 

O^TAELLRLSKALADVVIPQEYGISREEKLEIAVGFCLPLLRKILLDLQRTHEDESVN 

KLHPLY SRGVLS PGRHVRTRLYFTS ESHVH SLLSVFRYGGLLDETQDAQWQRALDYLS 

AI SELNYMTQIVIMLYEDNTQDPLSEERFHVELHFS PGVKGVEEEGSAPAGCGFRPAS 

SENEEMKTNO^SMENLCPGKASDEPDRALQTSPQPPEGPGLPRRSPLIRNRKAGSMEV 

LSETSSSRPGGYRLFSSSRPPTEMKQSGLGFEGCSMVPTIYPLETLHNALSLRQVSEF 

LSRVCQRHTDAQAQASAALFDSMHSSQASDNPFSPPRTLHSPPLQLQQRSEKPPWLET 

RFCHVGQAGLELLTS SDLPASASQS AG I TGVSHRTQPDS SGPS STVSSAG PS SPTTVD 

GNSQFGFSDQPSI^SHVAEEHOGLGLLQETPGSGAQELSIEGEQELFEPNQSPQVPPM 

ETSQPYEEVSQPCXJOTPDISQPCQDISEALSQPCQKVPDISQ^CQENHDNGNHTCQEV 

PHISQPCQKSSQLCQKVSEEVCQLCLENSEEVSQPCG^SVEVGKLVHKFHVGVGSIiV 

QETLVBVGSPAEEIPEEVIQPYQEFSVEVGRLAQETSAINLLSQGIPEIDKPSQEFPE 

EIDLQAQEVPEEIN 



Further analysis of the NOV62a protein yielded the following properties shown in 
Table 62B. 
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Table 62B. Protein Sequence Properties NOV62a \ 


PSort analysis: 

i 

i 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


. SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV62a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 62C 



i — — — -. . 

Table 62C. Geneseq Results for NOV62a 

■ 



! 

! Geneseq 
; Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV62a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


1 ABG 15880 

j 


Novel human diagnostic protein 
#15871 - Homo sapiens, 1 136 aa. 
[WO200I75067-A2, ll-OCT-2001] 


161..1008 
239.. 1056 


621/850 (73%) 
695/850 (81%) 


0.0 


! ABG 14341 


Novel human diagnostic protein 
#14332 - Homo sapiens, 1 136 aa. 
[WO200175067-A2, ll-OCT-2001] 


161..1008 
239.. 1056 


621/850 (73%) 
695/850 (81%) 


0.0 


ABG15880 


Novel human diagnostic protein 
#15871 - Homo sapiens, 1 136 aa. 
[WO200175067-A2, ll-OCT-2001] 


161..1008 
239.. 1056 


621/850(73%) 
695/850 (81%) 


0.0 


ABG 14341 


Novel human diagnostic protein 
#14332 - Homo sapiens, 1 136 aa. 
[WO200175067-A2, ll-OCT-2001] 


161..1008 
239..1056 


621/850 (73%) 
695/850(81%) 


0.0 


ABB69600 


Drosophila melanogaster 
polypeptide SEQ ID NO 35592 - 
Drosophila melanogaster, 1751 aa. 
[WO200171042-A2, 27-SEP-2001] 


16..954 
33..991 


619/966 (64%) 
730/966 (75%) 


0.0 



In a BLAST search of public sequence datbases, the NOV62a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 62D. 



323 



WO 03/010327 



PCT/US02/14199 



Table 62D. Public BLASTP Results for NOV62a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV62a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


015082 

i 


KIAA0377 PROTEIN - Homo 
sapiens (Human), 1406 aa. 


1..1406 
1.. 1406 


1406/1406 
(100%) 

1406/1406 
(100%) 


0.0 


AAH24591 


SIMILAR TO KIAA0433 
PROTEIN - Homo sapiens 
(Human), 1222 aa. 


11. .997 
2..986 


743/989 (75%) 
835/989(84%) 


0.0 


| 043314 

j 


KIAA0433 PROTEIN - Homo 
sapiens (Human), 1243 aa. 


11. .997 
2..986 


743/989 (75%) 
835/989(84%) 


0.0 


Q9VR59 


CG 1 46 16 PROTEIN - 
Drosophila melanogaster (Fruit 
fly), 1751 aa. 


16..954 
33..991 


619/966 (64%) 
730/966 (75%) 


0.0 


! T25770 


hypothetical protein F46F1 1.1 - 
Caenorhabditis elegans, 1224 aa. 


54..909 
16..9I7 


491/910(53%) 
627/910(67%) 


0.0 



PFam analysis predicts that the NOV62a protein contains the domains shown in 
the Table 62E. 



Table 62E. Domain Analysis of NO V62a 


Pfam Domain 


NOV62a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


acid_phosphat 


488..947 


78/531 (15%) 
290/531 (55%) 


0.4 


zf-NF-Xl 


1269.. 1292 


8/27 (30%) 
20/27 (74%) 


0.75 



Example 63. 



The NOV63 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 63 A. 
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Table 63A. NOV63 Sequence Analysis 




SEQ ID NO: 183 


885 bp 


NOV63a 
! CG98131-01 DN A Sequence 


CGGCCACTTCTCCCCCGTCCATCTGACCAGCTGGGCCCCTGCGCCCACCTGGCCTCCA 
CGTTCCCTCTCCTCTCACCCACACCCCTGGCCATGGCTAACTACTATGAAGTGCTGGG 
CGTGCAGGCCAGCGCTTCGCCGGAGGACATCAAGAAAGCCTACCGCAAGCTGGCCCTT 
CGTTGGCACCCCGAQUV.GAACCCTGACAATAAGGAGGAGGCGGAGAAGAAGTTCAAGC 




TGCTGGCTGTGACAGCTGGCGGGCTGGTGGCGGGGCCAGCACGCCCTACCACAGCCCC 
TTCGACACCGGCTACACCTTCCGTAACCCTGAGGACATCTTCCGGGAGTTTTTTGGTG 
GCCTGGACCCTTT CTCCTTTGAGTT CTGGGACAGCCCATTCAATAGTG ACCGTGGTGG 
CCGGGGCCATGGCCTGAGGGGGGCCTTCTCGGCJiGGCTTTGGAGAATTTCCGGCCTTC 
ATGG AGGCCTTCT CATCCTTCAACATG CTGGGCTGCAGCGGGGGCAGCCACACCACCT 
TCTCATCCACCTCCTTCGGGGGCTCCAGTTCTGGCAGCTCGGGGTTCAAGTCGGTGAT 
GTCATCCACCGAGATGATCAATGGCCACAAGGTCACCACCAAGCGCATCGTGGAGAAC 
GGGCAGGAGCGCGTGGAGGTGGAGGAAGACGGGCAGCTCAAGTCGGTGACTGTGAACG 
GCAAGGAGCAGCTCAAATGGATGGACAGCAAGTAGGCGCTGGCCACCCGGCCCTGCCT 
TCCCACCACCACCACCGTGCATGGGGCAGCAAACACGTGGGGCCGCAGACATAGCCTG 
ATGGTTAATAAATGT 




ORF Start: ATG at 91 


ORF Stop: TAG at 787 




SEQ ID NO: 184 


232 aa 


MWat 25686. IkD 


NOV63a, 
CG98 131-01 Protein Sequence 


MANYYEVLGVQASAS PED I KKAYRKLALRWH PDKN PDNKEEAEKKFKLVS EAYEVLSD 
SKKRSLYDRAGCDSWRAGGGASTPYHSPFDTGYTFRNPEDIFREFFGGLDPFSFEFWD 
S PFNS DRGGRGHGLRG AFS AGFGE F PAFMEAFS S FNMLGCSGGSHTTFSSTS FGGS SS 
GSSGFKSVMSSTEMINGHKVTTKRIVENGQERVEVEEDGQLKSVTVNGKEQLKWMDSK 



Further analysis of the NOV63a protein yielded the following properties shown in 
Table 63B. 



! — 

j Table 63B. Protein Sequence Properties NOV63a 


j PSort analysis: 

i 

i 1 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


| SignalP analysis: i 


No Known Signal Sequence Predicted 



A search of the NOV63a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 63 C. 
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Table 63C. Geneseq Results for NOV63a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV63a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


| AAU 17599 

i 

\ 


Novel signal transduction pathway 
protein, Seq ID 1 164 - Homo sapiens, 
267 aa. [WO200154733-A1, 02-AUG- 
2001] 


1..232 
36..267 


231/232 (99%) 
231/232 (99%) 


e-I36 


AAM9543I 


Human reproductive system related 
antigen SEQ ID NO: 4089 - Homo 
sapiens, 267 aa. [WO200155320-A2, 
02-AUG-2001] 


1..232 
36..267 


231/232 (99%) 
231/232 (99%) 


e-136 


AAW94066 

j 


Human DnaJ-like protein, HSPJ2 - 
Homo sapiens, 330 aa. [WO9855509- 
A2, 10-DEC-1998] 


1..222 
1..231 


147/233 (63%) 
179/233 (76%) 


4e-78 


j AAY74126 

1 


Human prostate tumor EST fragment 
derived protein #3 13 - Homo sapiens, 
3 1 7 aa. [DE 1 9820 1 90-A 1 , 04-NOV- 
1999] 


1..222 
2..232 


147/233 (63%) 
179/233(76%) 


4e-78 


; AAM40013 ; 


Human polypeptide SEQ ID NO 3 1 58 

- Homo sapiens, 309 aa. 
[WO200 1 533 1 2-A 1 , 26-JUL-200 1 ] 


I. .221 
1..225 


106/232(45%) 
149/232(63%) 


6e-48 



In a BLAST search of public sequence datbases, the NOV63a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 63D. 
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Table 63D. Public BLASTP Results for NOV63a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV63a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


r 

! Q9QY17 

» 


DnaJ homolog subfamily B member 8 
(mDJ6) - Mus musculus (Mouse), 227 
aa. 


1..230 
1. .226 


190/230 (82%) 
203/230 (87%) 


e-107 


O75I90 


DnaJ homolog subfamily B member 6 
(Heat shock protein J2) (HSJ-2) (MSJ- 
1) (HHDJ1) (MRJ) - Homo sapiens 
(Human), 326 aa. 


1..222 
1..231 


147/233 (63%) 
179/233 (76%) 


9e-78 


! 054946 

< 

! 


DnaJ homolog subfamily B member 6 
(Heat shock protein J2) (HSJ-2) (MRJ) 
(mbj4) - Mus musculus (Mouse), 242 
aa. 


1..232 
1..242 


151/246(61%) 
185/246 (74%) 


3e-77 


j Q8WN90 

i 


MOLECULAR CHAPERONE MRJ - 
Bos taurus (Bovine), 242 aa. 


1 ..232 
1..242 


148/246 (60%) 
180/246 (73%) 


4e-77 


; 035723 

i 


DnaJ homolog subfamily B member 3 
(DnaJ protein homolog 3) (Heat shock 
J3 protein) (HSJ-3) (MSJ-I) - Mus 
musculus (Mouse), 242 aa. 


I..226 
1..236 


133/238(55%) 
169/238(70%) 


2e-65 



PFam analysis predicts that the NOV63a protein contains the domains shown in 
the Table 63E. 



Table 63E. Domain Analysis of NOV63a 


Pfam Domain 


NOV63a Match Region 


Identities/ ' 
Similarities 
for the Matched Region 


Expect Value 


DnaJ 


3..69 


45/79 (57%) 
62/79 (78%) 


4.7e-40 



Example 64. 



The NOV64 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 64A. 
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Table 64A. NOV64 Sequence Analysis 




SEQ ID NO: 185 


2711 bp 


NOV64a, 
CG98I64-01 DNA Sequence 


v-v- x.muv3v_«a\_cv_v»vji. i /u\ai_a/UsA.L tioiAxtU^i. 1 uCGAGAGGGCTLTI CCTGAGGACCAG 
AGCTGCCTCTCTCCTGAGAGGCTTGGGCAGATCCCGAACTGGAGCCCGATCGTTACAG 
TTTCGCG CAGAAAAAGAGCGTCAGCCTTG CTGGTCTTTTCCCATGGGGCAGAAGACGA 
AAGGCAGCTCTAACATAGCCTCCTCCTACCTGCTCCAGCAGCTCATGCACCGCTATCA 
GGAGCTGGACTCGGACGGAGATGAGGACCACGGCGAGGGCGAGGCGGGATCCGAGGAG 
TCCTCAGAGTCCGAAATGCTGAATTTGGAGGAGGAATTTGATGGGGTCCTGAGAGAGG 
AGGCTGTGGCCAAAGCACTCCATCACTTGGGGCGCTCAGGCTCTGGGACTGAGCAAGT 
CTACCTCAATCTAACTTTATCAGGTTGTAATTTAATTGATGTTAGCATTCTCTGTGGA 
TATGTTCATCTACAGAAGTTGGATCTTTCAGCGAATAAAATTGAAGATTTATCTTGTG 
TGAGTTGTATGCCTTATCTCCTAGAACTTAATGCTTCTCAAAATAATTTGACTACGTT 
CTTCAATTTCAAGCCACCCAAAAACCTCAAGAAGGCGGATTTTTCCCACAACCAAATT 
TCTGAAATTTGTGATTTGTCAGCGTATCATG CTCTCACTAAACTAATTTTGG ATGG CA 
ATGAGATAGAAGAAATCAGTGGACTAGAGATGTGCAACAACCTAATTCACCTTAGTTT 
GGCCAACAATAAGATCACGACAATTAATGGCTTAAACAAGTTACCAATCAAAATACTT 
TGTCTGAGCAATAACCAGATTGAGATGATCACAGGTTTGGAGGATCTGAAAGCCCTGC 
AGAACCTGGATCTGTCCCACAATCAGATAAGCAGCCTCCAAGGCTTAGAGAATCATGA 
CCTCCTGGAAGTGATCAACCTGGAGGATAATAAGATTGCTGAGCTGAGAGAAATAGAA 
TACATTAAAAATTTACCCATCCTTCGAGTTCTCAATCTTCTAGAAAATCCAATTCAGG 


* 

• 


AAAAGTCTGAATATTGGTTCTTCGTAATTTTTATGCTTCTGCGATTAACAGAATTAGA 

TCAGAAGAAGATTAAAGTGGAAGAAAAGGTTTCAGCAGTGAATAAATATGATCCTCCC 

CCTGAAGTGGTTGCAGCTCAAGACCACCTGACCCATGTTGTCAACAGCGTGATGCAGC 

CGCAGAGGATCTTTGACAGCACTCTTCCCAGCCTGGATGCCCCTTATCCCATGCTGAT 

ACTAGCTGGTCCTGAAGCTTGTGGGAAACGAGAGCTTGCCCATCGCCTCTGCAGACAG 

TTTAGCACTTACTTCAGATATGGGGCCTGTCATACCACAAGACCACCTTACTTTGGAG 

AAGGGGATCGAGTTGATTATCATTTTATCTCTCAAGACGTTTTTGATGAAATGGTGAA 

CATGGGGAAATTCATTCTAACATTTAGTTATGGTAATCACAAGTATGGATTAAATAGG 

G ACACCGTAGAAGGTATCG CAAG AGATGGTTTGG CAAGCTGTATTC ATATGGAAATAG 

AAGGTGTAAGAAGTTTGAAATATTCCTATTTTGAGCCTCGTTATATCCTGGTGGTGCC 

CATGAACAAGGAAAAATATG AG GG AT ATTTG CGGAGAAAAGGATTATTC AGTCGTGC A 

GAAATTGAATTTGCTGTCTCCAGAGTGGACCTTTATATTAAAATTAATCAGAATTTTC 

CGGGATATTTTGATGAGGTAATCAATGCAGATGATCTGGATGTTGCCTACCAAAAACT 

GAGTCAG CTCATTAG AG AATAC CTTGGATTGACTG AGGAACCTGCCAAGAGTTTGGCT 

ACAACTGCAGATGTTAAGACATCCCACCTGAAACCTGAAGCGCATCCTACAAAGTATA 

TTTCTTCGAATATGGGTGATTTCCTGCATTCTACAGACAGAAACTACCTCATTAAATT 

TTGGGCCAAACTTTCAGCCAAAAAAACACCAGCGGAAAGAGATTCTATACAC^ 

CATGAGGCAGCCCGGCAAGCTCTAATGGGAAGGATACGCCCTGATCAC^CACTCCTAT 

TTCAAAGGGGTCC^GTACCAGCACCTCTCACCAGTGGTCTACACTATTATACAACTTT 

AGAAGAACTCTGGAAAAGTTTTGATCTTTGTGAAGACTATTTTAAACCTCCATT^ 

C CATATCCTGAAAAGAGTGGGAAGG ATTCCTTG GTTTC CATGAAATGTTCATTGTTTC 

GGTTCTGTCCGTGGTCAAAAGAATTGCCTTTCCAGCCTCCGG^ 

ACACCTAGG AT CAGG AG CCAGTGACAG TG AG AC CG AAG AG AC CCGGAAAGCACTACCT 
ATACAAT C ATTTTCACATG AAAAAGAGT CTCAC CAACAC AGACAA CACT CGGTCCCAG 
TC^TCAGTCGCCC^GGTTC(^CGTaUUVCCC^CCCTCCCTCCAATCCCTCAGG^ 
CAGGTAGACTAGCACTTGATGTCTCATCCTAACATGGAAAACCTGCTCTGCnX^TGTC 
GAATTCCTTGCCTTACCTGGCCATGGGTCCAGCTGT 

CGGAAGAATGTTCTACCTGCCTTAATTCTATCAGCC^GTTTCrCrTGTGATTCrTTGG 
CTGGTGT CTTTTAGTTTTTTAAT T AAAAAATTG TT'l' CTTAAAA 




ORF Start: ATG at 24 


ORF Stop: TAG at 2499 




SEQ ID NO: 186 


825 aa MWat 93626.3kD 


NOV64a, 
CG98 164-01 Protein Sequence 


MATSERAIJ^TRAASIiRGLGRSRTGARSI^FRAEKERQPCWSFPMGQKTKGSSNIAS 
SYtJjQQLMHRYQELDSDGDEDHGEGEAGSEBSS ES EMLNLEEEFTJGVLREEAVAKALH 
HLGRSGSGTEQVYLNLTLSGCNL IDVS I LCGYVHLQKLDLSANKI EDLS CVS CMPYLL 
ELNASQNNLTTFFNFKPPKNLKKADFSHNQI SEICDLSAYHALTKLILDGNEIEEISG 
LEMQWLIHLSLANNKITTINGLNKLPIKILCLSNNQIEMITGLEDLKALQNLDLSHN 
QISSLQGLEKHDLLEVINLEDNKIAELREI EYI KNLPILRVLNLLENPIQEKSEYWFF 
VI FMLLRLTELDQKKI KVE EKVS AVN KYD P P P EWAAQDHLTHWNS VMQP QR I FDST 
LPSLDAPYPMLILAGPEACGKRELAHRLCRQFSTYFRYGACHTTRPPYFGEGDRVDYH 
FISQDVFDEMVNMGKFILTFSYGNHKYGLNRDTVEGIARDGLASCIHMEIEGVRSLKY 
S YFEPRYI LWPMNKE KYEGYLRRKGLFSRAE I EFAVSRVDLYI KINQNFPGYFDEVI 
NADDLDVAYQKLSQLIREYLGLTEEPAKSLATTADVKTSHLKPEAHPTKYISSNMGDF 
LiHSTDRNYL I KFWAKLS AKKTP AERD S I HRQH EAARQ ALMGR I RPDHTLL FQRG P VP A 
PLTSGXJTYYTTLEELWKSFDLCEDYFKPPFGPYPEKSGKDSLVSMKCSLFRFCPWSKE 
LPFQPPEGSISSHLGSGASDSETEETRKALPIQSFSHEKESHQHRQHSVPVISRPGSN 
VKPTLPPIPQGRR 
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SEQ ID NO: 187 


2550 bp 


NOV64b, 
CG98 164-02 DNA Sequence 

! 

! 

i 

i 

i 
1 
I 
1 

i 

i 

•f 

3 

! 

i 

! 


AAGATGGCGACCTCCGAGAGGGCTCTCCTGAGGACCAGAGCTGCCTCTCTCCTGAGAG 
GCTTGGGCAGATCCCGAACTGGAGCCCGATCGTTAC^USTTTCGCGCAGAAAAAGAGCG 
TCAGCCTTGCTGGTCTTTTCCC^TGGGGCAGAAGACGAAAGGCAGCTCTAACATAGCC 
TCCTCCTACCTGCTCCAGCAGCTCATGCACCGCTATCAGGAGCTGGACTCGGACGGAG 
ATGAGGACCAGGGCGAGGGCGAGGCGGGATCCGAGGAGTCCTCAGAGTCCGAAATGCT 
GAATTTGGAGGAGGAATTTGATGGGGTCCTGAGAGAGGAGGCTGTGGCCAAAGCACTC 
CATCACTTGGGGCGCTCAGGCTCTGGGACTGAGCAAGTCTACCTCAATCTAACTTTAT 
CAGGTTGTAATTTAATTGATGTTAGCATTCTCTGTGGATATGTTCATCTACAGAAGTT 
GGATCTTTCAGCGAATAAAATTGAAGATTTATCTTGTGTGAGTTGTATGCCTTATCTC 
CTAGAACTTAATGCTTCTCAAAATAATTTGACTACGTTCTTCAATTTCAAGCCACCCA 
AAAACCTCAAGAAGGCGGATTTTTCCCACAACCAAATTTCTGAAATTTGTGATTTGTC 
AGCGTATCATG CT CTCACT AAACTAATTTTGG ATGGCAATGAGATAG AAGAAATCAGT 
GGACTAGAGATGTGCAAOU^CCTAATTCACCTTAGTTTGGCCAACAATAAGATCACGA 
CAATTAATGGCITAAACAAGTTACCAATCAAAAT^ 

TGAGATGATCAC^GGTTTGGAGGATCTGAAAGCCCTGCAGAACCTGGATCTGTCCCAC 
AATC AG ATAAG CAG CCTCC AAGG CTT AGAG AATCATG AC CTCCTGG AAG TG AT CAAC C 
TGGAGG AT AAT AAG ATTG CTG AG CTGAGAG AAAT AG AAT ACATT AAAAATTTACC CAT 
CCTTCGAGTTCTC^TCTTCrrAGAAAATCCAATTCAGGAAAAGTCTGAATATTGGTTC 
TT CGT AATTTTT ATG CTTCTGCG ATT AAC AG AAT TAG AT CAG AAGAAG ATT AAAGTG G 
AAGAAAAGGTTTCAGCAGTGAATAAATATGATCCTCCCCCTGAAGTGGTTGCAGCTCA 
AG ACCACCTGACCCATGTTGTCAACAGCGTGATG CAGCCG CAG AGGATCTTTGACAGC 
ACTCHTCCCAGCCTGGATGCTCCTTATCCCATGCTGATACT^^ 

GTGGGAAACGAGAGCTTGCCCATCGCCTCrGCAGACAGTTTAGCACTTACTTCAGATA 
TGGGGCCTGTCATACCACAAGACCACCTTACTTTGGAGAAGGGGATCGAGTTGATTAT 
CATTTTATCTCTCAAGACGTTTTTGATGAAATGGTGAACATGGGGAAATTCATTCTAA 
CATTTAGTTATGGTAATCACAAGTATGGATTAAATAGGGACACCGTAGAAGGTATCGC 
AAGAGATGGTTTGGCAAGCTGTATTCATATGGAAATAGAAGGTGTAAGAAGTTTGAAA 
TATTCCTATTTTGAGCCTCGTTATATCCTGGTGGTGCCCATGAACAAGGAAAAATATG 
AGGGATATTTGCGGAGAAAAGGATTATTCAGTCGTGCAGAAATTGAATTTGCTGTCTC 
CAGAGTGGACCITTATATTAAAATTAATCAGAATTTTCCGGGATATTTTGATGAGGTA 
ATCAATGCAGATGATCTGGATGTTGCCTACCAAAAACTGAGTCAGCTCATTAGAGAAT 
ACCTTGGATTGACTGAGGAACCTGCCAAGAGTTTGGCrACAACTGCAGATGTTAAGAC 
AT CCCAC CTGAAACCTG AAG CG CATCCTACAAAG TATATTTCTTCGAATATGGGTGAT 
TTCCTG CATTCTACAG AGAT AAACT ACCT C ATT AAATTTTGGG C C^ C C A 
AAAAAACACCAGCGG AAAG AGATTCTATACAC AGACAGCACG AGGCAGCCCGG CAAG C 
T CT AATGG G AAGG AT ACG CC CTG ATCACAC ACTC CT ATTT CAAAGGGG TC C AGTAC CA 
GCACCTCTCACCAGTGGTCTACACTATTATACAACTTTAGAAGAACTCTGGAAAAGTT 
TTGATCTTTGTGAAGACTATTTTAAACCTCCATTTGGACCATATCCTGAAAAGAGTGG 
GAAGGATTCCTTC^TTTCCATGAAATGTTCATTGTTTCGGTTCTGTCCGTGGTC!AAAA 
GAATTGCCTTTCCAGCCTCCGGAGGGGAGCATTTCTTCA 

GTGACAGTG AGACCGAAG AG AC CCGGAAAG (^CT ACCTATAC!AATCATTTTCACATGA 
AAAAGAGTCTCACCAACACAGACAACAOTCGGTCCC^GTCATCAGTCXSCCCAGGTTCC 
AArnTCAAACCCACCCTCCCTCCAATCCCrCGGGGCC^CAGGTAGACTAGCACTTGAT 
GTCTGATCCTAACATGG AAAACCTG CTCTGCTG ATGTCGAATT C CTTG CCTT ACCT 




ORF Start: ATG at 4 


ORF Stop: TAG at 2479 




SEQ ID NO: 188 


825 aa 


MW at 93602.3kD 


NOV64b, 
CG98 164-02 Protein Sequence 


MATS ERALLRTRAASLLRGLGRSRTGARSLQFRAEKERQPCWSF PMGQKTKGSSNI AS 
S YLLQQLMHRYQELDS DGDEDQGEGEAGSEES SE SEMLNLEEEFDGVLREEAVAKALH 
HLGRSG SGTEQVYLNLTLSG CNLI DVSILCGYVHLQKLDLS ANKI EDLS CVSCMPYLL 
ELNASQNOTiTTFFNFKPPKNLKKADFSHNQISEICDLSAYHALTKLILDGNEIEEISG 
LEMCNNLIHLSLANNKITTINGLNKLPIKILCLSNNQIEMITGLEDLKALQNLDLSHN 
QISSLQGLENHDLLEVINLEDNKIAELREIEYIKNLPILRVLNLLENPIQEKSEYWFF 
VIFMLLRLTELDQKKIKVEEKVSAWKYDPPPEWAAQDHLTHWNSVMQPQRIFDST 
LPSLDAPYPMLIIAGPEACGKREIiAHRLCRQFSTYFRYGACHTTRPPyFGEGDRVDYH 
FI SQDVFDEMA/NMGKFILTFSYGNHKYGLNRDTVEGI ARDGLAS CI HMEI EGVRSIiKY 
SYFEPRYILWPMNKE KYEGYLRRKGLFSRAEI EFAVSRVDLYI KI NQNFPGYFDEVI 
NADDLDYAYQKLSQLI REYLGliTEEPAKSLATTADVKTSHLKPE AHPTKYISSNMGD F 
LHSTDINYLIKFWAKLSAKKTPAERDSIHRQHEAARQALMGRIRPDHTLLFQRGPVPA 
PLTSGLHYYTTLEELWKSFDLCEDYFKPPFGPYPEKSGKDSLVSMKCSLFRFCPWSKE 
LPFQPPEGS I SSHLGSGASDSETEETRKAL PIQS FSHEKESHQHRQHSVPVI SRPGSN 
VKPTLPPI PRGRR 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 64B. 
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Table 64B. Comparison of NOV64a against NOV64b. 


Protein Sequence 


NOV64a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV64b 


31. .825 
3 1. .825 


750/795 (94%) 
751/795(94%) 



Further analysis of the NOV64a protein yielded the following properties shown in 
Table 64C. 



Table 64C. Protein Sequence Properties NOV64a 


PSort analysis: 

I 

r 


0.9074 probability located in mitochondrial matrix space; 0.6000 probability 
located in mitochondrial inner membrane; 0.6000 probability located in 
mitochondrial intermembrane space; 0.6000 probability located in mitochondrial 

outer membrane 


\ SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV64a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 64D. 
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Table 64D. Geneseq Results for NOV64a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, Date] 


NOV64 
a 

Residue 

s/ 
Match 
Residue 
s 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


ABG07962 


Novel human diagnostic protein #7953 - 
Homo sapiens, 1 175 aa. [WO200 175067- 
A2, ll-OCT-2001] 


235..614 
1..389 


347/389 (89%) 
354/389 (90%) 


0.0 


ABG07962 


Novel human diagnostic protein #7953 - 
Homo sapiens, 1 175 aa. [WO200 175067- 
A2, ll-OCT-2001] 


235..614 
1..389 


347/389 (89%) 
354/389 (90%) 


0.0 


AAM05508' 

! 

1 


Peptide #4190 encoded by probe for 
measuring breast gene expression - Homo 
sapiens, 61 aa. [WO200157270-A2, 09- 
AUG-2001] 


31. .91 
1..61 


60/61 (98%) 
60/61 (98%) 


2e-28 


; AAM30367 

i 


Peptide #4404 encoded by probe for 
measuring placental gene expression - 
Homo sapiens, 61 aa. [WO2001 57272- A2, 
09-AUG-2001] 


3I..9I 
1..61 


60/61 (98%) 
60/61 (98%) 


2e-28 


AAMI7861 

\ 

i 


Peptide #4295 encoded by probe for 
measuring cervical gene expression - 
Homo sapiens, 61 aa. [WO200157278-A2, 
09-AUG-2001] 


31..91 
I ..61 


60/61 (98%) 
60/61 (98%) 


2e-28 



In a BLAST search of public sequence datbases, the NOV64a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 64E. 
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Table 64E. Public BLASTP Results for NOV64a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV64a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q96M69 


CDNA FU32786 FIS, CLONE 
TESTI2002256, WEAKLY SIMILAR 
i \J UUA1N t LA 1 b Aofc (EL 2.7.4.8) 
- Homo sapiens (Human), 825 aa. 


1..825 
1..825 


824/825 (99%) 
824/825 (99%) 


0.0 


Q9D5S7 


492 1 528H 16RIK PROTEIN - Mus 
musculus (Mouse), 820 aa. 


1..825 
1..820 


UUOVOZ/ {fJ/Qj 

680/827(81%) 


A A 
U.U 


BAB85845 


AXONEMAL LEUCINE-RICH 
REPEAT PROTEIN - Ciona intestinalis, 
325 aa. 


92..377 
8..305 


97/300 (32%) 
160/300 (53%) 


6e-31 


035125 

] 


CHROMOSOME 6 B AC-284H 1 2 
(RESEARCH GENETICS MOUSE BAC 
LIBRARY) COMPLETE SEQUENCE - 
Mus musculus (Mouse), 340 aa. 


71. .372 
9..312 


96/306(31%) 
162/306 (52%) 


2e-30 


Q99620 


LEUCINE RICH PROTEIN - Homo 
sapiens (Human), 3 12 aa. 


73..318 
16..26I 


81/249 (32%) 
130/249 (51%) 


6e-23 



PFam analysis predicts that the NOV64a protein contains the domains shown in 
the Table 64F. 



Table 64F. Domain Analysis of NOV64a 


Pfam Domain 


NOV64a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


LRR 


238..2S9 


8/25 (32%) 
17/25 (68%) 


0.21 


LRR 


281..302 


10/25 (40%) 
16/25 (64%) 


0.69 


Guanylate_kin 


450..554 


28/108(26%) 
66/108 (61%) 


1.7e-07 



Example 65- 



The NOV65 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 65 A. 
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Table 65A. NOV65 Sequence Analysis 




SEQ ID NO: 189 


723 bp 


NOV65a, 
CG99588-01 DNA Sequence 

I 

1 
< 


ATGGTGTCCCCGGTCACTGTGGTGAAGAGTGAAGGACCCAAACTGGTGCCGTTCTTCA 
AGGCCACCTGCGTGTATTTTCTGCTCTGGCTGCCCTCATCTAGCCCATCGTGGGTCAG 
CACCCTCATCAAGTGCCTG CCTATCTTCTGCCTCTGGCTCTTCCTTCTGGCCCATGG C 
CTGGGATTCCTGCTGGCCCACCCC^GCGCCACCCGCATCTTTGTGGGGCrTGTCrrTCT 
CTGCTGTAGGTGACGCCTTCCTCATCTGGCAGGACCAAGGATACTTCGTGCATGGTCT 
GCTGATGTTTGCTGTGACCCACATGTTCTACGCCTC^GCCTTTGGCATGCAGCCACTG 
GCTCTTCGGACAGGTCTGGTGATGGCAGCGCTGTCGGGCCTGTGCTATGCCCTCCTCT 
ACCCATGCCTCTCAGGTGCCTTCACCTACCTGGTGGGGGTCTATGTGGCCCTTATCGG 
CTTCATGGGCTGGCGAGCTATGGCAGGGCTGCGGCTGGCCGGGGCAGACTGGCGCTGG 
ACAGAGCTGGCAGCTGGCAGTGGTGCACTCTTCTTTATCATCTCAGACCTGACCATCG 
CCCTCAACAAATTCTGTTTTCCTGTG CCCT ACTCTCGGGCG CTTATCATGTCCACCTA 
CTATGCGGCCCAGATGCTCGTCGCCTTGTCAGCTGTCGAAAGCCGGGAGCCTGTGGAA 
CACTACAGACTGACCAAGGCCAACTGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 721 




SEQ ID NO: 190 


240 aa MW at 26370. lkD 


NOV65a, 
CG99588-01 Protein Sequence 

1 

i 
1 


MVSPVTWKSEGPKLVPFFKATCVYFVLWLPSSSPSWVSTLI KCLPI FCLWLFLLAHG 
LG FLIAHPSATRI FVGL V FS AVGDAFLI WQDQG YFVHGLLM FAVTHM FYAS AFGMQPL 
ALRTGLVMAALSG LCY ALLYPCLS G AFTYLVGVYVAL IG FWGWRAMAGLRLAGADWRW 
TELAAGSGALFFIISDLTIALNKFCFPVPYSRALIMSTYYAAQMLVALSAVESREPVE 
HYRLTKAN 



. Further analysis of the NOV65a protein yielded the following properties shown in 
Table 65B. 



J — _ — , _ 

Table 65B. Protein Sequence Properties NOV65a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.0300 
probability located in mitochondrial inner membrane 


SignalP analysis: 


Cleavage site between residues 65 and 66 



A search of the NOV65a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 65C. 
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Table 65C Geneseq Results for NOV65a 


Geneseq 
Identifier 

1UV11M1 IVI 


Protein/Organism/Length [Patent #, 


NOV65a 
Residues/ 

Residues 


Identities/ 
Similarities for 

Region 


Expect 

V alUc 


ABG 18549 


Novel human diagnostic protein #18540 
- Homo sapiens, 350 aa. 
r WO200 1 75067- A 2 1 1 -OfT-200 1 1 


1..240 
111. .350 


240/240(100%) 
240/240(100%) 


e-139 


ABG 18549 


Novel human diagnostic protein #18540 
- Homo sapiens, 350 aa. 
[WO200175067-A2, ll-OCT-2001] 


1..240 
111. .350 


240/240(100%) 
240/240(100%) 


e-139 


AAM93259 


Human polypeptide, SEQ ID NO: 2709 
- Homo sapiens, 240 aa. [EP1 130094- 
A2,05-SEP-2001] . 


1..240 
1..240 


239/240 (99%) 
239/240 (99%) 


e-138 


ABB59182 


Drosophila melanogaster polypeptide 

SEQ ID NO 4338 - Drosophila 
melanogaster, 254 aa. [WO200 171042- 
A2, 27-SEP-2001] 


9..226 
9..240 


84/238(35%) 
118/238 (49%) 


2e-26 


AAY116I8 


Human 5* EST secreted protein SEQ ID 
NO:270 - Homo sapiens, 100 aa. 
[WO9906439-A2, ll-FEB-1999] 


15..87 
25..93 


34/76 (44%) 
47/76 (61%) 


6e-10 



In a BLAST search of public sequence datbases, the NOV65a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 65D. 



Table 65D. Public BLASTP Results for NOV65a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV65a. 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96AJ0 


SIMILAR TO RIKEN CDNA 
1810054013 GENE - Homo sapiens 
(Human), 224 aa (fragment). 


17..240 
1..224 


220/224 (98%) 
220/224 (98%) 


e-126 


Q9D8N3 


1 810054O13RIK PROTEIN - Mus 
musculus (Mouse), 241 aa. 


1..240 
L.241 


214/241 (88%) 
224/241 (92%) 


e-122 


Q9VAJ0 


CG7582 PROTEIN - Drosophila 
melanogaster (Fruit fly), 254 aa. 


9..226 
9..240 


84/238(35%) 
118/238(49%) 


5e-26 


Q9KLP0 


HYPOTHETICAL PROTEIN 
VCA0703 - Vibrio cholerae, 229 aa. 


21. .225 
31..228 


68/215 (33%) 
105/215(48%) 


le-12 


Q8YDD4 


HYPOTHETICAL PROTEIN 
BMEII0243 - Brucella melitensis, 184 
aa. 


66..223 
15..176 


53/168 (31%) 
79/168(46%) 


3e-l I 
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PFam analysis predicts that the NOV65a protein contains the domains shown in 
the Table 65E. 



Table 65E. Domain Analysis of NOV65a 



Pfam Domain 



NOV65a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



Example 66. 



The NOV66 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 66A. 



Table 66 A. NOV66 Sequence Analysis 



SEQ ID NO: 191 



1773 bp 



NOV66a, 
\CG996 18-01 DIM A Sequence 



GGAGGAACATGAC^TCATGGAAATGGTTTC^CCCAAATATC^CTGGTGTGGAGGCAGA 



AAACCTACTGTTGACAAGAGGAGTTGATGGCAGTTTTTCGGCAAGGCCTAGTAAAAGT 
AACCCTGGAGACGTCACACTTTCTGTTAGAAGAAATGGAGCTGTCACCCACACCAAGA 
TTCAGAACACTGGTGATTGCTATGACCTGTATGGAGGGGAGAAGTTTGCCACTTTGGC 
TGAGTTGGTCCAGTATTACATGGAACATCATGGGCAATTAAAAGAGAAGAATGGAGAT 
GTTATTGAGCTTAAAAATCCTCTGAACTGTGCAGATCCTACTTCTCAAAGGTGGTTTC 
ATGG ACAC CTCT CTGG AAAAG AAG C AG AG AAAT TGTT AA CTG AAAAAGG AAAG CAT AG 
T AGCTTT CTTGTACGAGAGAG CCAGAGCCACCCTG ATTTTGTTCTCTCCGTGTG CACC 
GGTGATGACAAAGGAGAGAGCAATGACGGCAAGTCTAAAGTGACTCATGTCATGATTC 
ACTGTCAGGAACTGAAATACGATGTTGGTGGAGGAGAACGGTTTGATGCTTTGACAGA 
T CTTAAGAAG AATCCTATG GTGGAAACACTG GGTACAGTACTACAACTCAAGCAG CCC 
CTTAACACGACTTGTACAAATGCTGCTGAAATAGAAAGCAGAGTTCGAGAACTAAGCA 
AATTAGCTGAGACCACAGATAAAGTCAGACAAGGCTTTTGGGAAGAATTTGA^ 
GCAACAACAGGAGTGCAAACTTCTCTACAGCCAAAAAGAGGGTCGAAGGCAAGAAAAC 
AAAAC CAAAAATAG ATATAAAAACATCCTGCCCTTTG ATCATAGGGTTGTCCTACACA 
ATGGTGATCCCAGTGAGGCTGTTTCAGATTACATCAACGCAGATATCATCATGCCTGA 
ATTTGAACCAAGTGCACAATTTGTAGCCAAAAAAAGCTACTTGGGAA 
ACTGGGTTATTTCTTAATGAGTTCTGGC^CATGGTTCrCATCTTTT^ 
GTGTAACTCGGCAGCACCAAGTGGAGAGAGGAAAGAGTAAATGTGTCAAATACTGGCC 
CGATGAGTATGCTCTAAAAGAATATGGCATCGTGCATGTTAGGAACGTCAAAGAAAGC 
ACCACT CATGACTATAAG CTAAG AAAACTTAAACTTTCAAAG ATTGGACAAGGG AAT A 
TGGAGAGAATGGTCTGGC^TACCACTTGTGGACCTGGCTGTACCACGGAGTGCCCTG 
CGACCCCAAAGGCATGCTGCACTTCCTGGACGAAGTGCACCATAAGCAGGAGAGCATC 
ATGGATGCAGGGCTGGTCGTGGTGCCCTGCAGTGCTGGCATTGGCCAGACAGGGACGT 
TCATTGTGATTGATATTCTTATTGACATCATC^ 

TGACGTTCCCAAAACCATCCAGATGGTGCGGTCTCAGAGGTCAGAGATGGTCCAGACA 
GAAGC^CAGTACOSATTTATCTATACGGTGGTCC^GCATCATATTGAAACACTACAGC 
GCAGGATTGAGGAAGAGCAGAAAAGCIAAGAGGAAAGGGCACGAATATACAAATATTAA 
GTATTCTCTAGTGGAC CAGACGAGTGGGGATCAGAG CCC CCT CCTG C CTTAT ACTCCA 
ATGCC^CCCTGTGCAGAAATGAGAGAAGAGAGT<3CC^GAGTCTATGAAAACGT<K3GCC 
TGATG CAACAG CAG AAACATTT AAGATGAGAAA 
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ORF Start: ATG at 9 


ORF Stop: TGA at 1767 




SEQ ID NO. 192 


586 aa MW at 67260.0kD 


NOV66a, 
CG99618-0I Protein Sequence 

i 

I 


MTSWKWFHPNI TG VEAENLLLTRGVDGSFSARPS KSNPGDVTLS VRRNGAVTHTKI QN 
TGD CYDLYGG EKFATLAELVQYYMEHHGQLKEKNGDVI ELKNPLNCADPTSQRWFHGH 
LSGKEAEKLLTEKGKHSSFLVRESQSHPDFVLSVCTGDDKGESNDGKSKVTHVMIHCQ 
EI,KYT5VGGGERFDALTDLKKNP^ETLGTVLQIiKQPLNTTCTNAAEIESRVRELSKIiA 
ETTDKVRQGFWEEFETLQQQECKLLYSQKEGRRQENKTKNRYKNILPFDHRVVLHNGD 
PSEAVSDYINADIIMPEFEPSAQFVAKKSYLGTSLITGLFLNEFWHMVLIFLRGKSVT 
RQHQVERGKSKCVKYWPDEYALKEYGIVHVRNVKESTTHDYKLRKXiKLSKIGQGNMER 
MVWQYHLWTWLYHGVPCDPKGMLH FLDEVHHKQES IMDAGLVWPCSAGIGQTGTFI V 
IDILIDI IREKGVDCE I DVPKTI QMVRSQRSEMVQTEAQYRF I YTWQHHI ETLQRRI 
EEEQKSKRKGHEYTNIKYSLVDQTSGDQSPLLPYTPMPPCAEMREESARVYENVGIiMQ 
QQKHLR 



Further analysis of the NOV66a protein yielded the following properties shown in 
Table 66B. 



f — ' — — — — — ■ — ^ 

Table 66B. Protein Sequence Properties NOV66a 


j PSort analysis: 

: 
i 


0.4500 probability located in cytoplasm; 0.3719 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


[ SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV66a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 66C 
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Table 66C. Geneseq Results for NOV66a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV66a 

R pQinnpc/ 

Match Residues 


Identities/ 
oirriudnucb lor me 
Matched Region 


Expect 
Value 


AAB59215 


SHP-2 protein - Unidentified, 593 aa. 
[US6 156551 -A, 05-DEC-2000] 


1..586 
1..593 


498/593 (83%) 
521/593 (86%) 


0.0 


AAY 13476 


Peptide Seq ID No: 8 of W09923493 
- Homo sapiens, 593 aa. 
[W09923493-A1, 14-MAY-1999] 


1..586 
I. .593 


498/593 (83%) 
521/593 (86%) 


0.0 


AAR99313 

j 


Human SH-PTP2 (protein tyrosine 
phosphatase-2, with SH2 domains) - 
Homo sapiens, 593 aa. [US5536636- 
A, 16-JUL-1996] 


1..586 
1..593 


498/593 (83%) 
521/593 (86%) 


0.0 


AAB59221 

\ 
\ 


SHP-2 activated double compound 
mutant E76A/D425A protein - 
Unidentified, 593 aa. [US6 156551 -A, 
05-DEC-2000] 


1..586 
1..593 . 


497/593 (83%) 
520/593 (86%) 


0.0 


\ AAB59214 

i 
j 


SHP-2 mutant E76A protein - 
Unidentified, 593 aa. [US6 156551 -A, 
05-DEC-2000] 


1.586 
1.593 


497/593 (83%) 
520/593 (86%) 


0.0 



In a BLAST search of public sequence datbases, the NOV66a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 66D. 
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Table 66D. Public BLASTP Results for NOV66a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV66a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q06124 

i 

i 

i 

i 


Protein-tyrosine phosphatase, non- 
receptor type 1 1 (EC 3. 1 .3.48) 
(Protein-tyrosine phosphatase 2C) 
(PTP-2C) (PTP-1D) (SH-PTP3) (SH- 
PTP2) (SHP-2) - Homo sapiens 
(Human), 593 aa. 


1..586 
1..593 


498/593 (83%) 
521/593 (86%) 


0.0 


Q64509 


PROTEIN TYROSINE 
PHOSPHATASE (EC 3.1.3.48) - Mus 
rnusculus (Mouse), 597 aa. 


1..586 
1..597 


497/597 (83%) 
522/597 (87%) 


0.0 


P41499 

i 
i 

! 


Protein-tyrosine phosphatase, non- 
receptor type 11 (EC 3. 1.3.48) 
(Protein-tyrosine phosphatase SYP) - 
Rattus norvegicus (Rat), 593 aa. 


1..586 
1..593 


494/593 (83%) 
520/593 (87%) 


0.0 


i Q90687 

j 

| 

1 
i 

i 
\ 


PROTEIN-TYROSINE 
PHOSPHATASE N 1 1 (EC 3. 1 .3.48) 

(PROTEIN TYROSINE 
PHOSPHATASE, NON-RECEPTOR 
TYPE 1 1) - Gallus gallus (Chicken), 
593 aa. 


1..586 
1.593 


491/593 (82%) 
518/593 (86%) 


0.0 


A53593 


protein-tyrosine-phosphatase (EC 
3.1 .3.48), nonreceptor type 1 1 - rat, 
597 aa. 


1..586 
1..597 


493/597 (82%) 
519/597 (86%) 


0.0 



PFam analysis predicts that the NOV66a protein contains the domains shown in 
the Table 66E. 



Table 66E. Domain Analysis of NOV66a 


Pfam Domain 


NOV66a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


SH2 


6..81 


39/79 (49%) 
66/79 (84%) 


8e-38 


SH2 


112..192 ' 


33/84 (39%) 
72/84 (86%) 


3.6e-29 


Y_phosphatase 


268..513 


98/298 (33%) 
187/298(63%) 


1.7e-58 
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Example 67. 

The NOV67 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 67A. 



Table 67A. NOV67 Sequence Analysis 

j . 


i 

s 


SEQ ID NO: 193 


1311 bp 


i NOV67a, 
CG99832-01 DNA Sequence 


CCGGGATG CACTGTTCCTG CTGTGGGTCCTCATCAT6GAG ACCAAACGGGTGGAGATT 
CCCGG CAGCGTCCTGG ACG ATTTCTGCAG CCGATTTATTTTGCATATTCCCAGCG AGG 
AAAGAGACAATGCAATCCGAGTGTGTTTTCAGATTGAACTTGCCCATTGGTTTTACTT 
GGATTTCTACATGCAGAACACACCAGGATTACCTCAGTGTGGGATAAGAGACTTTGCT 
AAAGCTCTCTTCAGTCATTGTCCGTTTTTGCTGCCTCAAGGTGAAGATGTGGAAAAAG 
TTTTGGATGAATGGAAGGAATATAAAATGGGAGTACCAACATATGGTGCAATTATTCT 
TGATGAGACACTTGAAAATGTACTACTAGTTCAGGGGTACCTAGCAAAATCAGGCTGG 
GGATTTCCAAAAGGAAAAGTAAATAAAGAAGAAGCTCCTCATGATTGTGCTGCTAGAG 
AGGTATTTGAAGAAACTGGTTTTGATATCAAAGACTATATTTGTAAGGATGATTACAT 
TGAACTTCGAATCAATGACCAGCTTGCTCGTTTGTACATCATTCCAGGAATTCCAAAA 
GACACAAAATTTAACCCAAAAACTAGAAGAGAAATTCGGAACATTGAGTGGTTCTCTA 


i 


ACCTAACAAATTTTTTATGGCCATTCCCTTTATCAGGCCATTAAGGGACTGGCTTTCT 
CGAAGATTTGGCGATTCCTCAGACAGTGACAATGGATTTTCCTCAACTGGTAGCACGC 
CGGCTAAACCCACTGTGGAAAAATTGAGGAGTCGAACCAAATTCCGCCACAGTCAGCA 
GTTATTTCCTGACGGTTCTCCTGGTGACCAGTGGGTAAAGCACAGGCAACCACTGCAG 
CAAAAGCCATATAATAATCATTCTGAAATGTCTGACCTTTTAAAAGGAAAGAATCAAA 
GTATGAGGGGAAATGG CAG AAAACAGTATCAAG ATT CAC CTAATCAAAAG AAAAGAAC 
AAATGGGCTTCAGCCAG C AAAGCAG C AG AATTCTTTG ATGAAGTGTGAAAAGAAACTT 
CATCCACGGAAACTTCAGG AT AATTTTGAAAC AG ATG CTGT AT ATG ACTTG CCT AGCT 
CCAGTGAAGACCAGTTGCTAGAACATGCTGAGGGACAGCCCGTGGCATGTAATGGACA 
TTGCAAGTTCCCCrTTTCATCCAGAGCCTTTTTGAGTTTCAAGTTTGACCATAATGCT 
ATAATG AAAATCTTGG ACCTTTGAT AG CAG CACAT 


j 


ORF Start: ATG at 35 


ORF Stop: TGA at 1298 




SEQ ID NO: 194 


421 aa 


MW at 48626.8kD 


NOV67a, 
CG99832-01 Protein Sequence 


METKRVEIPGSVIiDDFCSRFILHIPSEERDNAIRVCFQIEIiAHWFYIiDFYMQNTPGLP 
QCGIRDFAKALFSHCPFLLPQGEDVEKVLDEWKEYKM 

GYLAKSGWGFPKGKVNKEEAPHDCAAREVFEETGFDI KDYI CKDD Y I ELR I ND QLARL 
YI I PG I PKDTKFNPKTRRE IRN I EWFS I EKLPCHRNDMTPKS KLGLAPNKFFMAI PF I 
RPLRDWLSRRFGDSSDSDNGFSSTGSTPAKPTVEKLRSRTKFRHSQQLFPDGSPGDQW 
VKHRQPLQQKPYNNHS EMSDLUCGKNQSMRGNGRKQYQDS PNQKKRTNGLQPAKQQNS 
LMKCEKKLHPRKLQDNFETDAVYDLPSSSEDQUjEHAEGQPVAQTGHCICFPFSSRAFL 
SFKFDHNAIMKILDI* 



Further analysis of the NOV67a protein yielded the following properties shown in 
Table 67B. 



Table 67B. Protein Sequence Properties NOV67a 


PSort analysis: 


0.4741 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



339 



WO 03/010327 



PCT/US02/14199 



A search of the NOV67a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 67C. 



1 

Table 67C. Geneseq Results for NOV67a 


vjeneseq 
Identifier 


rroiein/xjrganisrn/Lengtn [ratent ??, 
Date] 


NOV67a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB62H4 


JL/i U->vpl Ilia UJClallUgao id puiypcpilUC 

SEQ ID NO 13194 - Drosophila 
melanogaster, 791 aa. 
[WO200171042-A2, 27-SEP-2001] 


205..474 


n 1/970 t&9PA\ 

171/270 (62%) 


4f» 79 


AAM90822 


Human immune/haematopoietic 
antigen SEQ ID NO: 184 1 5 - Homo 
sapiens, 95 aa. [WO200157182-A2, 
09-AUG-2001] 


253..348 
1..92 


91/96 (94%) 
92/96 (95%) 


8e-46 


AAW87487 


S. cerevisiae TIH1 polypeptide - 
Saccharomyces cerevisiae, 595 aa. 
[US5846764-A, 08-DEC-1998) 


12..239 
1 8..235 


82/228 (35%) 
125/228 (53%) 


2e-36 


AAW54167 


S.cerevisiae TIH 1 protein - 
Saccharomyces cerevisiae, 595 aa. 
[WO9813502-A2, 02-APR-1998] 


12..239 
I8..235 


82/228 (35%) 
125/228 (53%) 


2e-36 


AAR81314 


Yeast TIH1 - Saccharomyces 
cerevisiae, 595 aa. [W09519988-A1, 
27-JUL-1995] 


12..239 
18..235 


82/228 (35%) 
125/228 (53%) 


2e-36 



In a BLAST search of public sequence datbases, the NOV67a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 67D. 
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Table 67D. Public BLAST? Results for NOV67a 




Protein 
Accession 
Number 


Protein/Organism/Length 


NOV67a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9CYC6 


j /->UjJ /MUI Kits. rtvJ 1 JtllN - iviub 

musculus (Mouse), 422 aa. 


1..421 
1..422 


387/423 (91%). 
402/423 (94%)' 


0.0 


I Q9VUU4 


CG6169 PROTEIN - Drosophila 
melanogaster (Fruit fly), 791 aa. 


4..261 
205..474 


131/270 (48%) 
171/270 (62%) 


ie-71 


013828 


HYPOTHETICAL 83.2 KDA 
PROTEIN C19A8.12 IN 
CHROMOSOME I - 
Schizosaccharomyces pombe (Fission 
yeast), 741 aa. 


12..242 
11. .236 


109/231 (47%) 
153/231 (66%) 


le-59 


Q948G1 


PUTATIVE ABC TRANSPORTER - 
Oryza sativa (Rice), 1372 aa. 


18..265 
1 14..368 


96/258(37%) 
148/258(57%) 


9e-46 


j E88886 

i 


protein F52G2.1a [imported] - 
Caenorhabditis elegans, 328 aa. 


4..195 
129..323 


89/195 (45%) 
123/195 (62%) 


8e-44 



PFani analysis predicts that the NOV67a protein contains the domains shown in 
the Table 67E. 



Table 67E. Domain Analysis of NOV67a 


Pfam Domain 


NOV67a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


NUDIX 


96..221 


25/137(18%) 
78/137 (57%) 


4.7e-12 



Example 68. 



The NOV68 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 68A. 
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Table 68A. NOV68 Sequence Analysis 



SEQ ID NO: 195 



5474 bp 



NOV68a, 
CG99842-01 DNA Sequence 



TGATCATAAACAATACTTTTGGAGCTGGATTAATATTAAGTCACTTGAGAGATATAGA 
AAATACCATGTTCGTGGTCAACTTTGATTATTTTAGCCGGGTTCATCTGGTTTGCACA 
TTGAGACCGTAATACTTCAGGTTCTAATTGTGTCCAGAAGGCTATGTGTTAACATTCA 
TAGCAAACTGGTTACCTTTATTTCCTGAAAGTTAAATTTTGAAAGAAAGGGATACCAA 
GTGAGAAAGCAACCTTCAGTTAAACAAAGGGGTACATCAAGTTTGATACTCTTGGATT 
AAAAATCAGGTGACAATTGGAACATGGAGGAGAAATTCATGATTATAGAAACCATGAA 
GCCAATCTTTTTTACAACAAAGCTGCTTGTTTTTTTCTTAATCATGTAGTGGCTGCTA 
TCAGGCCAGTTCTGGAACTTCACTGAGTCCCTCTGCTCTTCTGAATCAGCTCTTGATT 
GGTGGTGACTTCTCGGAAGTTCTTATCCCTACCCTCCACCTTCTGCTCCAGGGTGGCA 
CGACTCAAACCTG CCCTACGGCCCATTAG CTCCATTCCATGAAGTTCCTATCCAGTAC 
CAGGCACGTTTCTTCTGCGTCTGTCTGGCTCGCGTGTTTCTTTTTGCCTGTGTCCCGC 
CGGGTCAGGCTCTCCCCGTCTCTCAGACTTTCCCGTCCCCGTGTCCTCCGCGGCCGTC 
TCTCTCCAACTCTCTCCCCCTCTCAGCCACTTCCTGTCTCCAGTATGTGCCTGGCGCG 
GCCCGGCTGGCCGTCTGCGCACCCTCTCTCCCCTCGGCTCTTTCCTAGGAAAGCTGAG 
CCTCATAGCTTCCGGGAGAAGGTTTTCCGGAAGAAAeCTCCAGTCTGTGCAGTATGTA 
AGGTGACCATCGATGGGACAGGCGTTTCGTGCAGAGTCTGCAAGGTGGCGACGCACAG 
AAAATGTGAAGC?UVAGGTGACTTCAGCCTGTCAGGCCTTGCCTCCCGTGGAGTTGCGG 
CGAAACAC^GCCCCAGTCAGGCGCATAGAGCACCTGGGATCCACCAAATCTCTGAACC 
ACT CAAAGCAGCGCAGCACTCTGCCCAGGAGCTT CAGCCTGGACCCGCTCATGGAGCG 
GCGCTGGGACTTAGACCTCACCTACGTGACGGAGGGCATCTTGGCCGCCGCCTTCCCC 
GCGCGGCCCGATGAACAGCGGCACCGGGGCCACCTGCGCGAGCTGGCCCATGTGCTGC 
AATCCAAGCACCGGGACAAGTACCTGCTCTTCAACCTTTCAGAGAAAAGGCATGACCT 
GACCCGCTTAAACCCCAAGGTTCAAGACTTCGGCTGGCCTGAGCTGCATGCTCCACCC 
CTGGACAAGCTGTGCTCCATCTG CAAAGCCATGGAGACATGGCTCAGTG CTG ACCCAC 
AGCACGTGGTCGTACTATACTGCAAGGGAAACAAGGGCAAGCTTGGGGTCATCGTTTC 
TGCCTACATGCACTACAGCAAGATCTCTGCAGGGGCX5GACCAGGCACTGGCCACTCTT 
ACCATGCGGAAATTCTGCG AGGACAAGGTGG CCACAGAACTGCAG CCCTCCCAGCGTC 
GATATATCAGCTACTTCAGTGGGCTGCTATCTGGCTCCATCAGAATGAACAGCAGCCC 
TCTCTTCCTGCACTATGTGCTCATCCCCATGCTGCCAGCCTTTGAACCTGGCACAGGC 
TTCCAGCCCTTCCTTAAAATCTACCAGTCCATGCAGCTTGTCTACACATCTGGAGTCT 
ATCACATTGCAGGCCCTGGTCCCCAGCAGCTTTGCATCAGCCTGGAGCCAGCCCTCCT 
CCTCAAAGGCGATGTCATGGTAACATGTTATCACAAGGGTGGCCGGGGCACAGACCGG 

TCCCCAAGGACCAGCTTGACGAGGCCTGGACTGATGAGAGGTTCCCCTTCCAAGCCTC 
CGTGGAGTTTGTCTTCTCCTCCAGCCCCGAGAAGArCAAAGGCAGCACTCCACGGAAC 
GACCCCTCGGTCTCTGTCGACTACAACACCACTGAGCCAGCCGTGCGCTGGGACTCCT 
ATGAGAACTTCAACCAG CACCACGAGGACAGTGTGG ATGGCTCCTTGACC CACACCCG 
GGGTCCCCTGGATGGCAGTCCTTATGCCCAGGTGCAGCGGCCTCCCCGGCAGACCCCC 



GCCATTCCTCCACGCTGACCACAGAGCCGGCTGCTGAGTCCCCTGGCCGGCCGCCCCC 
TACAGCTGCTGAACGGCAGGAGCTGGATCGCCTCCTAGGAGGCTGCGGAGTGGCCAGT 
GGGGGCCGGGGAGCTGGGCGCGAGACGGCCATCCTAGATGACGAAGAGCAGCCCACTG 
TGGGCGGAGGCCCCCACCTCGGAGTGTATCCAGGCCATAGGCCTGGCCTCAGCCGCCA 
CTGCTCCTGCCGCCAGGGCTACCGGGAGCCCTGCGGGGTTCCCAATGGGGGCTACTAC 
CGGCCAGAGGGAACCCTGGAGAGGAGGCGACTGGCCTACGGGGGCTATGAGGGATCCC 
CCCAGG G CTACG CCGAGGCCTCGATG G AGAAG AGG CGCCTCTG C CG ATCG CTGT CAGA 
GGGGCTATACCC CTAC C CACCTGAG ATGGGGAAACCAG C CACTG GGGACTTTGG CT AC 
CGCGCCCCAGGCTACCGGGAGGTGGTCATCCTGGAGGACCCTGGGCTGCCTGCCCTAT 
ACCCATGCCCAGCCTGCGAGGAGAAGCTGGOGCTGCCTACAGCAGCCTTGTATGGACT 
GCGGCTGGAGAGGGAGGCTGGAGAAGGGTGGGCAAGTGAGGCTGGCAAGCCTCTCCTG 
C^CCCAGTGCGGCCTGGGCACCCXaCTGCCrrCTGCTCT^GCCTGCCTGTGGGCATCACC 
ATGCCCCGATGCCTGACTACAG CTG CCTGAAGCCACCCAAGG CAGG CGAGG AAGGG CA 
CGAGGGCTGCTCCTACACCATGTGCCCCGAAGGCAGGTATGGGCATCCAGGGTACCCT 
G CCCTG GTG ACATACAGCTATGGAGG AG CAGTTCCCAGTTACTG CCCAG CATATGG CC 
GTGTCCCTCATAGCTGTGGCn'CrCCAGGAGAGGGCAGAGGGTATCCCAGCCCTGGTGC 



CCACTCCCCAO^CTGGCTCCAT^ 
AAGCTGAGCTACGAGATCCCTACGGAGGAGGGAGGGGACAGGTACCCATTGCCTGGGC 
ACCTGGCCTQ^CAGGACCTTTGGCATCTGCAGAGTCGCTGGAGCCGGTCTCCTGGAG 
GGAGGGCCCCAGTGGGCACAGCACACTGCCTCGGTCTCCCCGAGATGCCCCATGCAGT 
GCI^CGTC^GAGTTGTCTGGTCCCTCCACGCCCOTGC^^CC^GC^GTCCAGTCCAGG 
GC^GGAAAGCACCCGGCGACAGGAO^CCAGGTCCCCCACCTCAGCGCCCACTCAGAG 
ACTGAGTCCTGGCGAGGCCTTGCCCCCTCTTTCCCAGGCAGGCACCGGAAAGGCCCCT 
GAGCTGCCGTCGGGAAGTGGGCCTGAGCCTCTGGCCCCTAGCCCAGTCTCTCCGACCT 
TCCCTCCCAGCTCGCCCAGTGACTCGCCTCAGGAAAGGAGTCCAGGGGGCCACTCAGA 



TGKSCAAGGCCCTCGAGGCCCCC^CGACAGCCCAGATGGGTCTCCCCTCACTCCTGTGC 
CTTCCCAGATGCXCTX^CITGTGGCC^CCCAGAGCC^ 
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t 


TGCTTTCCCCCTGGCTGCCTCCTATGACACCAATGGCCT 

GAGAAACGCCACCTGCCCGGGCCGGGGCAACAGCCAGGACCCTGGGGCCCAGAGCAGG 


CATCATCGCCAGCCAGAGGCATCAGTCACCATGTCACCTTCGCACCTCTGCTCTCAGA 
TAATGTCCCCOVAACCCCAGAGCCTCCTACACAAGAGAGCCAAAGCAATGTCAAGTTT 
GTCCAGGATACATCCAAGTTCTGGTACAAGCC^CACCTGTCCCGTGACC^GCC^TTG 
CCCTGCTGAAGGACAAGGACCCTGGGGCCTTCCTGATCAGGGACAGTCATTCATTCCA 
AGGAGCTTATGGGCTGGCCCTCAAGGTGGCCACACCGCCACCCAGTGCCCAGCCCTGG 
AAAGGGGACCCCGTGGAACAGCTGGTCCGCCATTTCCTCATCGAGACTGGGCCCAAAG 
GGGTGAAGATCAAGGGCTGCCCC^GTGAGCCCTACTTTGGCAGCCTGTCCGCCTTGGT 
CTCCCAGCACTCCATCTCCCCCATCTCCCTGCCCTGCTGCCTGCGCATTCCCAGCAAA 
GATCCTCTGGAAGAGACCCCAGAGGCTCCAGTGCCCACCAACATGAGCACAGCGGCAG 
ACCTCCTGCGTCAGGGTGCTGCCTGCAGCGTGCTCTACTTGACCTCAGTGGAGACAGA 
GTCACTGACGGGCCCCCAAGCTGTGGCCCGGGCCAGCTCTGCAGCTCTGAGCTGTAGC 
CCCCGCCCGACACCAGCTGTTGTCCACTTCAAGGTGTCAGCCCAGGGCATTACACTGA 
CGGACAACCAAAGGAAGCTCTTCTTTCGCCGCCATTATCCAGTGAACAGCATCACCTT 
CTCCAG CACTGACC CTCAAG ACCGG AG ATGGACCAACCCAGACGGG ACCACCTCCAAG 
ATCTTTGGTTTCGTGGCCAAGAAGCCGGGAAGCCCCTGGGAGAATGTGTGTCACCTCT 
TTGCAGAG CTTGACCCAG ATCAGCCTGCTGGCG CCATTGTCACCTT CATCACC AAAGT 
TCTACTGGGCCAGAGAAAATGAAGGAAGGCCACAAGCTCAGAGCCOVCATCAACACTG 
CCCCCCTCCCAGCACCCCACAGCCCTCACATCCCCTGGCCTGGACCCAGGAGACCCAG 
GAGAAAGCACCCTCCCTTAGGAATGAGGAGTGGGCATCAGGCCTGGGACACTGCTCTC 
CTTCCCCGCCCCCAGCCTGCTAAGTTAAGTGGACAGGCCCACAAGATGACCTTGCATG 
TGAGCAGATGGCAGAGATGGGTGTGTGAGGGGTGAGGAGGCATCAGCAGTTGAGCCCC 
GAAGGAGATCAGGCAGCCCCACCTGCAGGAGAACGTCAGCCCTCCAGGGGATCAGCCC 
CTGCCAGTTCCACCCAG CTGCAGGTGC CAG CACGG CAGGGATGGG AG AGGGGTGGGG A 
GCG AGTCACTG CCTCCTCTG AG GAG AG ATTCAG AGTAGG ATCACATG AATAGGGG AAA 
AAAGAGAGTCTATTTTTGTCTAATAATAAAGAATTTCTATAAACTTTAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA 




ORF Start: ATGat 561 


ORF Stop: TGA at 4950 




SEQIDNO: 196 


1463 aa 


MWat 158629.7RD 


j NOV68a, 
;CG99842-01 Protein Sequence 


MKFLSSTRHVS SAS VWLACFFLPVSRRVRLSPSLRLSRPRVLRGRLS PTLS P SQPLPV 
S SMSLARPGWPSAHPLS PRLFPRKAE PH S FREKVFRKKPPVCAVCKVTIDGTGVS CRV 
CKVATHRKCEAKVTSACQALPPVELRRNTAPVRR I EHLGSTKSLNHS KQRSTLPRS FS 
LDPLMFJIRWDLDLTYVTERILAAAFPARPDEQRHRGHLRELAHVLQSKHRDKYLLFNL 
S E KRHDI/TRLNPKVQDFGWPELHAP PLDKLCS I CKAMETWLS AD PQHVWL YC KGNKG 
KLGVIVSAYMHYSKI SAGADQALATLTMRKFCEDKVATELQP SQRRYI SYFSGLLSGS 
IRMNSSPLFLHYVLIPMLPAFEPGTGFQPFLKIYQSMQLVYTSGVYHIAGPGPQQLCI 
SLE PALLLKGDVMVTCYHKGGRGTDRT LVFRVQFHTCT I HGPQLTFPKDQLDEAWTDE 
RFPFQASVEFVFSSSPEKI KGSTPRND PSVSVDYNTTEPAVRWDS YENFNQHHEDSVD 
GSLTHTRGPLDGSPYAQVQRPPRQTPPAPSPEPPPPPMLSVSSDSGHSSTLTTEPAAE 
SPGRPPPTAAERQELDRLLGGCGVASGGRGAGRETAILDDEEQPTVGGGPHLGVYPGH 
RPGLSRHCSCRQGYREP CX3VPNGGYYRPEGTLERRRLAYGGYEGS PQGYAEASMEKRR 
LCRSLSEGLYPYPPEMGKPATGDFGYRAPGYREVVILEDPGLPALYPCPACEEKLALP 
TAALYGLRLEREAGEGWAS EAGKPLLHPVRPGHPL PLLLPACGHHHAPMPDYS CLKPP 
KAGEEGHEGCS YTMCPEGRYGHPGYPALVTYS YGGAVPS YCPAYGRVPHS CGS PGEGR 
GYPSPGAHSPRAGSISPGS PPYPQSRKLSYEI PTEEGGDRYPLPGHLASAGPLASAES 
LEPVSWREGPSGHSTLPRS PRDAPCSASSELSGPSTPLHTSS PVQGKESTRRQDTRSP 
TSAPTQRLSPGEALPPVSQAGTGKAPELPSGSGPEPLAPSPVSPTFPPSSPSDWPQER 
SPGGHSDGASPRSPVPTTLPGLRHAPWQGPRGPPDSPDGSPLTPVPSQMPWIiVASPEP 
PQSSPTPAFPLAASYDTNGLSQPPLPEKRHIiPGPGQQPGPWGPEQASSPARGISHHVT 
FAPLLSDNVPQTPEPPTQESQSNVKFVQDTSKFWYKPHLSRDQAIALLKDKDPGAFLI 
RDSHSFO^YGLALKVATPPPSAQPWKGDPVEQLVRHFIiIETGPKGVKIKGCPSEPYF 
GSLSALVSQHS I SPI SLPCCLRIPSKDPLEETPEAPVPTNMSTAADLLRQGAACSVLY 
LTS VETESLTGPQAVARAS S AALS CS PRPTPAWHFKVS AQG ITLTDNQRKLFFRRHY 
PVNSITFSSTDPQDRRWTNPDGTTSKIFGFVAKKPGSPWENVCHLFAELDPDQPAGAI 
VTFITKVLLGQRK 



Further analysis of the NOV68a protein yielded the following properties shown in 
Table 68B. 
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Table 68B. Protein Sequence Properties NOV68a 


PSort analysis: 


0.3700 probability located in outside; 0.1900 probability located in lysosome 
(lumen); 0.1080 probability located in nucleus; 0.1000 probability located in 
endoplasmic reticulum (membrane) 


SignalP analysis: 


Cleavage site between residues 28 and 29 



A search of the NOV68a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 68C. 



Table 68C. Geneseq Results for NOV68a 


Geneseq 
Identifier 

i 


Protein/Organism/Length [Patent #, 
Date] 


NOV68a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB40638 

! 

I 


Human ORFX ORF402 polypeptide 
sequence SEQ ID NO:804 - Homo 
sapiens, 1400 aa. [WO200058473-A2, 
05-OCT-2000] 


64.. 1463 
1..I400 


1400/1400 
(100%) 

1400/1400 
(100%) 


0.0 


AAM40312 


Human polypeptide SEQ ID NO 3457 - 
Homo sapiens, 1409 aa. 
[WO200153312-A1, 26-JUL-2001] 


80.. 1463 
26.. 1409 


1383/1384 
. (99%) 
1383/1384 
(99%) 


0.0 


AAU17I96 


Novel signal transduction pathway 
protein, Seq ID 761 - Homo sapiens, 524 
aa. [WO200154733-Al,02-AUG-2001] 


942..1463 
3..524 


522/522(100%) 
522/522(100%) 


0.0 


AAU 17573 


Novel signal transduction pathway 
protein, Seq ID 1 138 - Homo sapiens, 
522 aa. [WO200154733-A1, 02-AUG- 
2001] 


942..1463 
1..522 


521/522(99%) 
521/522(99%) 


0.0 


AAB42301 


Human ORFX ORF2065 polypeptide 
sequence SEQ ID NO:4130 - Homo 
sapiens, 523 aa. [WO200058473-A2, 
05-OCT-2000] 


149..582 
48..492 


232/445 (52%) 
311/445(69%) 


e-131 



In a BLAST search of public sequence datbases, the NOV68a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 68D. 



344 



WO 03/010327 



PCT/US02/14199 



Table 68D. Public BLASTP Results for NOV68a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV68a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UPS7. 


KIAA1075 PROTEIN - Homo 
sapiens (Human), 1400 aa (fragment). 


64.. 1463 
1..I400 


1399/1400 

(99%) 
1399/1400 


0.0 


Q96P25 


TENSIN2 - Homo sapiens (Human), 
1285 aa. 


179..1463 
1 1285 


1284/1285 
(99%^ 

1284/1285 
(99%) 


0.0 


Q9NT29 


HYPOTHETICAL 68.7 KDA 
PROTEIN - Homo sapiens (Human), 
649 aa (fragment). 


813..1463 
1..649 


648/651 (99%) 
648/651 (99%) 


0.0 


| AAH25818 

i 


SIMILAR TO K1AA1075 PROTEIN 
- Mus musculus (Mouse), 655 aa 
(fragment). 


808.. 1463 
2..655 


564/663 (85%) 
580/663 (87%) 


0.0 


1 A54970 

i 


tensin, cardiac muscle - chicken, 1744 
aa. 


149-1 140 
31..1088 


382/1 1 1 1 (34%) 
513/1111 (45%) 


e-I39 



PFam analysis predicts that the NOV68a protein contains the domains shown in 
the Table 68E. 



Table 68E. Domain Analysis of NOV68a 


Pfam Domain 


NOV68a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


DAG_PE-bind 


86..133 


16/51 (31%) 
33/51 (65%) 


3e-05 


PHD 


99..136 


9/51 (18%) 
20/51 (39%) 


022 


SH2 


1194.. 1286 


28/97 (29%) 
66/97 (68%) 


5.4e-15 



Example 69. 



The NOV69 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 69A. 
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Table 69 A. NOV69 Sequence Analysis 




SEQIDNO:197 j 1502 bp 


NOV69a, 
CG99944-01 DNA Sequence 

i 

f 

i 

i 

i 


GAATAAGGAGGAAGAATCATGAATGAAAA AfiTr'&T&cn a MYirana ^ ^^jy^CTlTlTlC 

AATTTGGAACATTCAAGG CAAATGACAACATCAATTTACAAGTAAAGG CAGG AG AGAT 

TCATG CGTTGCTTGGAGAAAACGGTGCTGGCAAATCTACATTGATGAACGTG CTTTCC 

GGATTATTAGAGCCGACATCAGGGAAAATTTTGATGCGTGGGAAAGAAGTACAGATCA 

CAAGCCCGACAAAAG CCAATCAATTAGGG ATTGGG ATGGTCCATCAGCACTTTATG CT 

TGTTGATGCCTTTACTGTAACAGAAAACATCGTGTTGGGAAGCGAACCTAGTCGTGCA 

GGGATGCTTGACCATAAAAAAGCGCGAAAAGAGATCCAAAAAGTTTCTGAACAATATG 

GATTATC^GTCAACCCGGATGCTTATGTTCGTGATATTTCAGTTGGGATGGAACAACG 

GGTAGAAATTTTAAAAACACTTTACCGAGGAGCAGATGTACTGATTTTTGATGAGCCG 

ACAGCTGTATTGACCCCTCAGGAAATTGATGAATTAATCGTGATCATGAAGGAATTAG 

TCAAAGAAGr*PA&(^Tr , aaTr*fl'TT*FT , r*7v'rwT'-A/^"/-'/-iTv»ri7» * ^ mm - „ 

^rtnnu AAvaouwu i l ah i la 1 1 1 lTJATTACG CAT AAGTTAGATGAAATCAAAG CAGT 

AGCTGACCGTTGTACAGTTATCCGCCGTGGAAAAGGAATCGGTACAGTCAACGTTAAA 
GACGTTACCTCACAGCAATTAGCTGATATGATGGTCGGAAGAGCGGTTTCATTCAAAA 
CGATGAAAAAAGAAGCGAAGCCTCAAGAAGTCGTTTTGTeTATTGAAAATCTAGTGGT 
AAAAGAAAATCGTGGATTAGAAGCCGTGAAAAACCTGAACTTAGAGGTTCGTGCTGGC 
GAAGTACTTGGTATCGCTGGAATCGATGGAAACGGGCAGTCGGAGTTGATCCAAGCTT 
TGACTGGTTTGCGAAAGGCAGAAAGCGGACATATCAAGCTAAAAGGGGAAGACATCAC 
CAATAAAAAACCTCGAAAGATCACTGAACATGGTGTAGGACATGTGCCAGAAGACCGT 
CATAAATACGGGTTGGTCCTAGATATGACATTGTCTGAAAACATTGCCCTGCAAACGT 
ATCATCAAAAACCTTACAGTAAAAACGGTATGCTGAATTATTCAGTGATAAATGAACA 
TGCCAGAGAATTGATCGAAGAATATGATGTTCGAACAACGAATGAACTTGTTCCTGCA 
AAAGCTTTATCAGGCGGAAATCAGCAAAAAGCAATCATCGCTCGGATAGTCGACCGAG 
ATCCTGATCTGTTGATCGTTG CAAATC CAACTCGTGGGCTGGATGTAGGTGCGATCGA 
ATTTATTCATAAACGTCTGATCGAACAAAGGGACAAATACAAAGCAGTGTTATTGATT 
AGTTTCGAATTAGAAGAAATTTTAAATGTTTCGGATCGTATTGCTGTTATCCATGAAG 
GAGAAATCGTCGGGATCGTTGATCCGAAAGAAACATCTGAAAATTAATAGGG 




ORF Start: ATG at 19 


ORF Stop: TAA at 1495 


! 


SEQIDNO: 198 


492 aa MW at 54439.3kD 


NOV69a, 
CG99944-01 Protein Sequence 


MNEKVIEMQNITKEFGTFKANDNINLQVKAGEIHALI/3ENGAGKSTLMNVLSGLLEPT 
SGKILMRGKEVQITSPTKANQLG I GMVHQHFMLVDAFTVTENI VLGSEPSRAGMLDHK 
KARKEI QKVSEQYGLSVNPDAYVRDIS VGMEQRVEI LKTLYRGADVLI FDEPTAVLTP 
QEIDELIVIMKELVKEGKSIILITHKLDEIKAVADRCTVIRRGKGIGTVNVKDVTSQQ 
LADMMVGRAVS FKTMKKEAKPQ EWLS I ENL WKENRGLEAVKNLNLEVRAGEVLGI A 
GIDGNGQSELIQALTGIiRKAESGHIKLKGEDITNKKPRKITEHGVGHVPEDRHKYGLV 
LDMTLSENLALQTYHQKPYSKNGMLNYSVINEHARELIEEYDVRTTNELVPAKALSGG 
NQQKAI IARI VDRDPDLLIVANPTRGLDVGAI EFIHKRL I EQRDKYKAVLL I S F ELEE 
ILNVSDRIAVIHEGEIVGIVDPKETSEN 



Further analysis of the NOV69a protein yielded the following properties shown i 
Table 69B. 



Table 69B. Protein Sequence Properties NOV69a 


PSort analysis: 


0.6000 probability located in nucleus; 0.1000 probability located in mitochondrial 
matrix space; 0.1 000 probability located in Iysosome (lumen); 0.0000 probability 
located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV69a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 69C. 
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Table 69C. Geneseq Results for NOV69a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV69a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAE 16766 

i 
i 


Human transporter and ion channel- 

3 (TRICH-3) protein - Homo 
sapiens, 485 aa. [WO2001 92304- 
A2,06-DEC-2001] 


S..492 
1..485 


485/485(100%) 
485/485(100%). 


0.0 


AAU35015 


Enterococcus faecalis cellular 
proliferation protein #302 - 
Enterococcus faecalis, 513 aa. 
[ WO200 1 70955-A2, 27-SEP-200 1 ] 


8..492 
I. .485 


395/485 (81%) 
450/485 (92%) 


0.0 


AAU35015 


Enterococcus faecalis cellular 
proliferation protein #302 - 
Enterococcus faecalis, 513 aa. 
[ WO200 1 70955-A2, 27-SEP-200 1 ] 


8..492 
1..485 


395/485 (81%) 
450/485 (92%) 


0.0 


j AAU38034 

; 
1 
i 


Streptococcus pneumoniae cellular 

proliferation protein #463 - 
Streptococcus pneumoniae, 51 1 aa. 
[ WO200 1 70955-A2, 27-SEP-2001 ] 


2..491 
3. .492 


352/490 (71%) 
424/490 (85%) 


0.0 


1 AAU37574 

i ! 

: J 


Streptococcus pneumoniae cellular 

proliferation protein #3 - 
Streptococcus pneumoniae, 511 aa. 
[WO200170955-A2, 27-SEP-2001] 


2..491 
3..492 


352/490 (71%) 
424/490 (85%) 


0.0 



In a BLAST search of public sequence datbases, the NOV69a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 69D. 
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Table 69D. Public BLASTP Results for NOV69a 


Protein 
Accession 
Number 

i 


Protein/Organism/Length 


N0V69a Residues/ 
Match Residues 


Identities/ 
Similaritips fnr 

OlllIJ J til IllUD Jul 

the Matched 
Portion 


Value 


» AARTQCK^? 
\ r\/-\r\.yy j 

I 


ABC TRANSPORTER ATP- 
BINDING PROTEIN- 

RIBOSE/GALACTOSE 
TRANSPORT - Streptococcus 

pneumoniae R6, 511 aa. 


2..49I 
3. .492 


352/490 (71%) 


0.0 


Q97RG9 


SUGAR ABC TRANSPORTER, 

ATP-BINDING PROTEIN - 
Streptococcus pneumoniae, 51 1 
aa. 


2..491 
3 492 


352/490 (71%) 

494/400 rtKO/A 


0.0 




PUTATIVE SUGAR ABC 
TRANSPORTER (ATP- 
BINDING PROTEIN)- 
Streptococcus pyogenes, 510 aa. 


1..491 
1..491 


347/491 (70%) 
424/491 (85%) 


0.0 


\ AAL97795 

i 

! 

! 

! 


PUTATIVE SUGAR ABC 
TRANSPORTER (ATP- 
BINDING PROTEIN) - 
Streptococcus pyogenes 
MGAS8232,5I0 aa. 


1..491 
1 ..49 1 


347/491 (70%) 
424/491 (85%) 


0.0 


Q8Y7A1 


HYPOTHETICAL PROTEIN 
LM01389- Listeria 
monocytogenes, 513 aa. 


5.491 
4..490 


345/487 (70%) 
407/487 (82%) 


0.0 



PFam analysis predicts that the NOV69a protein contains the domains shown in 
the Table 69E. 



Table 69E. Domain Analysis of NOV69a 


Pfam Domain 


N0V69a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


ABCjran 


31..217 


73/201 (36%) 
144/201 (72%) 


5.8e-49 


ABCjran 


284..478 


59/207 (29%) 
134/207(65%) 


7.4e-I8 



Example 70. 



The NOV70 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 70A. 
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Table 70A. NOV70 Sequence Analysis 




SEQIDNO: 199 




689 bp 


NOV70a, 
CG99963-01 DNA Sequence 

: 


AGCGTGAGCCACCGCACCTGGCCTATTATTGTTTTGTTAAAAGCAGGGTCTCGGCTTT 
GCAAATGCAGCCACCGACGCCAGGAGCTCTGTACGATCAGCCATGGTTAACCCCACCG 
TGTTCGACATCGCTGTCGACGGCAAGCCCTTGGGCCGCGTGTCCTTCGAGCCGTTTGC 
AGACAAGGTTCCAAAGGCAGCAGAAAACTTTCGTGCTCTGAGCAATGTAGAGAAAGGA 
TTTGGTTATAAAGGTTCCTGCTTTCACAGAATTATTCCAGGGTTTATGTGTCAGGGTG 
GTGACTTCACATGCCATAATGGCACTGGTGGCAAGTACACCTGCGGGGAGAAATTTGA 
TGACGAGAG CTTCGTCCTAAAG CATACAC ATCCTGGCATCT TGTCCATGG CAAATGCT 
GGACCCAACACAAATGGTTCC CAGTGTTTCATCTGC ACTGC CAAGACTGAGTGGCTGG 
ATGG CAATCCTGTGGTCTTTGG CAAGGTGAAAGAAGG CATG AATATTGTGGAGG CCAT 
GGGGCACTTTGGGTCCGAGAATGGCAAGACCAGCAAGAAGATCACCATTGCTGACTGT 
GGACAACTCTAACAAGTTTGG CTTTTGTTTTATCTTAACC ACCAG AC CATTCCTTTTG 
TAGCTCAGAAGAGCACCCCTTC^CCCCATTTGCTTGCAGTATTTTTTTTTT 




ORF Start: ATGatlOl 


ORF Stop: TAA at 590 




SEQ ID NO: 200 


163 aa 


MWat I7503.8kD 


NOV70a, 
CG99963-01 Protein Sequence 


MVNPTVFDI AVDGKPLGRVS FEPFADKVPKAAENFRALSNVE KGFGYKGSCFHRI I PG 
FMCQGGDFTCHNGTGGKYTCGEKFDDES FVLKHTHPG ILSMANAGPNTNGSQCFICTA 
KT EWLDGN PWFG KVKEGMN I VE AMGHFGS ENGKTS KKIT I ADCGQL 


{ ■ 


SEQ ID NO: 201 




596 bp 


i NOV70b, 

] CG99963-02 DNA Sequence 

? 

! 

1 

I 


TTGTTTTGTTAAAAGCAGGGTCTCGGCTTTGCAAATGCAGCCACCGACGCCAGGAGCT 
CTGTACG ATCAG CCATGGTT AACCCCACCGTGTTCGACATCG CTGTCG ACGGC A AGP r 


CTTGGACCGCGTGTCCITCGAGCCGTTTGCAGACAAGGTTCCAAAGGC^GCAGAAAAC 
TTTCGTGCTCTGAGCAATGTAGAGAAAGGATTTGGTTATAAAGGTTCCTGCTTTCACA 
GAATTATTCCAGGGTTTATGTGTCAGGGTGGTGACTTCACATGCCATAATGGCACTGG 
TGGCAAGTACACCTG CGGGGAGAAATTTGATG ACGAG AGCTTCGTCCTAAAG CAT ACA 
C^TCCTGGCATCTTGTCCATGGCAAATGCTGGACCCAACACAAATGGTTCCCAGTGTT 
TCATCTGCACTGCCAAGACTGAGTGGCTGGATGGCAATCCTGTGGTCTTTGGCAAGGT 
GAAAGAAGGCATGAATATTGTGGAGGCCATGGGGCACTTTGGGTCCGAGAATGGCAAG 
ACCGGCAAGAAGATCACCATTGCTGACTGTGGACAACTCTAACAAGTTTGGCTTTTGT 
TTTATCTTAACCACCA 


j 1 ' ' i ... — 


ORF Start: ATG at 73 j 


ORF Stop: TAA at 562 




SEQ ID NO: 202 j 


163 aa 


- • MWat 17531.8kD 


NOV70b, 
CG99963-02 Protein Sequence 


MVNPTVFDIAVDGKPLDRVSFEPFADKVPKAAENFRALSNVEKGFGYKGSCFHRI I PG 
FMCC^DF^CHNGTGGKOTCGE KFDDES FVLKHTHPG I LSMANAGPNTOGSQCFICTA 
KTEWLDGNPWFGKVKEGMNIVEAMGHFGSENGKTGKKITIADCGQL 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 70B. 



Table 70B. Comparison of NOV70a against NOV70b. 


Protein Sequence 


NOV70a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV70b 


1-163 
1..163 


161/163 (98%) 
161/163(98%) 



Further analysis of the NOV70a protein yielded the following properties shown in 
Table 70C. 
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Table 70C. Protein Sequence Properties NOV70a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.5689 probability located in 
microbody (peroxisome); 0.4500 probability located in cytoplasm; 0.1000 
probability located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV70a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 70D. 



Table 70D. Geneseq Results for NOV70a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV70a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU0I195 


Human cyclophilin A protein - Homo 
sapiens, 165 aa. [WO2001 32876- A2, 
10-MAY-2001] 


K.163 
1..164 


144/164 (87%) 
148/164 (89%) 


le-79 


AAW56028 


Calcineurin protein - Mammalia, 165 aa. 
[WO9808956-A2, 05-MAR-1998] 


L.I63 
1..164 


144/164 (87%) 
148/164 (89%) 


le-79 


AAR13726 


Bovine cyclophilin - Bos taurus, 163 aa. 
[US5047512-A, I0-SEP-199I] 


2..163 
1..163 


143/163 (87%) 
147/163 (89%) 


2e-79 


AAG65275 


Haematopoietic stem cell proliferation 
agent related human protein #2 - Homo 
sapiens, 164 aa. [JP2001 163798-A, 19- 
JUN-2001] 


2.. 163 
1..163 


143/163 (87%) 
147/163 (89%) 


4e-79 


AAP90431 . 


Cyclophilin - Homo sapiens (human), 
164 aa. [EP326067-A, 02-AUG-1989] 


2..163 
1..163 


143/163 (87%) 
147/163 (89%) 


4e-79 



In a BLAST search of public sequence datbases, the NOV70a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 70E. 
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Table 70E. Public BLASTP Results for NOV70a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV70a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Exnect 
Value 


CAC39529 


SEQUENCE 26 FROM PATENT 
WOO 132876 - Homo sapiens (Human), 
1 65 aa. 


1..163 
1.-164 


144/164 (87%) 
148/164 (89%) 


2e-79 


Q9TTC6 


CYCLOPHILIN 18-OryctoIagus 
cuniculus (Rabbit), 164aa. 


1..163 
1..164 


143/164 (87%) 
150/164(91%) 


4e-79 


Q9BRU4 

1 


PEPTI D YLPROL YL ISOMERASE A 
(CYCLOPHILIN A) - Homo sapiens 
(Human), 165 aa. 


1..163 
1..164 


144/164 (87%) 
147/164 (88%) 


5e-79 


P04374 


Peptidyl-prolyl cis-trans isomerase A 
(EC 5.2.1 .8) (PPlase) (Rotamase) 
(Cyclophilin A) (Cyclosporin A- 
binding protein) - Bos taurus (Bovine), 
and, 163 aa. 


2.. 163 
1..163 


143/163 (87%) 
147/163 (89%) 


5e-79 


I P05092 

i 

i 

! 

i 


Peptidyl-prolyl cis-trans isomerase A 
(EC 5.2.1.8) (PPlase) (Rotamase) 
(Cyclophilin A) (Cyclosporin A- 
binding protein) - Homo sapiens 
(Human),, 1 64 aa. 


2.. 163 
1..163 


143/163 (87%) 
147/163 (89%) 


9e-79 



PFam analysis predicts that the NOV70a protein contains the domains shown in 
the Table 70F. 



Table 70F. Domain Analysis of NOV70a 


Pfam Domain 


NOV70a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


projsomerase 


7.. 163 


V 108/179 (60%) 
•\ 140/179 (78%) 


3.7e-87 



Example B: Sequencing Methodology and Identification of NOVX Clones 

1. GeneGalling™ Technology: This is a proprietary method of performing 
differential gene expression profiling between two or more samples developed at CuraGen 
and described by Shimkets, et al., "Gene expression analysis by transcript profiling 
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5 



coupled to a gene database query" Nature Biotechnology 17:198-803 (1999). cDNA was 
derived from various human samples representing multiple tissue types, normal and 
diseased states, physiological states, and developmental states from different donors. 
Samples were obtained as whole tissue, primary cells or tissue cultured primary cells or 
cell lines. Cells and cell lines may have been treated with biological or chemical agents 
that regulate gene expression, for example, growth factors, chemokines or steroids. The 
cDNA thus derived was then digested with up to as many as 120 pairs of restriction 
enzymes and pairs of linker-adaptors specific for each pair of restriction enzymes were 
ligated to the appropriate end. The restriction digestion generates a mixture of unique 
cDNA gene fragments. Limited PCR amplification is performed with primers 
homologous to the linker adapter sequence where one primer is biotinylated and the other 
is fluorescently labeled. The doubly labeled material is isolated and the fluorescently 
labeled single strand is resolved by capillary gel electrophoresis. A computer algorithm 
compares the electropherograms from an experimental and control group for each of the 
restriction digestions. This and additional sequence-derived information is used to predict 
the identity of each differentially expressed gene fragment using a variety of genetic 
databases. The identity of the gene fragment is confirmed by additional, gene-specific 
competitive PCR or by isolation and sequencing of the gene fragment. 

2. SeqCalling™ Technology: cDNA was derived from various human samples 
representing multiple tissue types, normal and diseased states, physiological states, and 
developmental states from different donors. Samples were obtained as whole tissue, 
primary cells or tissue cultured primary cells or cell lines. Cells and cell lines may have 
been treated with biological or chemical agents that regulate gene expression, for example, 
growth factors, chemokines or steroids. The cDNA thus derived was then sequenced using 
CuraGen's proprietary SeqCalling technology. Sequence traces were evaluated manually 
and edited for corrections if appropriate. cDNA sequences from all samples were 
assembled together, sometimes including public human sequences, using bioinformatic 
programs to produce a consensus sequence for each assembly. Each assembly is included 
in CuraGen Corporation's database. Sequences were included as components for assembly 
when the extent of identity with another component was at least 95% over 50 bp. Each 
assembly represents a gene or portion thereof and includes information on variants, such 
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as splice forms single nucleotide polymorphisms (SNPs), insertions, deletions and other 
sequence variations. 

3. PathCalling™ Technology: 

5 The NOVX nucleic acid sequences are derived by laboratory screening of cDNA 

library by the two-hybrid approach. cDNA fragments covering either the full length of the 
DNA sequence, or part of the sequence, or both, are sequenced. In silico prediction was 
based on sequences available in CuraGen Corporation's proprietary sequence databases or 
in the public human sequence databases, and provided either the full length DNA 
1 0 sequence, or some portion thereof. 

The laboratory screening was performed using the methods summarized below: 

cDNA libraries were derived from various human samples representing multiple 
tissue types, normal and diseased states, physiological states, and developmental states 
from different donors. Samples were obtained as whole tissue, primary cells or tissue 

15 cultured primary cells or cell lines. Cells and cell lines may have been treated with 

biological or chemical agents that regulate gene expression, for example, growth factors, 
chemokines or steroids. The cDNA thus derived was then directionally cloned into the 
appropriate two-hybrid vector (Gal4-activation domain (Gal4-AD) fusion). Such cDNA 
libraries as well as commercially available cDNA libraries from Clontech (Palo Alto, CA) 

20 were then transferred from Exoli .into a CuraGen Corporation proprietary yeast strain 
(disclosed in U. S. Patents 6,057,101 and 6,083,693, incorporated herein by reference in 
their entireties). 

Gal4-binding domain (Gal4-BD) fusions of a CuraGen Corportion proprietary 
library of human sequences was used to screen multiple Gal4-AD fusion cDNA libraries 
25 resulting in the selection of yeast hybrid diploids in each of which the Gal4-AD fusion 
contains an individual cDNA. Each sample was amplified using the polymerase chain 
reaction (PCR) using non-specific primers at the cDNA insert boundaries. Such PCR 
product was sequenced; sequence traces were evaluated manually and edited for 
corrections if appropriate. cDNA sequences from all samples were assembled together, 
30 sometimes including public human sequences, using bioinformatic programs to produce a 
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consensus sequence for each assembly. Each assembly is included in CuraGen 
Corporation's database. Sequences were included as components for assembly when the 
extent of identity with another component was at least 95% over 50 bp. Each assembly 
represents a gene or portion thereof and includes information on variants, such as splice 
5 forms single nucleotide polymorphisms (SNPs), insertions, deletions and other sequence 
variations. 

Physical clone: the cDNA fragment derived by the screening procedure, covering 
the entire open reading frame is, as a recombinant DNA, cloned into pACT2 plasmid 
(Clontech) used to make the cDNA library. The recombinant plasmid is inserted into the 
10 host and selected by the yeast hybrid diploid generated during the screening procedure by 
the mating of both CuraGen Corporation proprietary yeast strains N106 1 and YULH (U. S. 
Patents 6,057,101 and 6,083,693). 

4. RACE: Techniques based on the polymerase chain reaction such as rapid 

1 5 amplification of cDNA ends (RACE), were used to isolate or complete the predicted 

sequence of the cDNA of the invention. Usually multiple clones were sequenced from one 
or more human samples to derive the sequences for fragments. Various human tissue 
samples from different donors were used for the RACE reaction. The sequences derived 
from these procedures were included in the SeqCalling Assembly process described in 

20 preceding paragraphs. 

5. Exon Linking: The NOVX target sequences identified in the present invention 
were subjected to the exon linking process to confirm the sequence. PCR primers were 
designed by starting at the most upstream sequence available, for the forward primer, and 

25 at the most downstream sequence available for the reverse primer. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
sequence, until a suitable sequence that is either unique or highly selective was 
• encountered, or, in the case of the reverse primer, until the stop codon was reached. Such 
primers were designed based on in silico predictions for the full length cDNA, part (one or 

30 more exons) of the DNA or protein sequence of the target sequence, or by translated 

homology of the predicted exons to closely related human sequences from other species. 
These primers were then employed in PCR amplification based on the following pool of 
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human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal brain, fetal 
kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, pancreas, 
pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, spinal 
cord, spleen, stomach, testis, thyroid, trachea, uterus. Usually the resulting amplicons 
were gel purified, cloned and sequenced to high redundancy. The PCR product derived 
from exon linking was cloned into the pCR2.1 vector from Invitrogen. The resulting 
bacterial clone has an insert covering the entire open reading frame cloned into the pCR2.1 
vector. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence 
traces were evaluated manually and edited for corrections if appropriate. These procedures 
provide the sequence reported herein. 

6. Physical Clone: 

Exons were predicted by homology and the intron/exon boundaries were 
determined using standard genetic rules. Exons were further selected and refined by means 
of similarity determination using multiple BLAST (for example, tBlastN, BlastX, and 
BlastN) searches, and, in some instances, GeneScan and GraiL Expressed sequences from 
both public and proprietary databases were also added when available to further define and 
complete the gene sequence. The DNA sequence was then manually corrected for apparent 
inconsistencies thereby obtaining the sequences encoding the full-length protein. 

The PCR product derived by exon linking, covering the entire open reading frame, 
was cloned into the pCR2.1 vector from Invitrogen to provide clones used for expression 
and screening purposes. 

Example C: Quantitative expression analysis of clones in various cells and tissues 

The quantitative expression of various clones was assessed using microtiter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines 
and tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on an 
Applied Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence 
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Detection System. Various collections of samples are assembled on the plates, and 
referred to as Panel 1 (containing normal tissues and cancer cell lines), Panel 2 (containing 
samples derived from tissues from normal and cancer sources), Panel 3 (containing cancer 
cell lines), Panel 4 (containing cells and cell lines from normal tissues and cells related to 
inflammatory conditions), Panel 5D/5I (containing human tissues and cell lines with an 
emphasis on metabolic diseases), AI_comprehensive_panel (containing normal tissue and 
samples from autoimmune diseases), Panel CNSD.01 (containing central nervous system 
samples from normal and diseased brains) and CNS_neurodegeneration_panel (containing 
samples from normal and Alzheimer's diseased brains). 

RNA integrity from all samples is controlled for quality by visual assessment of 
agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio 
as a guide (2:1 to 2.5:1 28s:18s) and the absence of low molecular weight RNAs that 
would be indicative of degradation products. Samples are controlled against genomic 
DNA contamination by RTQ PGR reactions run in the absence of reverse transcriptase 
5 using probe and primer sets designed to amplify across the span of a single exon. 

First, the RNA samples were normalized to reference nucleic acids such as 
constitutively expressed genes (for example, P-actin and GAPDH). Normalized RNA (5 
ul) was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master 
Mix Reagents (Applied Biosystems; Catalog No. 4309169) and gene-specific primers 
according to the manufacturer's instructions. 

In other cases, non-normalized RNA samples were converted to single strand 
cDNA (sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 18064-147) 
and random hexamers according to the manufacturer's instructions. Reactions containing 
up to 10 ug of total RNA were performed in a volume of 20 ul and incubated for 60 
minutes at 42°C. This reaction can be scaled up to 50 ug of total RNA in a final volume of 
100 ul. sscDNA samples are then normalized to reference nucleic acids as described 
previously, using IX TaqMan® Universal Master mix (Applied Biosystems; catalog No. 
4324020), following the manufacturer's instructions. 

Probes and primers were designed for each assay according to Applied Biosystems 
Pnmer Express Software package (version I for Apple Computer's Macintosh Power PC) 
or a similar algorithm using the target sequence as input Default settings were used for 
reaction conditions and the following parameters were set before selecting primers: primer 
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concentration = 250 nM, primer melting temperature (Tm) range - 58°-60°C, primer 
optimal Tm = 59°C, maximum primer difference = 2°C, probe does not have 5'G, probe 
Tm must be 10°C greater than primer Tm, amplicon size 75bp to lOObp. The probes and 
primers selected (see below) were synthesized by Synthegen (Houston, TX, USA). Probes 
were double purified by HPLC to remove uncoupled dye and evaluated by mass 
spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' ends of the 
probe, respectively. Their final concentrations were: forward and reverse primers, 900nM 
each, and probe, 200nM. 

PCR conditions: When working with RNA samples, normalized RNA from each 
tissue and each cell line was spotted in each well of either a 96 well or a 384-well PCR 
plate (Applied Biosystems). PCR cocktails included either a single gene specific probe and 
primers set, or two multiplexed probe and primers sets (a set specific for the target clone 
and another gene-specific set multiplexed with the target probe). PCR reactions were set 
up using TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No. 
4313803) following manufacturer's instructions. Reverse transcription was performed at 
48°C for 30 minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then 
40 cycles of 95°C for 15 seconds, 60°C for 1 minute. Results were recorded as CT values 
(cycle at which a given sample crosses a threshold level of fluorescence) using a log scale, 
with the difference in RNA concentration between a given sample and the sample with the 
lowest CT value being represented as 2 to the power of delta CT The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and 
multiplying by 100. 

When working with sscDNA samples, normalized sscDNA was used as described 
previously for RNA samples. PCR reactions containing one or two sets of probe and 
primers were set up as described previously, using IX TaqMan® Universal Master mix 
(Applied Biosystems; catalog No. 4324020), following the manufacturer's instructions. 
PCR amplification was performed as follows: 95°C 10 min, then 40 cycles of 95°C for 15 
seconds, 60°C for 1 minute. Results were analyzed and processed as described previously. 
Panels 1, 1.1, 1.2, and 13D 

The plates for Panels 1, 1.1, 1.2 and 1.3D include 2 control wells (genomic DNA 
control and chemistry control) and 94 wells containing cDNA from various samples. The • 
samples in these panels are broken into 2 classes: samples derived from cultured cell lines 
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and samples derived from primary normal tissues. The cell lines are derived from cancers 
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate 
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, 
gastric cancer and pancreatic cancer. Cell lines used in these panels are widely available ' 
through the American Type Culture Collection (ATCC), a repository for cultured cell 
lines, and were cultured using the conditions recommended by the ATCC. The normal 
tissues found on these panels are comprised of samples derived from all major organ 
systems from single adult individuals or fetuses. These samples are derived from the 
following organs: adult skeletal muscle, fetal skeletal muscle, adult heart, fetal heart, adult 
kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various regions of the 
brain, the spleen, bone marrow, lymph node, pancreas, salivary gland, pituitary gland, 
adrenal gland, spinal 6ord, thymus, stomach, small intestine, colon, bladder, trachea, ' 
breast, ovary, uterus, placenta, prostate, testis and adipose. 



In the results for Panels 1, 1.1, 1.2 and 1 3D, the following abbreviations are used: 
ca. = carcinoma, 
* = established from metastasis, 
met = metastasis, 
s cell var = small cell variant, 
non-s = non-sm = non-small, 
squam = squamous, 
pi. eff = pi effusion = pleural effusion, 
glio = glioma, 
astro = astrocytoma, and 
25 neuro = neuroblastoma. 

General_ S creening_paneI_vl.4 and General_ S creening_panel_vl.5 

The plates for Panels 1.4 and 1.5 include 2 control wells (genomic DNA control 
and chemistry control) and 94 wells containing cDNA from various samples. The samples 
in Panels 1.4 and 1.5 are broken into 2 classes: samples derived from cultured cell lines 
and samples derived from primary normal tissues. The cell lines are derived from cancers 
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate 
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, 
gastric cancer and pancreatic cancer. Cell lines used in Panel 1 .4 are widely available 
through the American Type Culture Collection (ATCC), a repository for cultured cell 
lines, and were cultured using the conditions recommended by the ATCC. The normal 
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tissues found on Panels 1.4 and 1.5 are comprised of pools of samples derived from all 
major organ systems from 2 to 5 different adult individuals or fetuses. These samples are 
derived from the following organs: adult skeletal muscle, fetal skeletal muscle, adult heart, 
fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various 
5 regions of the brain, the spleen, bone marrow, lymph node, pancreas, salivary gland, 
pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, colon, 
bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. Abbreviations 
are as described for Panels 1, 1.1, 1.2, and 1.3 D. 

Panels 2D and 2.2 

1 0 The plates for Panels 2D and 2.2 generally include 2 control wells and 94 test 

samples composed of RNA or cDNA isolated from human tissue procured by surgeons 
working in close cooperation with the National Cancer Institute's Cooperative Human 
Tissue Network (CHTN) or the National Disease Research Initiative (NDRI). The tissues 
are derived from human malignancies and in cases where indicated many malignant 

15 tissues have "matched margins" obtained from noncancerous tissue just adjacent to the 
tumor. These are termed normal adjacent tissues and are denoted "NAT" in the results 
below. The tumor tissue and the "matched margins" are evaluated by two independent 
pathologists (the surgical pathologists and again by a pathologist at NDRI or CHTN). This 
. analysis provides a gross histopathological assessment of tumor differentiation grade. 
20 Moreover, most samples-include the original surgical pathology report that provides 

information regarding the clinical stage of the patient. These matched margins are taken 
from the tissue surrounding (i.e. immediately proximal) to the zone of surgery (designated 
"NAT", for normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples 
were obtained from various human tissues derived from autopsies performed on elderly 
25 people or sudden death victims (accidents, etc.). These tissues were ascertained to be free 
of disease and were purchased from various commercial sources such as Clontech (Palo 
Alto, CA), Research Genetics, and Invitrogen. 

Panel 3D 

The plates of Panel 3D are comprised of 94 cDNA samples and two control 
30 samples. Specifically, 92 of these samples are derived from cultured human cancer cell 
lines, 2 samples of human primary cerebellar tissue and 2 controls. The human cell lines 
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are ge^rafly obtained from ATCC (American Type Cutare Collection), NCI or the 
German ttanor cell bank and fall into the flowing tissue groups, S^ous ceU 
eare.no.na of the tongue, breast eancer, prostate eancer, melaooma, epidermoid carcinoma, 
sarcomas, bladder carcinomas, pancreatic cancers, kidney cancer, leuke-mas/lymphomas 
ovananiutenne/cervica,, gastric, eolon, lung md CNS eancer eel, lines. ■„ addition, there ' 
are two .dependent samples of cerebel.um. These cells are al, cultured under Sudani 
recommended conditions and RNA extracted using me standard procedure, The cel, lines 
■n pane. jD and ,.3D are of the most common cel, ,i„es used in me scientific literature. 
Panels 4D,4R, and 4.1D 

Panel 4 includes samples on a 96 well plate (2 control w=l„ 94 test 
composed of RNA (Pane. 4R) or cDNA (Panels 4D/4,D, isolated from various human 
cel. hues or tissues related ,0 inflammatory condition, Total RNA from control normal 
.sues such as colon and luog (SurUagene, u Jolla> CA) md ^ ^ 
Clontech) was employed. Total RNA from liver tissue from cirrhosis patients and kidney 
from lupus patients was obtained from BioChain (Biocham .nstitute, Inc., Hayward, CA) 
ln.es .na. tissue for RNA preparation fiom patients diagnose as having Crohn's disease 
« .ulcerative co,itis was obtained ftom me National Disease Research Interchange 
(NDRJ) (Philadelphia, PA). 

Astrocytes, tog fibroblast, dermal fibroblast, coronary artery smoofl, muscle 
ce 1, amal, ainvay epimelium, bronchia, epithelium, microvascular derma, endothelial 
cel., rn.cmvascu.ar Itmg endothelial cell, humao pulmonary aortic endothelial cells 
human umbilical vein endothelial cells were al. purchased from Clonetics (WaUcersvifle 
MD) and grown in the media suppIied for ^ ^ fcy ^ ^ 

ypes werc activated with various cytokines or combinations of cytokines for 6 ard/or ,2- 

* &,lowin8 were -* IL -* — - T >- 

ng/m. TNF ajpha a, approximate., S-.Ong/ml, ,FN gamma „ approximateIy 20 . 
50ng/ml, IL-4 at approximate., 5-.0ng/ml, IL-9 a, approximately 5-.0„g/ml, IL-.3 a, • 
approximate,, s-.ong/m,. Endotheba. oe„s werc sometimes starved for various times by 
cultme m the basal media fix,™ Clonetics with 0.1% serum. 

Mononuclear cells were prepared from blood of employees a. CuraGen 
Coition, using Ffcoll . LAK ce„s werc prepared fiom titese cefls by eulUrre in DMEM 
(HyC, ° ne) ' 100 * M « — « amino aeids (Gibco/Life Technologies, 
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Rockville, MD), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xl0 5 M (Gibco), and 
lOmM Hepes (Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 
10-20ng/ml PMA and l-2ug/ml ionomycin, IL-12 at 5-10ng/ml, IFN gamma at 20- 
50ng/ml and IL-18 at 5-10ng/ml for 6 hours. In some cases, mononuclear cells were 
cultured for 4-5 days in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO- 5 M (Gibco), and lOmM 
Hepes (Gibco) with PHA (phytohemagglutinin) or PWM (pokeweed mitogen) at 
approximately 5ug/ml. Samples were taken at 24, 48 and 72 hours for RNA preparation. 
MLR (mixed lymphocyte reaction) samples were obtained by taking blood from two 
donors, isolating the mononuclear cells using Ficoll and mixing the isolated mononuclear 
cells 1 : 1 at a final concentration of approximately 2xl0 6 cells/ml in DMEM 5% FCS 
(Hyclone), 1 OOuM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 
mercaptoethanol (5.5xlO- 5 M) (Gibco), and lOmM Hepes (Gibco). The MLR was cultured 
and samples taken at various time points ranging from 1- 7 days for RNA preparation. 

Monocytes were isolated from mononuclear cells using CD 14 Miltenyi Beads, +ve 
VS selection columns and a Vario Magnet according to the manufacturer's instructions. 
Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf 
serum (FCS) (Hyclone, Logan, UT), lOOuM non essential amino acids (Gibco), ImM 
sodium pyruvate (Gibco), mercaptoethanol 5.5xlO- 5 M (Gibco), and lOmM Hepes (Gibco), 
50ng/ml GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of 
monocytes for 5-7 days in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0' 5 M (Gibco), lOmM 
Hepes (Gibco) and 1 0% AB Human Serum or MCSF at approximately 50ng/ml. 
Monocytes, macrophages and dendritic cells were stimulated for 6 and 12-14 hours with 
lipopolysaccharide (LPS) at lOOng/ml. Dendritic cells were also stimulated with anti- 
CD40 monoclonal antibody (Pharmingen) at lOug/ml for 6 and 12-14 hours. 

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from 
mononuclear cells using CD4, CDS and CD56 Miltenyi beads, positive VS selection 
columns and a Vario Magnet according to the manufacturer's instructions. CD45RA and 
CD45RO CD4 lymphocytes were isolated by depleting "mononuclear cells of CD8, CD56, 
CD14 and CD19 cells using CDS, CD56, CD14 and CD19 Miltenyi beads and positive 
selection. CD45RO beads were then used to isolate the CD45RO CD4 lymphocytes with 
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the remaining cells being CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and 
CD8 lymphocytes were placed in DMEM 5% FCS (Hyclone), lOOfiM non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), and 
lOmM Hepes (Gibco) and plated at 10 6 cells/ml onto Falcon 6 well tissue culture plates 
5 that had been coated overnight with 0.5^g/ml anti-CD28 (Pharmingen) and 3ug/ml anti- 
CD3 (OKT3, ATCC) in PBS. After 6 and 24 hours, the cells were harvested for RNA 
preparation. To prepare chronically activated CD8 lymphocytes, we activated the isolated 
CD8 lymphocytes for 4 days on anti-CD28 and anti-CD3 coated plates and then harvested 
the cells and expanded them in DMEM 5% FCS (Hyclone), lOOjuM non essential amino 

10 acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), and 
lOmM Hepes (Gibco) and IL-2. The expanded CD8 cells were then activated again with 
plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was 
isolated 6 and 24 hours after the second activation and after 4 days of the second 
expansion culture. The isolated NK cells were cultured in DMEM 5% FCS (Hyclone), 

15 IOOjiM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 

mercaptoethanol 5.5xl0~ 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2 for 4-6 days 
before RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with 
sterile dissecting scissors and then passed through a sieve. Tonsil cells were then spun 

20 down and resupended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), lOO^M non essential 
amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), 
and lOmM Hepes (Gibco). To activate the cells, we used PWM at 5|ig/ml or anti-CD40 
(Pharmingen) at approximately 10|ig/ml and IL-4 at 5-10ng/ml. Cells were harvested for 
RNA preparation at 24,48 and 72 hours. 

25 To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon 

plates were coated overnight with 10|ig/ml anti-CD28 (Pharmingen) and 2|ng/ml OKT3 
(ATCC), and then washed twice with PBS. Umbilical cord blood CD4 lymphocytes 
(Poietic Systems, German Town, MD) were cultured at 10 5 -10 6 cells/ml in DMEM 5% 
FCS (Hyclone), lOO^M non essential amino acids (Gibco), ImM sodium pyruvate 

30 (Gibco), mercaptoethanol 5.5xl0" 5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (4ng/ml). 
IL-12 (5ng/ml) and anti-IL4 (1 ug/ml) were used to direct to Thl, while IL-4 (5ng/ml) and 
anti-IFN gamma (lug/ml) were used to direct to Th2 and IL-10 at 5ng/ml was used to 
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direct to Trl. After 4-5 days, the activated Thl, Th2 and Trl lymphocytes were washed 
once in DMEM and expanded for 4-7 days in DMEM 5% FCS (Hyclone), lOOuM non 
essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x10" 
5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (lng/ml). Following this, the activated Thl, 
5 Th2 and Trl lymphocytes were re-stimulated for 5 days with anti-CD28/OKT3 and 
cytokines as described above, but with the addition of anti-CD95L (lug/ml) to prevent 
apoptosis. After 4-5 days, the Thl, Th2 and Trl lymphocytes were washed and then 
expanded again with IL-2 for 4-7 days. Activated Th 1 and Th2 lymphocytes were 
maintained in this way for a maximum of three cycles. RNA was prepared from primary 
10 and secondary Thl , Th2 and Trl after 6 and 24 hours following the second and third 
activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second 
and third expansion cultures in Interleukin 2. 

The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1, 
KU-8 1 2. EOL cells were further differentiated by culture in 0. 1 mM dbcAMP at 
15 5x 1 O'cells/ml for 8 days, changing the media every 3 days and adj usting the cell 

concentration to 5xl0 5 cells/ml. For the culture of these cells, we used DMEM or RPMI (as 
recommended by the ATCC), with the addition of 5% FCS (Hyclone), lOOuM non 
essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x10' 
5 M (Gibco), lOmM Hepes (Gibco). RNA was either prepared from resting cells or cells 
20 activated with PMA at lOng/ml and ionomycin at 1 ug/ml for 6 and 14 hours. Keratinocyte 
line CCD106 and an airway epithelial tumor line NCI-H292 were also obtained from the 
ATCC. Both were cultured in DMEM 5% FCS (Hyclone), 100uM non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO- 5 M (Gibco), and 
lOmM Hepes (Gibco). CCD1 106 cells were activated for 6 and 14 hours with 
25 approximately 5 ng/ml TNF alpha and lng/ml IL-1 beta, while NCI-H292 cells were 
activated for 6 and 14 hours with the following cytokines: 5ng/ml IL-4, 5ng/ml IL-9, 
5ng/ml IL-1 3 and 25ng/ml IFN gamma. 

For these cell lines and blood cells, RNA was prepared by lysing approximately 
10 7 cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane 
30 (Molecular Research Corporation) was added to the RNA sample, vortexed and after 10 
minutes at room temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. 
The aqueous phase was removed and placed in a 15ml Falcon Tube. An equal volume of 
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isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down 
at 9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet 
was redissolved in 300ul of RNAse-free water and 35ul buffer (Promega) 5nl DTT, 7ul 
RNAsin and 8ul DNAse were added. The tube was incubated at 37°C for 30 minutes to 
5 remove contaminating genomic DNA, extracted Once with phenol chloroform and re- 
precipitated with 1/10 volume of 3M sodium acetate and 2 volumes of 100% ethanol. The 
RNA was spun down and placed in RNAse free water. RNA was stored at -80°C. 
Al comprehensive panel vl.O 

The plates for AI_comprehensive panel_vl .0 include two control wells and 89 test 
1 0 samples comprised of cDNA isolated from surgical and postmortem human tissues 
obtained from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was 
extracted from tissue samples from the Backus Hospital in the Facility at CuraGen. Total 
RNA from other tissues was obtained from Clinomics. 

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained 
from patients undergoing total knee or hip replacement surgery at the Backus Hospital. 
Tissue samples were immediately snap frozen in liquid nitrogen to ensure that isolated 
RNA was of optimal quality and not degraded. Additional samples of osteoarthritis and 
rheumatoid arthritis joint tissues were obtained from Clinomics. Normal control tissues 
were supplied by Clinomics and were obtained during autopsy of trauma victims. 

Surgical specimens of psoriatic tissues and adjacent matched tissues were provided 
as total RNA by Clinomics. Two male and two female patients were selected between the 
ages of 25 and 47. None of the patients were taking prescription drugs at the time samples 
were isolated. 

Surgical specimens of diseased colon from patients with ulcerative colitis and 
25 Crohns disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue 
from three female and three male Crohn's patients between the ages of 41-69 were used. 
Two patients were not on prescription medication while the others were taking 
dexamethasone, phenobarbital, or tylenol. Ulcerative colitis tissue was from three male 
and four female patients. Four of the patients were taking lebvid and two were on 
30 phenobarbital. 

Total RNA from post mortem lung tissue from trauma victims with no disease or 
with emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients 
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ranged in age from 40-70 and all were smokers, this age range was chosen to focus on 
patients with cigarette-linked emphysema and to avoid those patients with alpha-1 anti- 
trypsin deficiencies. Asthma patients ranged in age from 36-75, and excluded smokers to 
prevent those patients that could also have COPD. COPD patients ranged in age from 35- 
80 and included both smokers and non-smokers. Most patients were taking corticosteroids, 
and bronchodilators. 

In the labels employed to identify tissues in the AI_comprehensive paneljvl.O 
panel, the following abbreviations are used: 

AI = Autoimmunity 

Syn = Synovial 

Normal = No apparent disease 

Rep22 /Rep20 = individual patients 

RA = Rheumatoid arthritis 

Backus = From Backus Hospital 

OA = Osteoarthritis 

(SS) (BA) (MF) - Individual patients 

Adj = Adjacent tissue 

Match control = adjacent tissues 

-M = Male 

-F = Female 

COPD = Chronic obstructive pulmonary disease 
Panels 5D and 51 

The plates for Panel 5D and 51 include two control wells and a variety of cDNAs 
isolated from human tissues and cell lines with an emphasis on metabolic diseases. 
Metabolic tissues were obtained from patients enrolled in the Gestational Diabetes study. 
Cells were obtained during different stages in the differentiation of adipocytes from 
human mesenchymal stem cells. Human pancreatic islets were also obtained. 

In the Gestational Diabetes study subjects are young (18-40 years), otherwise 
healthy women with and without gestational diabetes undergoing routine (elective) 
Caesarean section. After delivery of the infant, when the surgical incisions were being 
repaired/closed, the obstetrician removed a small sample (<1 cc) of the exposed metabolic 
tissues during the closure of each surgical level. The biopsy material was rinsed in sterile 
saline, blotted and fast frozen within 5 minutes from the time of removal. The tissue was 
then flash frozen in liquid nitrogen and stored, individually, in sterile screw-top tubes and 
kept on dry ice for shipment to or to be picked up by CuraGen. The metabolic tissues of 
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interest include uterine wall (smooth muscle), visceral adipose, skeletal muscle (rectus) 
and subcutaneous adipose. Patient descriptions are as follows: 

Patient 2: Diabetic Hispanic, overweight, not on insulin 
Patient 7-9: Nondiabetic Caucasian and obese (BMI>30) 
Patient 10: Diabetic Hispanic, overweight, on insulin 
Patient 1 1 : Nondiabetic African American and overweight 
Patient 12: Diabetic Hispanic on insulin 

Adipocyte differentiation was induced in donor progenitor cells obtained from 
Osirus (a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had 
only two replicates. Scientists at Clonetics isolated, grew and differentiated human 
mesenchymal stem cells (HuMSCs) for CuraGen based on the published protocol found in 
Mark F. Pittenger, et al., Multilineage Potential of Adult Human Mesenchymal Stem Cells 
Science Apr 2 1999: 143-147. Clonetics provided Trizol lysates or frozen pellets suitable 
for mRNA isolation and ds cDNA production. A general description of each donor is as 
follows: 

Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose 
Donor 2 and 3 AM: Adipose, AdiposeMidway Differentiated 
Donor 2 and 3 AD: Adipose, Adipose Differentiated 

Human cell lines were generally obtained from ATCC (American Type Culture 
Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver 
HepG2 cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These 
cells are all cultured under standard recommended conditions and RNA extracted using 
the standard procedures. All samples were processed at CuraGen to produce single 
stranded cDNA. 

Panel 51 contains all samples previously described with the addition of pancreatic 
islets from a 58 year old female patient obtained from the Diabetes Research Institute at 
the University of Miami School of Medicine. Islet tissue was processed to total RNA at an 
outside source and delivered to CuraGen for addition to panel 51. 

In the labels employed to identify tissues in the 5D and 51 panels, the following 
abbreviations are used: 

GO Adipose = Greater Omentum Adipose 
SK= Skeletal Muscle 
UT = Uterus 
PL = Placenta 

AD = Adipose Differentiated 
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AM = Adipose Midway Differentiated 
U = Undifferentiated Stem Cells 

Panel CNSD.01 

5 The plates for Panel CNSD.01 include two control wells and 94 test samples 

comprised of cDNA isolated from postmortem human brain tissue obtained from the 
Harvard Brain Tissue Resource Center. Brains are removed from calvaria of donors 
between 4 and 24 hours after death, sectioned by neuroanatomists, and frozen at -80°C in 
liquid nitrogen vapor. All brains are sectioned and examined by neuropathologists to 

1 0 confirm diagnoses with clear associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains two brains 
from each of the following diagnoses: Alzheimer's disease, Parkinsons disease, 
Huntington's disease, Progressive Supernuclear Palsy, Depression, and "Normal controls". 
Within each of these brains, the following regions are represented: cingulate gyrus, 

1 5 temporal pole, globus palladus, substantia nigra, Brodman Area 4 (primary motor strip), 
Brodman Area 7 (parietal cortex), Brodman Area 9 (prefrontal cortex), and Brodman area 
17 (occipital cortex). Not all brain regions are represented in all cases; e.g., Huntington's 
disease is characterized in part by neurodegeneration in the globus palladus, thus this 
region is impossible to obtain from confirmed Huntington's cases. Likewise Parkinson's 

20 disease is characterized by degeneration of the substantia nigra making this region more 
difficult to obtain. Normal control brains were examined for neuropathology and found to 
be free of any pathology consistent with neurodegeneration. 

In the labels employed to identify tissues in the CNS panel, the following 
abbreviations are used: 

25 PSP = Progressive supranuclear palsy 

Sub Nigra = Substantia nigra 

Glob Palladus= Globus palladus 

Temp Pole = Temporal pole 

Cing Gyr = Cingulate gyrus 
30 BA 4 - Brodman Area 4 

Panel CNS_Neurodegeneration_V1.0 

The plates for Panel CNS_Neurodegeneration_V1.0 include two control wells and 
47 test samples comprised of cDNA isolated from postmortem human brain tissue 
35 obtained from the Harvard Brain Tissue Resource Center (McLean Hospital) and the 
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Human Brain and Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare 
System). Brains are removed from calvaria of donors between 4 and 24 hours after death, 
sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are 
sectioned and examined by neuropathologists to confirm diagnoses with clear associated 
5 neuropathology. 

Disease diagnoses are taken from patient records. The panel contains six brains 
from Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who 
showed no evidence of dementia prior to death. The eight normal control brains are 
divided into two categories: Controls with no dementia and no Alzheimer's like pathology 
1 0 (Controls) and controls with no dementia but evidence of severe Alzheimer's like 
pathology, (specifically senile plaque load rated as level 3 on a scale of 0-3; 0 - no 
evidence of plaques, 3 = severe AD senile plaque load). Within each of these brains, the 
following regions are represented: hippocampus, temporal cortex (Brodman Area 21), 
parietal cortex (Brodman area 7), and occipital cortex (Brodman area 17). These regions 
were chosen to encompass all levels of neurodegeneration in AD. The hippocampus is a 
region of early and severe neuronal loss in AD; the temporal cortex is known to show 
neurodegeneration in AD after the hippocampus; the parietal cortex shows moderate 
neuronal death in the late stages of the disease; the occipital cortex is spared in AD and 
therefore acts as a "control" region within AD patients. Not all brain regions are 
represented in all cases. 

In the labels employed to identify tissues in the CNS_Neurodegeneration_V 1 .0 
panel, the following abbreviations are used: 

AD = Alzheimer's disease brain; patient was demented and showed AD-like 
pathology upon autopsy 

Control = Control brains; patient not demented, showing no neuropathology 
Control (Path) = Control brains; pateint not demented but showing sever AD-like 
pathology 

SupTemporal Ctx = Superior Temporal Cortex 
Inf Temporal Ctx = Inferior Temporal Cortex 



15 



The quantitative expression of various clones was assessed using microliter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines 
and tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed 
Applied Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence 



on an 
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Detection System. Various collections of samples are assembled on the plates, and 
referred to as Panel 1 (containing normal tissues and cancer cell lines), Panel 2 (containing 
samples derived from tissues from normal and cancer sources), Panel 3 (containing cancer 
cell lines), Panel 4 (containing cells and cell lines from normal tissues and cells related to 
5 inflammatory conditions), Panel 5D/5I (containing human tissues and cell lines with an 
emphasis on metabolic diseases), AI_comprehensive_panel (containing normal tissue and 
samples from autoinflammatory diseases), Panel CNSD.01 (containing samples from 
normal and diseased brains) and CNS_neurodegeneration_panel (containing samples from 
normal and Alzheimer's diseased brains). 

10 RNA integrity from all samples is controlled for quality by visual assessment of 

agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio 
as a guide (2: 1 to 2.5:1 28s: 1 8s) and the absence of low molecular weight RNAs that 
would be indicative of degradation products. Samples are controlled against genomic 
DNA contamination by RTQ PCR reactions run in the absence of reverse transcriptase 

1 5 using probe and primer sets designed to amplify across the span of a single exon. 

First, the RNA samples were normalized to reference nucleic acids such as 
constitutively expressed genes (for example, P-actin and GAPDH). Normalized RNA (5 
ul) was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master 
Mix Reagents (Applied Biosysteiiis; Catalog No. 4309169) and gene-specific primers 

20 according to the manufacturer's instructions. 

In other cases, non-normalized RNA samples were converted to single strand 
cDNA (sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 18064-147) 
and random hexamers according to the manufacturer's instructions. Reactions containing 
up to 10 ^g of total RNA were performed in a volume of 20 ^1 and incubated for 60 

25 minutes at 42°C. This reaction can be scaled up to 50 \ig of total RNA in a final volume of 
100 ul. sscDNA samples are then normalized to reference nucleic acids as described 
previously, using IX TaqMan® Universal Master mix (Applied Biosystems; catalog No. 
4324020), following the manufacturer's instructions. 

Probes and primers were designed for each assay according to Applied Biosystems 

30 Primer Express Software package (version I for Apple Computer's Macintosh Power PC) 
or a similar algorithm using the target sequence as input. Default settings were used for 
reaction conditions arid the following parameters were set before selecting primers: primer 
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concentration = 250 nM, primer melting temperature (Tm) range = 58°-60°C, primer 
optimal Tm = 59°C, maximum primer difference = 2°C, probe does not have 5'G, probe 
Tm must be 10°C greater than primer Tm, amplicon size 75bp to lOObp. The probes and 
primers selected (see below) were synthesized by Synthegen (Houston, TX, USA). Probes 
5 were double purified by HPLC to remove uncoupled dye and evaluated by mass 

spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' ends of the 
probe, respectively. Their final concentrations were: forward and reverse primers, 900nM 
each, and probe, 200nM. 

PCR conditions: When working with RNA samples, normalized RNA from each 
1 0 tissue and each cell line was spotted in each well of either a 96 well or a 384-well PCR 

plate (Applied Biosystems). PCR cocktails included either a single gene specific probe and 
primers set, or two multiplexed probe and primers sets (a set specific for the target clone 
and another gene-specific set multiplexed with the target probe). PCR reactions were set 
up using TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No. 
15 43 1 3 803) following manufacturer's instructions. Reverse transcription was performed at 
48°C for 30 minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then 
40 cycles of 95°C for 15 seconds, 60°C for 1 minute. Results were recorded as CT values 
(cycle at which a given sample crosses a threshold level of fluorescence) using a log scale, 
with the difference in RNA concentration between a given sample and the sample with the 
20 lowest CT value being represented as 2 to the power of delta CT. The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and 
multiplying by 100. 

When working with sscDNA samples, normalized sscDNA was used as described 
previously for RNA samples. PCR reactions containing one or two sets of probe and 

25 primers were set up as described previously, using 1 X TaqMan® Universal Master mix 
(Applied Biosystems; catalog No. 4324020), following the manufacturer's instructions. 
PCR amplification was performed as follows: 95°C 10 min, then 40 cycles of 95°C for 15 
seconds, 60°C for 1 minute. Results were analyzed and processed as described previously. 
Panels 1,1.1, 1.2, and 1.3D 

30 The plates for Panels 1, 1.1, 1.2 and 1.3D include 2 control wells (genomic DNA 

control and chemistry control) and 94 wells containing cDNA from various samples. The 
samples in these panels are broken into 2 classes: samples derived from cultured cell lines 
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and samples derived from primary normal tissues. The cell lines are derived from cancers 
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate 
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, 
gastric cancer and pancreatic cancer. Cell lines used in these panels are widely available 
through the American Type Culture Collection (ATCC), a repository for cultured cell 
lines, and were cultured using the conditions recommended by the ATCC. The normal 
tissues found on these panels are comprised of samples derived from all major organ 
systems from single adult individuals or fetuses. These samples are derived from the 
following organs: adult skeletal muscle, fetal skeletal muscle, adult heart, fetal heart, adult 
kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various regions of the 
brain, the spleen, bone marrow, lymph node, pancreas, salivary gland, pituitary gland, 
adrenal gland, spinal cord, thymus, stomach, small intestine, colon, bladder, trachea, 
breast, ovary, uterus, placenta, prostate, testis and adipose. 

In the results for Panels 1, 1.1, 1 .2 and 1 .3D, the following abbreviations are used: 

ca. = carcinoma, 

* = established from metastasis, 

met = metastasis, 

s cell var = small cell variant, 

npn-s = non-sm = non-small, 

squam = squamous, 

pi. efif = pi effusion = pleural effusion, 

glio = glioma, 

astro = astrocytoma, and 

neuro = neuroblastoma. 

General_screening_panel_vl A and General_screening^_panel_vl .5 
The plates for Panels 1.4 and 1.5 include 2 control wells (genomic DNA control 
and chemistry control) and 94 wells containing cDNA from various samples. The samples 
in Panels 1.4 and 1.5 are broken into 2 classes: samples derived from cultured cell lines 
and samples derived from primary normal tissues. The cell lines are derived from cancers 
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate 
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, 
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gastric cancer and pancreatic cancer. Cell lines used in Panels 1.4 and 1.5 are widely 
available through the American Type Culture Collection (ATCC), a repository for cultured 
cell lines, and were cultured using the conditions recommended by the ATCC. The normal 
tissues found on Panels 1.4 and 1.5 are comprised of pools of samples derived from all 
5 major organ systems from 2 to 5 different adult individuals or fetuses. These samples are 
derived from the following organs: adult skeletal muscle, fetal skeletal muscle, adult heart, 
fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various 
regions of the brain, the spleen, bone marrow, lymph node, pancreas, salivary gland, 
pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, colon, 
1 0 bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. Abbreviations 
are as described for Panels 1 , 1 . 1 , 1 .2, and 1 .3D. 
Panels 2D, 2.2, 2.3 and 2.4 

The plates for Panels 2D, 2.2, 2.3 and 2.4 generally include 2 control wells and 94 
test samples composed of RNA or cDNA isolated from human tissue procured by 
surgeons working in close cooperation with the National Cancer Institute's Cooperative 
Human Tissue Network (CHTN) or the National Disease Research Initiative (NDRI) or 
from Ardais or Clinomics). The tissues are derived from human malignancies and in cases 
where indicated many malignant tissues have "matched margins" obtained from 
noncancerous tissue just adjacent to the tumor. These are termed normal adjacent tissues 
and are denoted "NAT" in the results below. The tumor tissue and the "matched margins" 
are evaluated by two independent pathologists (the surgical pathologists and again by a 
pathologist at NDRI/ CHTN/Ardais/Clinomics). Unmatched RNA samples from tissues 
without malignancy (normal tissues) were also obtained from Ardais or Clinomics. This 
analysis provides a gross histopathological assessment of tumor differentiation grade. 
Moreover, most samples include the original surgical pathology report that provides 
information regarding the clinical stage of the patient. These matched margins are taken 
from the tissue surrounding (i.e. immediately proximal) to the zone of surgery (designated 
"NAP', for normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples 
were obtained from various human tissues derived from autopsies performed on elderly 
people or sudden death victims (accidents, etc.). These tissues were ascertained to be free 
of disease and were purchased from various commercial sources such as Clontech (Palo 
Alto, CA), Research Genetics, and Invitrogen. 
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HASS Panel v 1.0 

The HASS panel v 1.0 plates are comprised of 93 cDNA samples and two controls. 
Specifically, 81 of these samples are derived from cultured human cancer cell lines that 
had been subjected to serum starvation, acidosis and anoxia for different time periods as 
well as controls for these treatments, 3 samples of human primary cells, 9 samples of 
malignant brain cancer (4 medulloblastomas and 5 glioblastomas) and 2 controls. The 
human cancer cell lines are obtained from ATCC (American Type Culture Collection) and 
fall into the following tissue groups: breast cancer, prostate cancer, bladder carcinomas, 
pancreatic cancers and CNS cancer cell lines. These cancer cells are all cultured under 
standard recommended conditions. The treatments used (serum starvation, acidosis and 
anoxia) have been previously published in the scientific literature. The primary human 
cells were obtained from Clonetics (Walkersville, MD) and were grown in the media and 
conditions recommended by Clonetics. The malignant brain cancer samples are obtained 
as part of a collaboration (Henry Ford Cancer Center) and are evaluated by a pathologist 
prior to CuraGen receiving the samples . RNA was prepared from these samples using the 
standard procedures. The genomic and chemistry control wells have been described 
previously. 

Panel 3D 

The plates of Panel 3D are comprised of 94 cDNA samples and two control 
samples. Specifically, 92 of these samples are derived from cultured human cancer cell 
lines, 2 samples of human primary cerebellar tissue and 2 controls. The human cell lines 
are generally obtained from ATCC (American Type Culture Collection), NCI or the 
German tumor cell bank and fall into the following tissue groups: Squamous cell 
carcinoma of the tongue, breast cancer, prostate cancer, melanoma, epidermoid carcinoma, 
sarcomas, bladder carcinomas, pancreatic cancers, kidney cancers, leukemias/lymphomas, 
ovarian/uterine/cervical, gastric, colon, lung and CNS cancer cell lines. In addition, there 
are two independent samples of cerebellum. These cells are all cultured under standard 
recommended conditions and RNA extracted using the standard procedures. The cell lines 
in panel 3D and 1 .3D are of the most common cell lines used in the scientific literature. 

Panels4D,4R,and4.1D 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) 
composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human 
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cell lines or tissues related to inflammatory conditions. Total RNA from control normal 
tissues such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney 
(Clontech) was employed. Total RNA from liver tissue from cirrhosis patients and kidney 
from lupus patients was obtained from BioChain (Biochain Institute, Inc., Hayward, CA). 
5 Intestinal tissue for RNA preparation from patients diagnosed as having Crohn f s disease 
and ulcerative colitis was obtained from the National Disease Research Interchange 
(NDRI) (Philadelphia, PA). 

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle 
cells, small airway epithelium, bronchial epithelium, microvascular dermal endothelial 

10 cells, microvascular lung endothelial cells, human pulmonary aortic endothelial cells, 
human umbilical vein endothelial cells were all purchased from Clonetics (Walkersville, 
MD) and grown in the media supplied for these cell types by Clonetics. These primary cell 
types were activated with various cytokines or combinations of cytokines for 6 and/or 12- 
14 hours, as indicated. The following cytokines were used; IL-1 beta at approximately 1- 

15 5ng/ml, TNF alpha at approximately 5-1 Ong/ml, IFN gamma at approximately 20- 

50ng/ml, IL-4 at approximately 5-1 Ong/ml, IL-9 at approximately 5-1 Ong/ml, IL-1 3 at 
approximately 5-1 Ong/ml. Endothelial cells were sometimes starved for various times by 
culture in the basal media from Clonetics with 0.1% serum. 

Mononuclear cells were prepared from blood of employees at CuraGen 

20 Corporation, using Ficoll. LAK cells were prepared from these cells by culture in DMEM 
5% FCS (Hyclone), lOO^M non essential amino acids (Gibco/Life Technologies, 
Rockville, MD), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO* 5 M (Gibco), and 
lOmM Hepes (Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 
10-20ng/ml PMA and l-2^ig/ml ionomycin, IL-12 at 5-10ng/ml, IFN gamma at 20- 

25 50ng/ml and IL-1 8 at 5-1 Ong/ml for 6 hours. In some cases, mononuclear cells were 
cultured for 4-5 days in DMEM 5% FCS (Hyclone), IOOjiM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0~ 5 M (Gibco), and lOmM 
Hepes (Gibco) with PHA (phytohemagglutinin) or PWM (pokeweed mitogen) at 
approximately 5ng/ml. Samples were taken at 24, 48 and 72 hours for RNA preparation. 

30 MLR (mixed lymphocyte reaction) samples were obtained by taking blood from two 

donors, isolating the mononuclear cells using Ficoll and mixing the isolated mononuclear 
cells 1 :1 at a final concentration of approximately 2xl0 6 cells/ml in DMEM 5% FCS 
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(Hyclone), lOOuM non essential amino aeids (Gibco), ImM sodium pyravate (Gibco), 
mercaptoethanol (5.5xlO- 5 M) (Gibco), and lOmM Hepes (Gibco). The MLR was cultured 
and samples taken at various time points ranging from 1 - 7 days for RNA preparation. 

Monocytes were isolated from mononuclear cells using CD 14 Miltenyi Beads, +ve 
VS selection columns and a Vario Magnet according to the manufacturer's instructions. 
Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf 
serum (FCS) (Hyclone, Logan, UT), lOOuM non essential amino acids (Gibco), ImM 
sodium pyruvate (Gibco), mercaptoethanol 5.5x1 (T 5 M (Gibco), and lOmM Hepes (Gibco), 
50ng/ml GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of 
monocytes for 5-7 days in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO' 5 M (Gibco), lOmM 
Hepes (Gibco) and 1 0% AB Human Serum or MCSF at approximately 50ng/ml. 
Monocytes, macrophages and dendritic cells were stimulated for 6 and 12-14 hours with 
lipopolysaccharide (LPS) at 1 OOng/ml. Dendritic cells were also stimulated with anti- 
CD40 monoclonal antibody (Pharmingen) at lOpg/ml for 6 and 12-14 hours. 

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from 
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection 
columns and a Vario Magnet according to the manufacturer's instructions. CD45RA and 
CD45RO CD4 lymphocytes were isolated by depleting mononuclear cells of CD8, CD56, 
CD14 and CD19 cells using CD8, CD56, CD14 and CD19 Miltenyi beads and positive 
selection. CD45RO beads were then used to isolate the CD45RO CD4 lymphocytes with 
the remaining cells being CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and 
CD8 lymphocytes were placed in DMEM 5% FCS (Hyclone), lOOuM non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xl(T 5 M (Gibco), and 
lOmM Hepes (Gibco) and plated at 10 6 cells/ml onto Falcon 6 well tissue culture plates 
that had been coated overnight with 0.5ug/ml anti-CD28 (Pharmingen) and 3ug/ml anti- 
CD3 (OKT3, ATCC) in PBS. After 6 and 24 hours, the cells were harvested for RNA 
preparation. To prepare chronically activated CD8 lymphocytes, we activated the isolated 
CD8 lymphocytes for 4 days on anti-CD28 and anti-CD3 coated plates and then harvested 
the cells and expanded them in DMEM 5% FCS (Hyclone), lOOuM non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 (T 5 M (Gibco), and 
lOmM Hepes (Gibco) and IL-2. The expanded CD8 cells were then activated again with 
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plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was 
isolated 6 and 24 hours after the second activation and after 4 days of the second 
expansion culture. The isolated NK cells were cultured in DMEM 5% FCS (Hyclone), 
lOOuM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 
mercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2 for 4-6 days 
before RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with 
sterile dissecting scissors and then passed through a sieve. Tonsil cells were then spun 
down and resupended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), lOOuM non essential 
amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), 
and lOmM Hepes (Gibco). To activate the cells, we used P WM at 5ug/ml or anti-CD40 
(Pharmingen) at approximately lOug/ml and IL-4 at 5-10ng/ml. Cells were harvested for 
RNA preparation at 24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon 
plates were coated overnight with lOug/ml anti-CD28 (Pharmingen) and 2ug/ml OKT3 
(ATCC), and then washed twice with PBS. Umbilical cord blood CD4 lymphocytes 
(Poietic Systems, German Town, MD) were cultured at 10 5 -10 6 cells/ml in DMEM 5% 
FCS (Hyclone), lOOuM non essential amino acids (Gibco), ImM sodium pyruvate 
(Gibco), mercaptoethanol 5.5xl()- 5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (4ng/ml). 
IL-12 (5ng/ml) and anti-IL4 (lug/ml) were used to direct to Thl, while IL-4 (5ng/ml) and 
anti-IFN gamma (1 ug/ml) were used to direct to Th2 and IL-10 at 5ng/ml was used to 
direct to Trl . After 4-5 days, the activated Thl, Th2 and Trl lymphocytes were washed 
once in DMEM and expanded for 4-7 days in DMEM 5% FCS (Hyclone), lOOuM non 
essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x10" 
5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (lng/ml). Following this, the activated Thl, 
Th2 and Trl lymphocytes were re-stimulated for 5 days with anti-CD28/OKT3 and 
cytokines as described above, but with the addition of anti-CD95L (1 ug/ml) to prevent 
apoptosis. After 4-5 days, the Thl, Th2 and Trl lymphocytes were washed and then 
expanded again with IL-2 for 4-7 days. Activated Thl and Th2 lymphocytes were 
maintained in this way for a maximum of three cycles. RNA was prepared from primary 
and secondary Thl , Th2 and Trl after 6 and 24 hours following the second and third 
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activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second 
and third expansion cultures in Interleukin 2. 

The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1, 
KU-812. EOL cells were further differentiated by culture in 0.1 mM dbcAMP at 
5 5x1 (fcells/ml for 8 days, changing the media every 3 days and adjusting the cell 

concentration to SxlO^cells/ml. For the culture of these cells, we used DMEM or RPMI (as 
recommended by the ATCC), with the addition of 5% FCS (Hyclone), 100|iM non 
essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x10" 
5 M (Gibco), lOmM Hepes (Gibco). RNA was either prepared from resting cells or cells 

10 activated with PMA at 1 Ong/ml and ionomycin at 1 ^ig/ml for 6 and 14 hours. Keratinocyte 
line CCD 106 and an airway epithelial tumor line NCI-H292 were also obtained from the 
ATCC. Both were cultured in DMEM 5% FCS (Hyclone), lOO^M non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), and 
lOmM Hepes. (Gibco). CCD 1 106 cells were activated for 6 and 14 hours with 

15 approximately 5 ng/ml TNF alpha and lng/ml IL-1 beta, while NCI-H292 cells were 
activated for 6 and 14 hours with the following cytokines: 5ng/ml IL-4, 5ng/ml IL-9, 
5ng/ml IL-1 3 and 25ng/ml IFN gamma. 

For these cell lines and blood cells, RNA was prepared by lysing approximately 
10 7 cells/ml using TrizoL(Gibco.BRL), Briefly, 1/10 volume of bromochloropropane 

20 (Molecular Research Corporation) was added to the RNA sample, vortexed and after 10 
minutes at room temperature, the tubes were spun at 14,000 ipm in a Sorvall SS34 rotor. 
The aqueous phase was removed and placed in a 15ml Falcon Tube. An equal volume of 
isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down 
at 9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet 

25 was redissolved in 300^1 of RNAse-free water and 35^1 buffer (Promega) 5fil DTT, 7jnl 
RNAsin and 8jil DNAse were added. The tube was incubated at 37°C for 30 minutes to 
remove contaminating genomic DNA, extracted once with phenol chloroform and re- 
precipitated with 1/10 volume of 3M sodium acetate and 2 volumes of 100% ethanol. The 
RNA was spun down and placed in RNAse free water. RNA was stored at -80°C. 

30 Alcomprehensive panel_v 1 .0 

The plates for AI_comprehensive panel_vl.0 include two control wells and 89 test 
samples comprised of cDNA isolated from surgical and postmortem human tissues 
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obtained from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was 
extracted from tissue samples from the Backus Hospital in the Facility at CuraGen. Total 
RNA from other tissues was obtained from Clinomics. 

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained 
5 from patients undergoing total knee or hip replacement surgery at the Backus Hospital. 
Tissue samples were immediately snap frozen in liquid nitrogen to ensure that isolated 
RNA was of optimal quality and not degraded. Additional samples of osteoarthritis and 
rheumatoid arthritis joint tissues were obtained from Clinomics. Normal control tissues 
were supplied by Clinomics and were obtained during autopsy of trauma victims. 
10 Surgical specimens of psoriatic tissues and adjacent matched tissues were provided 

as total RNA by Clinomics. Two male and two female patients were selected between the 
ages of 25 and 47. None of the patients were taking prescription drugs at the time samples 
were isolated. 

Surgical specimens of diseased colon from patients with ulcerative colitis and 
15 Crohns disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue 
from three female and three male Crohn's patients between the ages of 41-69 were used. 
Two patients were not on prescription medication while the others were taking 
dexamethasone, phenobarbital, or tylenol. Ulcerative colitis tissue was from three male 
and four female patients. Four of the patients were taking lebvid and two were on 
20 phenobarbital. 

Total RNA from post mortem lung tissue from trauma victims with no disease or 
with emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients 
ranged in age from 40-70 and all were smokers, this age range was chosen to focus on 
patients with cigarette-linked emphysema and to avoid those patients with alpha- lanti- 
25 trypsin deficiencies. Asthma patients ranged in age from 36-75, and excluded smokers to 
prevent those patients that could also have COPD. COPD patients ranged in age from 35- 
80 and included both smokers and non-smokers. Most patients were taking corticosteroids, 
and bronchodilators. 

In the labels employed to identify tissues in the Alcomprehensive panel_vl .0 
30 panel, the following abbreviations are used: 
AI = Autoimmunity 
Syn = Synovial 
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Normal = No apparent disease 
Rep22 /Rep20 = individual patients 
RA = Rheumatoid arthritis 
Backus = From Backus Hospital 
5 OA = Osteoarthritis 

(SS) (BA) (MF) = Individual patients 
Adj = Adjacent tissue 
Match control = adjacent tissues 
-M = Male 
10 -F = Female 

COPD = Chronic obstructive pulmonary disease 
Panels 5D and 51 

The plates for Panel 5D and 51 include two control wells and a variety of cDNAs 
isolated from human tissues and cell lines with an emphasis on metabolic diseases. 
1 5 Metabolic tissues were obtained from patients enrolled in the Gestational Diabetes study. 
Cells were obtained during different stages in the differentiation of adipocytes from 
human mesenchymal stem cells. Human pancreatic islets were also obtained. 

In the Gestational Diabetes study subjects are young (18-40 years), otherwise 
healthy women with and without gestational diabetes undergoing routine (elective) 
20 Caesarean section. After delivery of the infant, when the surgical incisions were being 

repaired/closed, the obstetrician removed a small sample (<1 cc) of the exposed metabolic 
tissues during the closure of each surgical level. The biopsy material was rinsed in sterile 
saline, blotted and fast frozen within 5 minutes from the time of removal. The tissue was 
then flash frozen in liquid nitrogen and stored, individually, in sterile screw-top tubes and 
25 kept on dry ice for shipment to or to be picked up by CuraGen. The metabolic tissues of 
interest include uterine wall (smooth muscle), visceral adipose, skeletal muscle (rectus) 
and subcutaneous adipose. Patient descriptions are as follows: 
Patient 2: Diabetic Hispanic, overweight, not on insulin 
Patient 7-9: Nondiabetic Caucasian and obese (BMI>30) 
30 Patient 1 0: Diabetic Hispanic, overweight, on insulin 

Patient 1 1 : Nondiabetic African American and overweight 
Patient 12: Diabetic Hispanic on insulin 
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Adiocyte differentiation was induced in donor progenitor cells obtained from 
Osirus (a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had 
only two replicates. Scientists at Clonetics isolated, grew and differentiated human 
5 mesenchymal stem cells (HuMSCs) for CuraGen based on the published protocol found in 
Mark F. Pittenger, et al„ Multilineage Potential of Adult Human Mesenchymal Stem Cells 
Science Apr 2 1999: 143-147. Clonetics provided Trizol lysates or frozen pellets suitable 
for mRNA isolation and ds cDNA production. A general description of each donor is as 
follows: 

10 Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose 

Donor 2 and 3 AM: Adipose, AdiposeMidway Differentiated 
Donor 2 and 3 AD: Adipose, Adipose Differentiated . 

Human cell lines were generally obtained from ATCC (American Type Culture 
15 Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver 
HepG2 cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These 
cells are all cultured under standard recommended conditions and RNA extracted using 
the standard procedures. All samples were processed at CuraGen to produce single 
20 stranded cDNA. 

Panel 51 contains all samples previously described with the addition of pancreatic 
islets from a 58 year old female patient obtained from the Diabetes Research Institute at 
the University of Miami School of Medicine. Islet tissue was processed to total RNA at an 
outside source and delivered to CuraGen for addition to panel 51. 
25 In the labels employed to identify tissues in the 5D and 51 panels, the following 

abbreviations are used: 

GO Adipose = Greater Omentum Adipose 
SK = Skeletal Muscle 
UT = Uterus 
30 PL = Placenta 

AD = Adipose Differentiated 

AM = Adipose Midway Differentiated 
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U = Undifferentiated Stem Cells 
Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples 
comprised of cDNA isolated from postmortem human brain tissue obtained from the 
5 Harvard Brain Tissue Resource Center. Brains are removed from calvaria of donors 

between 4 and 24 hours after death, sectioned by neuroanatomists, and frozen at -80°C in 
liquid nitrogen vapor. All brains are sectioned and examined by neuropathologists to 
confirm diagnoses with clear associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains two brains 
10 from each of the following diagnoses: Alzheimer's disease, Parkinson's disease, 

Huntington's disease, Progressive Supernuclear Palsy, Depression, and "Normal controls". 
Within each of these brains, the following regions are represented: cingulate gyrus, 
temporal pole, globus palladus, substantia nigra, Brodman Area 4 (primary motor strip), 
Brodman Area 7 (parietal cortex), Brodman Area 9 (prefrontal cortex), and Brodman area 
15 17 (occipital cortex). Not all brain regions are represented in all cases; e.g., Huntington's 
disease is characterized in part by neurodegeneration in the globus palladus, thus this 
region is impossible to obtain from confirmed Huntington's cases. Likewise Parkinson's 
disease is characterized by degeneration of the substantia nigra making this region more 
difficult to obtain. Normal control brains were examined for neuropathology and found to 
20 be free of any pathology consistent with neurodegeneration. 

In the labels employed to identify tissues in the CNS panel, the following 
abbreviations are used: 

PSP = Progressive supranuclear palsy 

Sub Nigra = Substantia nigra 
25 Glob Palladus= Globus palladus 

Temp Pole = Temporal pole 

Cing Gyr = Cingulate gyrus 

BA 4 = Brodman Area 4 

Panel CNSJNeurodegenerationVl.O 
30 The plates for Panel CNS_Neurodegeneration_VLO include two control wells and 

47 test samples comprised of cDNA isolated from postmortem human brain tissue 
obtained from the Harvard Brain Tissue Resource Center (McLean Hospital) and the 



381 



WO 03/010327 



PCT/US02/14199 



Human Brain and Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare 
System). Brains are removed from calvaria of donors between 4 and 24 hours after death, 
sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are 
sectioned and examined by neuropathologists to confirm diagnoses with clear associated 
5 neuropathology. 

Disease diagnoses are taken from patient records. The panel contains six brains 
from Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who 
showed no evidence of dementia prior to death. The eight normal control brains are 
divided into two categories: Controls with no dementia and no Alzheimer's like pathology 
10 (Controls) and controls with no dementia but evidence of severe Alzheimer's like 
pathology, (specifically senile plaque load rated as level 3 on a scale of 0-3; 0 = no 
evidence of plaques, 3 = severe AD senile plaque load). Within each of these brains, the 
following regions are represented: hippocampus, temporal cortex (Brodman Area 21), 
parietal cortex (Brodman area 7), and occipital cortex (Brodman area 17). These regions 
15 were chosen to encompass all levels of neurodegeneration in AD. The hippocampus is a 
region of early and severe neuronal loss in AD; the temporal cortex is known to show 
neurodegeneration in AD after the hippocampus; the parietal cortex shows moderate 
neuronal death in the late stages of the disease; the occipital cortex is spared in AD and 
therefore acts as a "control" region within AD patients. Not all brain regions are 
20 represented in all cases. 

In the labels employed to identify tissues in the CNS_NeurodegenerationJV1.0 
panel, the following abbreviations are used: 

AD = Alzheimer's disease brain; patient was demented and showed AD-like 
pathology upon autopsy 
25 Control = Control brains; patient not demented, showing no neuropathology 

Control (Path) = Control brains; pateint not demented but showing sever AD-like 
pathology 

SupTemporal Ctx = Superior Temporal Cortex 
Inf Temporal Ctx = Inferior Temporal Cortex 

30 

A. CG100126-01: Keratin Associated Protein 
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Expression of gene CG100126-01 was assessed using the primer-probe set 
Ag41 63, described in Table AA. Results of the RTQ-PCR runs are shown in Tables AB, 
AC and AD. 

Table AA. Probe Name Ag4163 



Primers 


Sequences 


Length 


Start Position 


SEQ ID No 


Forward 


5 ' -cagtgctgccagtctgtgt-3 • 


19 


217 


203 


Probe 


TET-5 ' -agctgcagcatctccagctgctg-3 ■ -TAMRA 


23 


517 


204 


Reverse 


5 ' -agctggattcacagcaagag-3 ' 


20 


546 


205 



Table AB . General _screeningjpanel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4163, Run 
221000456 


. jRel. Exp.(%) 
Tissue Name Ag41 63, Run 

(221000456 


Adipose 


0.0 


Renal ca.TK-10 jo.O 


Melanoma* Hs688(A).T 


0.0 


Bladder 


5.3 


Melanoma* Hs688(B).T 


2.4 


Gastric ca. (liver met.) NCI- 
N87 


3.4 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


100.0 


Colon ca. SW-948 


0.0 


Melanoma* SK-MEL-5 


0.0 


Colon ca. SW480 


8.1 


Squamous cell carcinoma 
SCC-4 


0.0 


Colon ca.* (SW480 met) 
SW620 


0.0 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ra * (\\r\r\f* rr\(±\\ 

PC-3 


0.0 


Colon ca. HCT-116 


95.9 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


2.9 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV-3 


5.8 


Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


0.0 


Colon Pool J8.1 


Ovarian ca. OVCAR-5 


26.1 


Small Intestine Pool J17.4 


Ovarian ca. IGROV-1 


31.2 


Stomach Pool ]3.5 


Ovarian ca. OVCAR-8 


0.0 


Bone Marrow Pool jo.O 


Ovary 


0.0 


Fetal Heart jo.O 


Breast ca. MCF-7 


0.0 


Heart Pool j4.2 


Breast ca. MDA-MB-231 


0.0 


Lymph Node Pool J9.0 


Breast ca. BT 549 


O.O 


Fetal Skeletal Muscle 0.0 


Breast ca. T47D 


81.8 


Skeletal Muscle Pool j3. 1 
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Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


2.6 


Thymus Pool 


1.0 


Trachea 


ft ft 


CNS cancer (elio/astro^ I J87- 
MG 


0.0 


Lung 


5.7 


CNS cancer (glio/astro) U-l 18- 
MG 


10.2 


Fetal Lung 


0.0 


CNS cancer (neuro;met) SK-N- 
AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


8 * 2 jCNS cancer (astro) SNB-75 


0.0 


Lung ca. NCI-H146 


9:°.. _ jCNS cancer (glio) SNB-1 9 


31.9 


Lung ca. SHP-77 


u ; u jCNS cancer (glio) SF-295 


0.0 


Lung ca. A549 


u :° jfirain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


°- Q jBrain (cerebellum) 


0.0 


Lung ca. NCI-H23 


26 6 |Brain (fetal) ' 


0.0 ~~" 


Lung ca. NCI-H460 


0 0 jBrain (Hippocampus) Pool 


0.0 


Lung ca. HOP-62 j 


0.0 1 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 j 


0.0 ] 


Brain (Substantia nigra) Pool 


0.0 


Liver j 


0.0 ] 


Brain (Thalamus) Pool ~~T 


0.0 


hetal Liver jo.O ] 


3rain (whole) 


3.0 


Liver ca. HepG2 jo.O < 


ipinal Cord Pool < 


3.0 


Kidney Pool 9 .0 Adrenal Gland ] 


).0 


retal Kidney ( 


1'9- {Pituitary gland Pool j( 


).0 


Renal ca. 786-0 2 


[ -2 jSalivary Gland 0 0 


Renal ca. A498 |( 


1,0 jThyroid (female) jo 0 


Renal ca. ACHN C 


)a 9 jPancreatic ca. CAPAN2 0 


1.0 


Renal ca. UO-3 1 0 


-° jPancreas Pool I5 


.6 



Tissue Name 


Rel. Exp.(%) 
Ag4163,Run 
173333942 


Tissue Name 


Rel. Exp.(%) 
Ag4163, Run 
173333942 


Secondary Thl act 


0.0 


HUVECIL-lbeta 


0.1 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVECTNF alpha + IFN 
gamma 


0.3 


Secondary Thl rest 


0.0 


HUVECTNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL- 11 


0.1 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.0 


Primary Thl act 


0.1 


Lung Microvascular EC ] 


0.0 
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TNFalpha + IL-lbeta 


~3 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


26.4 


Primary Trl act 


0 0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0 0 


Bronchial epithelium TNFalpha 
+ ILlbeta 


'o.o I 


(Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Trl rest 


0 0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0 0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0 0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 I 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0 3 1 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL- 
lbeta 


0.2 | 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 IvmDhocvte none 


ft n 

u.u 


KU-8 12 (Basophil) 
PMA/ionomycin 


— 

0.0 j 


2ryThimi2ATrl anti- 
CD95CH11 , 


0.0 


CCD 1 106 (Keratinocytes) none 


o.o ! 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 




LAK cells IL-2 i 


3.0 


Liver cirrhosis 


o.o. J 


LAK cells IL-2+IL- 12 < 


3.0 ] 


MCI-H292 none | 


3.1 ! 


LAK cells IL-2+IFN 
gamma * 


).0 ] 


^CI-H292 IL-4 ( 


3.0 | 


LAK cells IL-2+IL- 18 j( 


).0 1 


MCI-H292 IL-9 |( 


3.1 j 


LAK cells 

PMA/ionomycin j * 


1.0 1 


^CI-H292 IL-13 ( 


1.2 



NK Cells IL-2 rest 



Two Way MLR 3 day 



Two Way MLR 5 day 



Two Way MLR 7 day 



0.0 



0.0 



0.0 



0.0 



{NCI-H292 IFN gamma b.O 



0.0 



JHPAEC none 

jHPAEC TNF alpha + IL-1 beta jo.O 



PBMC rest 


jo.o 


jLung fibroblast TNF alpha + 
|IL-1 beta 


0.4 


PBMC PWM 


jo.o 


|Lung fibroblast IL-4 


0.2 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.1 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


1.2 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 
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B lymphocytes PWM 


0.1 


Dermal fibroblast CCD 1070 
rest 


0.1 


B lymphocytes CD40L 
and IL-4 


0.0 


LJermai fibroblast CCD 1070 
TNF alpha 


0.0 


EOL-1 dbcAMP 


u.u 


Dermal fibroblast CCD1070 IL- 
1 beta 


0.0 


EOL-1 dbcAMP 
r ivi/v/ lonomyc i n 


0.0 


Dermal fibroblast IFN gamma 


0.0 


I lpri/Hriti/^ s*a1Io mama 

jL/ciiunuc ecus none 


A A 

0.0 


Dermal fibroblast IL-4 


0.5 


L/cnormc ceils Ldro 


A A 

0.0 


Dermal Fibroblasts rest 


0.9 


ivenanuc ceils antl-UIJ4U 


A 1 

0.1 


Neutrophils TNFa+LPS 


0.7 


Monocytes rest 


0.0 


Neutrophils rest 


0.6 


Monocytes LPS 


0.0 


Colon 


0.8 


Macrophages rest 


0.1 


Lung 


3.6 • 


Macrophages LPS 


0.0 


Thymus 


15.5 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 







Table AD . General oncology screening paneljv 2.4 



Tissue Name 


Rel. Exp.(%) 
Ag4163, Run 
268624200 


Tissue Name 


Rel. Exp.(%) 
Ag4163, Run 
268624200 


Colon cancer 1 


0.0 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


0.0 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


0.0 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 2 


0.0 


Adenocarcinoma of the prostate 
1 


0.6 


Colon cancer 3 


0.0 


Adenocarcinoma of the prostate 
2 


0.0 


Colon cancer NAT 3 


0.0 


Adenocarcinoma of the prostate 
3 


0.2 


Colon malignant cancer 4 


0.3 


Adenocarcinoma of the prostate 
4 


0.5 


Colon normal adjacent 
tissue 4 


0.0 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


0.3 


Adenocarcinoma of the prostate 
6 


0.0 


Lung NAT 1 


0.1 


Adenocarcinoma of the prostate 
7 


0.0 


Lung cancer 2 


0.0 


Adenocarcinoma of the prostate 
8 


0.7 


Lung NAT 2 


0.3 


Adenocarcinoma of the prostate 
9 


0.0 
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Squamous cell carcinoma 

i 

j 


0.2 


Prostate cancer NAT 10 


0.2 


T imp NAT 3 


0 0 


ivioney cancer i 


u.u 


u ic loo 10.110 Hiwimiuiiia I 




kTi^ne»x/lVT AT 1 

)\.ianeyiN/\ 1 i 


U.U 


lViClailUula Z. 


I W.X) 


Kidney cancer 2 


1 0 


Melanoma 3 


0.3 


Kidnev NAT 2 


0 5 


metastatic melanoma 4 


0.2 


Kidney cancer 3 


0.0 


metastatic melanoma 5 


0.5 


Kidney NAT 3 


0.5 


Bladder cancer 1 


0.0 


Kidney cancer 4 


0.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


0.0 


Bladder cancer 2 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag4163 Results from one experiment 
with the CGI 00 126-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 



General_screeningj>anel_vl.4 Summary: Ag4163 Highest expression of the 
5 CGI 00 126-01 gene is detected in Melanoma LOX1MVI and a colon cancer HCT-1 16 cell 
lines (CTs=33.8). In addition, significant expression of this gene is also detected in a 
breast cancer T47D cell line. Therefore, expression of this gene can be used to distinguish 
these samples from other samples used in this panel and also as diagnostic marker for 
melanoma, colon and breast cancer. Furthermore, therapeutic modulation of this gene 
1 0 product can be useful in the treatment of these cancers. 

Panel 4.1D Summary: Ag4163 Highest expression of the CGI 00126-01 gene is 
detected in kidney (CT=29). Moderate expression of this gene is also seen in lung, thymus 
and microvascular dermal endothelial cells. Therefore, expression of this gene can be used 
to distinguish these samples from other samples in this panel. In addition, therapeutic 
1 5 modulation of this gene can be useful in the treatment of autoimmune and inflammatory 
diseases that affect lung and kidney. 

General oncology screening panel_v_2.4 Summary: Ag4163 Highest 
expression of the CGI 00 126-01 gene is detected exclusively in melanoma 
sample(CT=29.8). Therefore, expression of this gene may be used as a diagnostic marker 
20 for detection of melanoma and therapeutic modulation of this gene product may be 
beneficial in the treatment of melanoma. 

B. CG100146-01: UDP-Glucuronosyl Transferase 
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Expression of gene CGI 00146-01 was assessed using the primer-probe set 
Ag4165, described in Table BA. Results of the RTQ-PCR runs are shown in Tables BB 
andBC. 

Table BA . Probe Name Ag4165 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 • -agaccaggatgtctctgaaatg-3 1 


22 


193 


206 


Probe 


TET-5 • -cgtcagtctttctgctgatacagctca-3 1 - 
TAMRA 


27 


217 


207 


Reverse 


5 1 -ccacaactcccagagctaaag-3 ■ 


21 


251 


208 



Table BB . Generaljscreening_panel_yl.4 



Tissue Name 


Rel. Exp.(%)Ag4165 5 
ixun ZZ1UUU4JO 


Tissue Name 


Kei. cxp\/o) 
Ag4165,Run 
221000458 


Adipose 


7.4 


Renal ca.TK-10 


0.0 


Melanoma* 
Hs688(A).T 


U.l/ 


bladder 


16.4 


Melanoma* 
Hs688(B).T 




uastnc ca. (jiver met.) NCI-No / 


A A 

0.0 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


2.6 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC- 
4 


2.0 


Colon ca.* (SW480 met) 
SW620 


0.0 


Testis Pool 


1.6 


Colon ca. HT29 


0.0 


Prostate ca.* 
(bone met) PC-3 


0.0 


Colon ca. HCT-116 


o.o 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


17.1 


Placenta 


0.0 


Colon cancer tissue 


4.4 


Uterus Pool 


0.0 


Colon ca. SW1 116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


o.o 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


1.9 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


100.0 
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Ovarian ca. 
IGROV-1 




Stomach Pool 


0.6 


Ovarian ca. 
OVCAR-8 


0 0 


Bone Marrow Pool 


0.0 


Ovary 


4.0 


Fetal Heart 


0.0 


Breast ca. MCF- 
7 


7 4 


Heart Pool 


0.0 


Breast ca. MDA- 
JMB-231 


0 0 

v/.v 


Lymph Node Pool 


-t i— 

L _ 
0.0 


Breast ca. BT 
549 


0.0 


Feta! Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


!0.0 


Breast ca. MDA- 
N 


0 0 

V/.v 


Spleen Pool 


1.7 


Breast Pool 


0.8 


Thymus Pool 


2.8 


Trachea 


0.9 


CNS cancer (glio/astro) U87- 
MG 


0.0 


f imo 




CNS cancer (glio/astro) U-l 1 8- 
MG 


0.0 


J-ptal T unrr 

rciai l^uug 


U.o 


CNS cancer (neuro;met) SK-N- 
AS 


0.0 


Lung ca. NCI- 

Mzll 7 

■ i 


0.0 


CNS cancer (astro) SF-539 


0 0 


Lungca. LX-1 


0.0 


CNS cancer (astro) SNB-75 


0.0 


Lung ca. NCI- 
H146 


0.0 


CNS cancer (glio) SNB-19 


0.0 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF-295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI- 
H526 


o o 

VI. u 


Brain (cerebellum) 


0.0 


Lung ca. NCI- 
H23 


o o 

v.u 


brain (fetal) 


0.0 


Lung ca. NCI- 

xvtov 


0.0 


Brain (Hippocampus) Pool { 


}.0 


i^ung ca. rlUr-oz \ 


J.O < 


Cerebral Cortex Pool ( 


3.0 


i^ung ca. iNL/i- 
H522 1 


3.0 ] 


Brain (Substantia nigra) Pool ( 


).0 


Liver ; 


33.2 1 


Brain (Thalamus) Pool ( 


).0 


Fetal Liver I 


5.4 1 


Brain (whole) ] 


1.0 


Liver ca. HepG2 ( 


).o J 


Spinal Cord Pool ( 


).0 


Kidney Pool ( 


).0 t 


Vdrenal Gland ] 


.4 


Fetal Kidney ( 


).0 I 


^'tuitary gland Pool C 


(.0 
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Renal ca. 786-0 | 0.0 
Renal ca. A498 0.0 



Salivary Gland 



0.0 



Thyroid (female) 



0.0 



Renal ca. ACHN 0.0 



Renal ca. UO-3 1 0.0 



Pancreatic ca. CAPAN2 



0.0 



Pancreas Pool 



1.6 



Table BC. General oncology screening panel_v_2.4 



Tissue Name 


Rel.Exp.(%)Ag4165, 

Run *)(\St&1AO'~)Q 


Tissue Name 


Rel. Exp.(%) 

Aa41£^ Pun 

268624228 


Colon cancer 1 


84. 1 


Bladder cancer NAT 2 


0 0 


Colon NAT 1 


100.0 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


0.4 


Bladder cancer NAT 4 


0.0 


Colon cancer 
NAT 2 


49.0 , 


Adenocarcinoma of the prostate 
1 


0.0 


Colon cancer 3 


2.9 


Adenocarcinoma of the prostate 
2 


0.0 


Colon cancer 
NAT 3 


28.3 


Adenocarcinoma of the prostate 

3 


0 0 


Colon malignant 
cancer 4 


19.3 


Adenocarcinoma of the prostate 
4 


0.0 


Colon normal 
adjacent tissue 4 


59.9 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


0.0 


Adenocarcinoma of the prostate 
6 


0.0 


Lung NAT 1 


\J.\J 


Adenocarcinoma of the prostate 
7 


0.0 


Lune cancer 2 


ft 0 


Adenocarcinoma of the prostate 
8 


0.0 


Lune NAT 2 


0 0 


Adenocarcinoma of the prostate 
9 


0.0 


Squamous cell | 
carcinoma 3 j 


0.2 


Prostate cancer NAT 1 0 


0.0 


Lung NAT 3 f 


o.o 


Kidney cancer 1 


0.0 


metastatic 
melanoma 1 


5.1 


ECidneyNAT 1 


5.0 


Melanoma 2 < 


X0 j 


Kidney cancer 2 


).0 


Melanoma 3 ( 


).0 j 


ECidneyNAT 2 


).0 


metastatic 
melanoma 4 


).0 j 


Cidney cancer 3 ( 


).l 


metastatic 
melanoma 5 


>.l } 


CidneyNAT3 ( 


>.o 


Bladder cancer 1 


L0 i 


Cidney cancer 4 jc 


1.0 


Bladder cancer 0 


.0 f 


CidneyNAT4 Jo 


.0 
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NAT 1 








Bladder cancer 2 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag4165 Expression of the 



CGI 00146-01 gene is Iow/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

General_screening_panel_vl.4 Summary: Ag4165 Highest expression of the 
5 CG100146-01 gene is detected exclusively in small intestine (Ct=32.7). In addition, low 
expression of this gene is also detected in liver. Thus, expression of this gene can be used 
to distinguish these samples from other samples used in this panel. Furthermore, 
therapeutic modulation of this gene product could be useful in the treatment of small 
intestine and liver related disorders. The CGI 00 146-01 gene codes for UDP- 

10 glucuronosyltransferase. UDP-Glucuronosyltransferases (UGTs) are glycoproteins, which 
catalyze the confiigation of a broad variety of lipophilic aglycon substrates with 
glucuronic acid using UDP-glucuronic acid (UDP-GlcUA) as the sugar donor. The major 
function of glucuronidation is to change hydrophobic compounds into hydrophilic 
derivatives, a process which facilitates their detoxification and excretion (Radominska- 

15 Pandya et al., 2001, Curr Drug Metab 2(3):283-98, PMID: 1 1513331). Mutations in the 
UGT1 Al gene, one of the gene belonging to UGT family, are implicated in type I and 
type II Crigler-Najjar syndromes and the more common mild hyperbilirubinemia known as 
Gilbert syndrome (Kadakol et al., 2000, Hum. Mutat. 16: 297-306, PubMed 
ID:1 1013440). Thus, the CG100146-01 gene may also play a role in pathogenesis of 

20 Crigler-Najjar syndromes and Gilbert syndrome and therapeutic modulation of this gene 
could be beneficial in the. treatment of these diseases. 

Panel 4.1D Summary: Ag4165 Results from one experiment with the 
CGI 00 146-01 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. 

25 General oncology screening panel_v_2.4 Summary: Ag41 65 Highest 

expression of the CGI 00 146-01 gene is detected in colon sample (CT=27.8). Expression 
of this gene is exclusive to colon samples, with lower expression in cancer samples 
compared to adjacent normal tissue. Therefore, this gene may play a role as tumor 
suppressor and therapeutic modulation to increase the activity of the gene product may be 

30 beneficial in the treatment of colon cancer. 
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C. CG100179-01: Cyclophilin Like 

Expression of gene CG100179-01 was assessed using the primer-probe set 
Ag4166, described in Table CA. Results of the RTQ-PCR runs are shown in Tables CB, 
CC, CD and CE. 



Table CA . Probe Name Ag4166 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward j 


5 ' -tttggacgagtgactaaaggaa-3 ' 


22 


2122 


209 


Probe ! 


J. ET ~ 5 ' ~ ca 9 a g2ratctccaacgtcaaagtcaa-3 ' -TAMRA 


26 


2155 


210 


Reverse j 


5 ' -tcatagggcttatctgttttgg-3 1 


22 


2183 


211 


Table CB.CNS neurode^eneratinn vl 


1.0 





Tissue Name 


ReL Exp.(%) Ag4166, Run 
215538908 


Tissue Name 


ReI.Exp.(%) 
Ag4166,Run 
215538908 


AD 1 Hippo 


12.2 


Control (Path) 3 Temporal 
Ctx 


7.6 


AD 2 Hippo 


24.3 


Control (Path) 4 Temporal 
Ctx 


.52.5 


AD 3 Hippo J10.2 


AD 1 Occipital Ctx 


27.2 


AD 4 Hippo 


10.7 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


97.3 


AD 3 Occipital Ctx 


14.2 


AD 6 Hippo 


61.1 


AD 4 Occipital Ctx ;21.8 


Control 2 Hippo 


18.9 


AD 5 Occipital Ctx 


37.6 


Control 4 Hippo 


11.3 


AD 6 Occipital Ctx 


28.9 


Control (Path) 3 
Hippo 


11.3 


Control 1 Occipital Ctx 


8.2 


AD 1 Temporal 
Ctx 


23.0 


Control 2 Occipital Ctx 


41.2 


AD 2 Temporal 
Ctx 


29.3 


Control 3 Occipital Ctx 


30.4 


AD 3 Temporal 
Ctx 


11.7 


Control 4 Occipital Ctx 


9.7 


AD 4 Temporal 
Ctx 


26.8 


Control (Path) 1 Occipital 
Ctx 


76.3 


ADSInf 
Temporal Ctx 


100.0 


Control (Path) 2 Occipital 
Ctx 


18.7 


AD 5 Sup 
Temporal Ctx 


49.3 


Control (Path) 3 Occipital 
Ctx 


6.2 


AD6Inf 
Temporal Ctx 


58.8 


Control (Path) 4 Occipital 
Ctx 


25.9 
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AD 6 Sup 
Temporal Ctx 


55.9 


Control 1 Parietal Ctx 


10.6 


Control 1 
Temporal Ctx 


8.4 


Control 2 Parietal Ctx 


48.6 


Control 2 
Temporal Ctx 


27.5 


Control 3 Parietal Ctx 


19.3 


Control 3 
Temporal Ctx 


16.8 


Control (Path) 1 Parietal 
ctx 


63.7 


Control 3 
Temporal Ctx 


12.9 


Control (Path) 2 Parietal 
Ctx 


30.6 


Control (Path) 1 
Temporal Ctx 


51.8 


Control (Path) 3 Parietal 
Ctx 


8.6 


Control (Path) 2 
Temporal Ctx 


34.4 


Control (Path) 4 Parietal i 
Ctx 


48.0 



Table CC . General screening panel vl.4 



Tissue Name 


Rel. Exp.(%) Ag4166, Run 
221034286 


Tissue Name 


Rel. Exp.(%) 
Ag4166, Run 
221034286 


Adipose 


9.7 


Renal ca. TK- 10 


19.9 


Melanoma* 
Hs688(A).T 


9.3 


Bladder 


22.5 


Melanoma* 
Hs688(B).T 


8.0 


Gastric ca. (liver met.) 
NCI-N87 


44.8 


Melanoma* M14 


7.5 


Gastric ca. KATO III 


23.8 


Melanoma* 
LOXIMVI 


14.6 


Colon ca. SW-948 


5.8 


Melanoma* SK- 
MEL-5 


20.3 


Colon ca. SW480 


28.1 


Squamous cell 
carcinoma SCC-4 


7.5 


Colon ca.* (SW480 met) 
SW620 


29.7 


Testis Pool 


9.1 


Colon ca. HT29 


14.3 


Prostate ca.* 
(bone met) PC-3 


7.0 


Colon ca. HCT-116 


29.1 


Prostate Pool 


8.4 


Colon ca. CaCo-2 


30.1 


Placenta 


1.7 


Colon cancer tissue 


13.6 


Uterus Pool 


9.6 


Colon ca.SWl 116 


5.0 


Ovarian ca. , 
OVCAR-3 


16.4 


Colon ca. Colo-205 


5.3 


Ovarian ca. SK- 
OV-3 


28.9 


Colon ca. SW-48 


4.9 


Ovarian ca. 
OVCAR-4 


4.0 


Colon Pool 


20.6 
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vjvanan ca. 
OVCAR-5 


45.4 


Small Intestine Pool 


26.2 


Ovarian ca. 
IGROV-1 


11.1 


Stomach Pool 


14.8 


Ovarian ca. 
OVCAR-8 


15.2 


Bone Marrow Pool 


— — — 
9.9 


Ovary 


12.0 


Fetal Heart 


12.2 


Breast ca. MCF-7 


19.8 


Heart Pool 


8 5 


Breast ca. MDA- 
MB-231 


42.3 


Lymph Node Pool 


19.3 


Breast ca. BT 549 


40.9 


Fetal Skeletal MikHp 


O.Zi 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


9.5 


Breast ca. MDA- 
N 


14.3 


Spleen Pool 


18.4 


oreast fool 


1 © A 

18.0 


Thymus Pool 


22.4 


Trachea 


5.6 


CNS cancer (glio/astro) 
U87-MG 


36.1 


Lung 


19.5 


CNS cancer (glio/astro) U- 
118-MG 


27.9 


Fetal Lung 


26.2 


CNS cancer (neuro;met) 
SK-N-AS 


26.6 


Lung ca. NCI- 
N417 


3.9 


CNS cancer (astro) SF-539 


8.1 


Lungca. LX-1 


19.8 


CNS cancer (astro) SNB- 

75 ! 


49.0 


Lung ca. NCI- 
H146 


5.1 


CNS cancer (glio) SNB-19 


10.9 


Lungca.SHP-77 J17.3 


CNS cancer (glio) SF-295 


46.0 


Lung ca. A549 


20.0 


Brain (Amygdala) Pool 


8.6 


Lung ca. NCI- 
H526 


3.3 


Brain (cerebellum) 


9.7 


Lung ca. NCI- 
H23 


20.9 


Brain (fetal) 


12.9 


Lung ca. NCI- 
H460 


15.7 


Brain (Hippocampus) Pool 


6.8 


Lung ca. HOP-62 


5.0 


Cerebral Cortex Pool 


13.5 


Lung ca. NCI- 
H522 


26.8 


Brain (Substantia nigra) 
Pool 


7.9 


Liver 


0.7 


Brain (Thalamus) Pool 


14.0 


Fetal Liver 


14.5 


Brain (whole) 


4.7 


Liver ca. HepG2 


11.4 


Spinal Cord Pool 


10.2 


Kidney Pool 


45.1 


Adrenal Gland 


3.8 
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Fetal Kidney 


27.5 


Pituitary gland Pool 


3.6 


Renal ca. 786-0 


9.5 


Salivary Gland 


0.8 


Renal ca. A498 


6.0 


Thyroid (female) 


6.2 


Renal ca. ACHN 


7.3 


Pancreatic ca. CAPAN2 


25.2 


Renal ca. UO-31 


11.7 


Pancreas Pool 


21.0 


Table CD. Panel 4.1D 




Rel.Exp.(%) Ag4166, Run 
173334537 


Tissue Name 


Rel. Exp.(%) 
Ag4166, Run 
173334537 


ocuuiiuai y i n i act 


77 O 


HUVbC lL-lbeta 


49.0 


Secondary Th2 act 


74.2 


HUVEC I FN gamma 


45.4 


Secondary Trl act 


66.0 


HUVEC TNF alpha + 1 FN 
gamma 


29.3 


Secondary Thl 
rest 


39.8 


HUVEC TNF alpha + IL4 


33.4 


Secondary Th2 
rest 


55.5 


HUVEC 1L-1 1 


32.5 


Secondary Trl rest 


58.2 


Lung Microvascular EC 
none 


57.8 


Primary Thl act 


68.8 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


39.8 


Primary Th2 act 


94.0 


Microvascular Dermal EC 
none 


33.7 


Primary Trl act 


81.2 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


23.0 


Primary Thl rest 


55.5 


Bronchial epithelium 
TNFalpha + ILlbeta 


20.9 


Primary Th2 rest 


60.3 


Small airway epithelium 
none 


7.7 


Primary Trl rest 


66.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


18.7 


CD45RA CD4 
lymphocyte act 


48.6 


Coronery artery SMC rest 


17.8 


CD45ROCD4 
lymphocyte act 


87.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


22.1 


CD8 lymphocyte 
act 


66.4 


Astrocytes rest 


14.1 


Secondary CD8 
lymphocyte rest 


62.9 


Astrocytes TNFalpha + IL- 
Ibeta 


12.5 


Secondary CD8 
lymphocyte act 


33.4 


KU-8 12 (Basophil) rest 


37.6 


CD4 lymphocyte 


63.3 KU-8 12 (Basophil) j 


45.7 
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Oil II" vL»*t V 








ivxviiiuvyiCo rcsi 


ii.l 


______________ 

Neutrophils rest 


33.7 


Monocytes LPS 


35.4 


Colon 


16.4 


Macrophages rest 


31.2 


Lung 


14.9 


Macrophages LPS 


14.2 


Thymus 


62.0 


HUVEC none 


27.7 


Kidney 


60.3 


HUVEC starved 


59.0 







Table CE . General oncology screening panel_v_2.4 



i issue {Name 


Rel. Exp.(%) Ag4166, Run 
268624305 


Tissue Name 


Rel. Exp.(%) 
Ag4166, Run 
268624305 


L^oion cancer i 


20.0 


Bladder cancer NAT 2 


3.5 


Colon cancer 

MAT 1 
lN/\ 1 I 


7.5 


Bladder cancer NAT 3 




Colon cancer 2 


12.9 


Bladder cancer NAT 4 


0.0 


Colon cancer 
NAT 2 


5.3 


Adenocarcinoma of the prostate 
1 


32.1 


Colon cancer 3 


34.4 


Adenocarcinoma of the prostate 
2 


3.5 


Colon cancer 
NAT ^ 


17.4 


Adenocarcinoma of the prostate 
3 


8.1 


Colon malignant 

l/dllvcr *+ 


39.2 


Adenocarcinoma of the prostate 
4 


19.2 


vAJHJll llUIUlal 

adjacent tissue 4 


2.7 


Prostate cancer NAT 5 


9.2 


Lung cancer 1 


15.5 


Adenocarcinoma of the prostate 
6 


2.4 


Lung NAT 1 


3.9 


Adenocarcinoma of the prostate 

7 ■ 


4.5 


Lung cancer 2 


55.5 


Adenocarcinoma of the prostate 
8 


1.3 


Lung NAT 2 


5.8 


Adenocarcinoma of the prostate! 
9 J 


16.4 


Squamous cell 
carcinoma 3 


13.7 


Prostate cancer NAT 1 0 | 


0.9 


Lung NAT 3 


0.5 


Kidney cancer 1 


33.0 


metastatic 
melanoma 1 


23.2 


KidneyNAT 1 


12.5 


Melanoma 2 ; 


2.9 


Kidney cancer 2 


100.0 


Melanoma 3 < 


8.2 


fCidneyNAT2 : 


26.4 


metastatic 
melanoma 4 


13.5 ] 


<idney cancer 3 < 


10.9 
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metastatic 
melanoma 5 


66.0 


Kidney NAT 3 


11.3 


Bladder cancer 1 


7.9 


Kidney cancer 4 


10.9 


Bladder cancer 
NAT 1 


0.0 


Kidney NAT 4 


4.0 


Bladder cancer 2 


5.0 







CNS_neurodegeneration_vl.O Summary: Ag4166 This panel confirms the 
expression of the CGI 00 179-01 gene at low levels in the brains of an independent group 
of individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 



5 experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General^screening^anel^vl^ Summary: Ag4166 Highest expression of the 
CGI 00 179-01 gene is detected in the breast cancer T47D cell line (CT=26). High 
expression of this gene is seen in cluster of breast, ovarian, pancreatic, CNS, colon, 
10 gastric, renal, lung cancer cell lines and melanoma cell lines. Thus, therapeutic modulation 
of this gene could be beneficial in the treatment of these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
15 activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
20 central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4166 Highest expression of the CGI 001 79-01 gene is 
detected in anti-CD95 CHI 1 treated secondary Thl/Th2/Trl cells (CT=28.5). This gene is 
expressed at high to moderate levels in a wide range of cell types of significance in the 
25 immune response in health and disease. These cells include members of the T-cell, B-cell, 
endothelial cell, macrophage/monocyte, and peripheral blood mononuclear cell family, as 
well as epithelial and fibroblast cell types from lung and skin, and normal tissues 
represented by colon, lung, thymus and kidney. This ubiquitous pattern of expression 
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10 



15 



suggests that this gene product may be involved in homeostatic processes for these and 
other cell types and tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_vl .4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4 1 66 Highest 
expression of the CGI 00 179-01 gene is detected in kidney cancer (CT=28.5), with 
significant expression also seen in melanoma, colon, lung, bladder, prostate and kidney 
cancers. In addition, expression of this gene is higher in the cancers than in the normal 
adjacent tissue. Therefore, expression of this gene could be as a marker to detect the 
presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of these cancers. 



20 



D. CG100212-01 and CG100212-02: Novel Gene Belonging to Zinc-Containing 
Alcohol Dehydrogenase Superfamily 

Expression of gene CG100212-01 and full length physical clone CG100212-02 
was assessed using the primer-probe set Ag4167, described in Table DA. Results of the 
RTQ-PCR runs are shown in Tables DB, DC, DD and DE. Please note that CG100212-02 
represents a full-length physical clone of the CG100212-01 gene, validating the prediction 
of the gene sequence. 

Table DA . Probe Name Ag4167 



Primers 


Sequences 


Length 


Start 
Position 


SEQ 
ID No 


Forward 


5' -ggatccttacatgcgttgtaga-3" ~ 


22 


146 


212 


Probe 


T1T-5 • -tggcactgattatataacaccttggca-3 ' -TAMRA 


27 


182 


213 


Reverse 


5 ' -tccatcaacgacttgagatagc-3 ' 


22 


209 


214 



25 



Table DB . CNS_neurodegeneration_vl.O 



Tissue Name 


ReI.Exp.(%)Ag4167, 
Run 215538909 


Tissue Name 


Rel. Exp.(%) 
Ag4167, Run 
215538909 


AD 1 Hippo 


18.0 


Control (Path) 3 Temporal 


9.9 
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Ctx 




AD 2 Hippo 


44.4 


Control (Path) 4 Temporal 
Ctx 


47.3 


AD 3 Hippo 


10.7 


AD 1 Occipital Ctx 


21.9 


AD 4 Hippo 


12.9 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


56.3 


AD 3 Occipital Ctx 


9.3 1 


AD 6 Hippo 


67.8 


AD 4 Occipital Ctx 24.5 


Control 2 Hippo 


3, 0 AD 5 Occipital Ctx 


15.9 


Control 4 Hippo 


25.3 


AD 6 Occipital Ctx 


34.6 


Control (Path) 3 
Hippo 


13.3 


Control 1 Occipital Ctx 


9.2 


AD 1 Temporal 
Ctx 


34.2 


Control 2 Occipital Ctx 


39.8 


AD 2 Temporal 
Ctx 


100.0 


Control 3 Occipital Ctx 


22.2 


AD 3 Temporal 
Ctx 


9.4 


Control 4 Occipital Ctx 


13.8 


AD 4 Temporal 
Ctx 

u 


15.1 


Control (Path) 1 Occipital 
Ctx 


87.7 


AD 5 Inf 
Temporal Ctx 


79.6 


Control (Path) 2 Occipital 
Ctx 


17.6 


AD 5 

SupTemporal Ctx 


51.1 


Control (Path) 3 Occipital 
Ctx 


5.6 


AD 6 Inf 
Temporal Ctx 


71.7 


Control (Path) 4 Occipital 
Ctx 


1 0.6 


AD 6 Sup 
Temporal Ctx 


60.3 


Control 1 Parietal Ctx 


18.4 


Control 1 
Temporal Ctx 


10.2 


Control 2 Parietal Ctx 


48.6 


Control 2 
Temporal Ctx 


34.9 


Control 3 Parietal Ctx 


14.0 


Control 3 
Temporal Ctx 


29.5 < 


Control (Path) 1 Parietal Ctx ' 


75.3 


Control 4 
Temporal Ctx 


16.6 ( 


Control (Path) 2 Parietal Ctx : 


16.Z 


Control (Path) 1 
Temporal Ctx 


>1.6 < 


Zontrol (Path) 3 Parietal Ctx ] 


10.2 


Control (Path) 2 
Temporal Ctx 


(6.9 ( 


:ontroI (Path) 4 Parietal Ctx A 


12.0 



Table DC. Generaljscreeningjpanel_vl.4 



Tissue Name 



Re!.Exp.(%)Ag4167, |T . XT 
Run 221035838 ■ r issue Name 



jRel. Exp 
}Ag4167, 



Run 
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221035838 


Adipose 


3.6 


Renal ca.TTC-10 


22.5 


Melanoma* 

riSooo^AJ. 1 


5.3 


Bladder 




Melanoma* 

riSOOO(D). 1 


5.8 


Gastric ca. (liver met.) NCI- 
N87 


1 0,0 


Melanoma* M14 


1.6 


Gastric ca. KATO III 


31.9 


Melanoma* 

JLAJAIIVI Vi 


,5.0 


Colon ca. SW-948 


7 1 


Melanoma* SK- 

\/fCT C 

ivie,l»o 


12.5 


Colon ca. SW480 


?? 7 


Squamous cell 
carcinoma SCC-4 


17.3 


Colon ca.* (SW480 met) 
SW690 


14.0 


Testis Pool 


7.0 


Pnlnn rn HT7Q 


4.1 


Prostate ca.* (bone 
met) PC-3 


27.0 


Colon ca. HCT-116 


25.7 


Prostate Pool 


6.8 


Colon ca. CaCo-2 


24.1 


Placenta 


0.7 


Colon cancer tissue 


7.4 


Uterus Pool 


2.7 


Colon ca. SW1116 


2.8 


Ovarian ca. 
OVCAR-3 


28.3 


Colon ca. Colo-205 


3, 


Ovarian ca. SK- 
OV-3 


30.4 


Colon ca. SW-48 


1.7 


Ovarian ca. 
OVCAR-4 


9.5 


Colon Pool 


5.8 


Ovarian ca. 

UVLAK-j 


59.5 


Small Intestine Pnnl 


ft ft 

o.o 


Ovarian ca. 
IGROV-l 


8.7 


Stomach Pool 


5.3 


ovarian ca. 
OVCAR-8 


3.9 


Bone Marrow Pool 


2.8 


Ovary 


XA ] 


Fetal Heart 


13.7 


Breast ca.MCF-7 : 


27.2 ] 


tfeartPool \ 


8.5 


Breast ca. MDA- 
MB-231 


17.7 ] 


_ymph Node Pool < 


5.7 


Breast ca. BT 549 i 


>4.7 I 


? etal Skeletal Muscle js.2 


Breast ca. T47D 1 


t0 P-0 (Skeletal Muscle Pool ; 


>2.2 


Breast ca. MDA-N 2 


'•3 j Spleen Pool t 


1.2 


Breast Pool ( 


jThymus Pool A 


1.5 


Trachea 1 


rg jCNS cancer (glio/astro) 

JU87-MG 2 


.4.8 


Lung 3 


-7 Jc 


?NS cancer (glio/astro) U- 1 


7.4 
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118-MG 




Fetal Lime 


10 7 


CNS cancer (neurojmet) SK- 
N-AS 


9.2 


Lung ca. NCI- 
N417 


4.1 


CNS cancer (astro) SF-539 


5.6 


Lung ca. LX-1 


27.4 


CNS cancer (astro) SNB-75 


16.0 


Lung ca. NCI- 


1.2 


CNS cancer (glio) SNB-19 


7.9 


Lung ca. SHP-77 


19.5 


CNS cancer (glio) SF-295 


15.2 


Lung ca. A549 


20.6 


Brain (Amygdala) Pool 


6.7 


Lung ca. NCI- 
H526 


1.0 


Brain (cerebellum) 


6.3 


Lung ca. NCI-H23 


33.9 


Brain (fetal) 


7.8 


Lung ca. NCI- 


8.0 


Brain (Hippocampus) Pool 


8 3 

O.J 


lino r»o LJf~M) /CO 

jLrung ca. rnji^-oz 


o c 

o.b 


Cerebral Cortex Pool 


8.1 


11 nor />*a "M/^ 1 T 

i^ung ca. iNV^i- 
H522 


27.7 


Brain (Substantia nigra) 
Pool 


7.0 


Liver 


1.3 


Rf-flin ^Thalnmiic^ Pnnl 
J-Jicuil \ I ilcilcLHiuo) rQOI 


12.6 


Fetal Liver 


8.4 


uiclllj yWUUICJ 


o.o 


Liver ca. HepG2 


7.9 


Spinal Cord Pool 


10.6 


Kidney Pool 


10.0 


Adrenal Gland 


10.5 


Fetal Kidney 


11.8 


Pituitary gland Pool 


2.8 


Renal ca. 786-0 


9.8 


Salivary Gland 


3.0 


Renal ca. A498 


2.8 


Thyroid (female) 


5.4 


Renal ca. ACHN 


9.4 


Pancreatic ca. CAPAN2 ]24.3 


Renal ca.UO-31 


15.7 


Pancreas Pool {8.0 



Table DP. Panel 4.1D 



Tissue Name 


Rel.Exp.(%)Ag4167, 
Run 173350762 


Tissue Name 


Rel. Exp.(%) 
Ag4167, Run 
173350762 


Secondary Thl act 


8.5 


HUVECIL-lbeta 


9.4 


Secondary Th2 act 


10.7 


HUVEC IFN gamma 


14.0 


Secondary Trl act 


5.3 


HUVEC TNF alpha + IFN 
gamma 


4.3 


Secondary Thl 
rest 


6.2 


HUVEC TNF alpha + IL4 


6.1 


Secondary Th2 
rest 


11.5 


HUVEC IL-11 


6.9 


Secondary Trl rest 


7.4 


Lung Microvascular EC 
none 


15.3 
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Primary Thl act 


14.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


10.2 


Primary Th2 act 


9.3 


Microvascular Dermal EC 
none 


17.1 


Primary Trl act 


9.2 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


6.3 


Primary Th 1 rest 


4.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


28.1 


Primary Th2 rest 


3.0 


Small airway epithelium 
none 


9.9 


Primary Trl rest 


5.9 


Small airway epithelium 
TNFalpha + IL-lbeta 


31.4 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


20.4 


CD45RO CD4 
lymphocyte act 


22.8 


Coronery artery SMC 
TNFalpha + IL-lbeta 


14.1 


CD8 lymphocyte 
act 


16.8 


Astrocytes rest 


21.2 


Secondary CDS 
lymphocyte rest 


12.4 


Astrocytes TNFalpha + 1L- 
lbeta 


8.6 


Secondary CD8 
lymphocyte act 


12.3 


KU-812 (Basophil) rest 


22.1 


CD4 lymphocyte 
none 


4.7 


ivu-oiz ^Dasopnnj 
PMA/ionomycin 


31.4 


2ry 

Thl/Th2/Trl anti- 
CD95CH11 


3.4 


CCD1 106 (Keratinocytes) 
none 


38.4 


LAIC cells rest 


21 0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


30.6 


LAK cells IL-2 


7.0 


Liver cirrhosis 


23.7 


LAK cells IL- 
2+IL-12 


6.7 


NCI-H292 none 


66.0 


LAK cells IL- 
2+IFN gamma 


4 6 


MPT T-TOQO Tl A 


ol.o 


LAK cells IL-2+ 
IL-18 


117 


Mpi TJOQO TT O 

TNd-tizyZ lL.-y 


i nn n 


LAK cells 
PMA/ionomycin 


17 7 
I /./ 


KIi~*1 inn*) TT 1*3 


59.5 


NK Cells IL-2 rest 


9.0 


NCI-H292 IFN gamma 


68.3 


Two Way MLR 3 
day 


14.8 


HPAEC none 


12.2 


Two Way MLR 5 
day 


14.5 


HPAECTNF alpha + IL-1 
)eta 


23.0 


Two Way MLR 7 


13.6 ] 


Lung fibroblast none ! 


21.5 
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day 








PBMC rest 


4.5 


Lung fibroblast TNF alpha 
+ IL-1 beta 


29.9 


PBMC PWM 


13.2 


Lung fibroblast IL-4 


15.2 


PBMC PHA-L 


20.6 


Lung fibroblast IL-9 


19.2 


Ramos (B cell) 
none 


0.7 


Lung fibroblast IL-1 3 


21.9 


Ramos (B cell) 
ionomycin 


1.1 


Lung fibroblast 1FN 
gamma 


13.1 


B lymphocytes 
PWM 


22.5 


Dermal fibroblast 
CCD 1070 rest 


9.8 


B lymphocytes 
CD40L and IL-4 


6.7 


Dermal fibroblast 
CCD 1070 TNF alpha 


12.8 


EOL-1 dbcAMP 


5.5 


Dermal fibroblast 
CCD 1070 IL-1 beta 


7.0 


EOL-1 dbcAMP 
PMA/ionomycin 


1.4 


Dermal fibroblast I FN 
gamma 


8.4 


Dendritic cells 
none 


Zo.o 


Dermal fibroblast IL-4 


28.1 


Dendritic cells 

T DC 


27.2 


Dermal Fibroblasts rest 


1 V.J 


Dendritic cells 
anti-CD40 


25.7 


Neutrophils TNFa+LPS 


1.8 


MAnrtPvfpe root 

iviunucyiCd rest 


7 O 


Neutrophils rest 


1.0 


Monocytes LPS 


8.4 


Colon 


12.0 


Macrophages rest 


52.5 


Lung 


10.4 


Macrophages LPS 


14.6 


Thymus 


22.2 


HUVEC none 


6.5 


Kidney 


77.9 


HUVEC starved 


11.0 







Table DE . General oncology screening panel_y_2.4 



Tissue Name 


Rel.Exp.(%)Ag4167, 
Run 268624306 


Tissue Name 


Rel. Exp.(%) 
Ag4167, Run 
268624306 


Colon cancer 1 


10.4 


Bladder cancer NAT 2 


0.7 


Colon NAT 1 


4.7 


Bladder cancer NAT 3 


0.2 


Colon cancer 2 


18.3 


Bladder cancer NAT 4 


2.7 


Colon cancer 
NAT 2 


10.5 


Adenocarcinoma of the 
prostate 1 


21.3 


Colon cancer 3 


22.8 


Adenocarcinoma of the 
prostate 2 


2.8 


Colon cancer 
NAT 3 


15.3 


Adenocarcinoma of the 
prostate 3 


19.2 
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Colon malignant 
cancer 4 


28.1 


Adenocarcinoma of the 
prostate 4 


19.6 


Colon normal 
adjacent tissue 4 


5.6 


Prostate cancer NAT 5 


1.5 


Lung cancer 1 


13.0 


Adenocarcinoma of the 
prostate 6 


7.5 


Lung NAT 1 


1.0 


Adenocarcinoma of the 
prostate 7 


7.0 


Lung cancer 2 


41.2 


Adenocarcinoma of the 
prostate 8 


J.J 


Lung NAT 2 

■■ ■ 


2.9 


Adenocarcinoma of the 
prostate 9 


J 1 .0 


Squamous cell 
jcarcinoma 3 


20.0 


Prostate cancer NAT 1 0 


3.1 


Lung NAT 3 


1.3 


Kidney cancer 1 


21.8 


metastatic 
(melanoma 1 


16.8 


KidneyNAT 1 


11.4 


(Melanoma 2 


1.3 


Kidney cancer 2 


100.0 


(Melanoma 3 


4.8 


Kidney NAT 2 


43.5 


metastatic 
melanoma 4 


24.7 


Jviuuc_y cdiicci O 


AA A 

44.4 


metastatic 
melanoma 5 


35.8 


Kidney NAT 3 


14.3 


Bladder cancer 1 ! 


2-9 ] 


Kidney cancer 4 


17.6 


Bladder cancer 
NAT 1 ' 


3.0 ] 


Kidney NAT 4 ! 


9.9 1 


Bladder cancer 2 I 


1.7 
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expression of the CG100212-01 gene at low levels in the brains of an independent group 
of individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag4167 Highest expression of the 
CG100212-01 gene is detected in a breast cancer T47D cell line (CT=26). High expression 
of this gene is seen in cluster of breast, ovarian, colon, gastric, renal, lung, pancreatic, 
CNS, hepatic, prostate cancer cell lines and melanoma cell lines. Thus, therapeutic 
modulation of this gene product could be beneficial in the treatment of these cancers. 
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Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
5 diseases, such as obesity and diabetes. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 

1 0 epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4167 Highest expression of the CG100212-01 gene is 
detected in IL-9 treated NCI-H292 cell line (CT=31). This gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 

15 macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 

20 General_screeningjpanel_vl A and also suggests a role for the gene product in cell 

survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 

25 psoriasis, rheumatoid arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag41 67 Highest 
expression of the CG100212-01 gene is detected in kidney cancer (CT=28.9), with 
significant expression also seen in metastatic melanoma, colon, lung, bladder, prostate and 
kidney cancers. In addition, expression of this gene is higher in the cancers than in the 

30 normal adjacent tissue. Therefore, expression of this gene could be as a marker to detect 
the presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of these cancers. 
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E. CG100222-01: NADP-Dependent Leukotriene B4 

Expression of gene CGI 00222-01 was assessed using the primer-probe set 
Ag4169, described in Table EA. Results of the RTQ-PCR runs are shown in Tables EB 
and EC. 

5 Table EA . Probe Name Ag4169 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -gagcttcacatggaagggtt-3 ' 


20 : 


783 


215 


Probe 


TET-5 ' -attgtgacctgctggccaggagat-3 ' -TAMRA 


24 


804 


216 


Reverse 


5 1 -ctctgagacccatttcagca-3 ' 


20 


853 


217 



Table EB, General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4169, Run 
221035855 


Tissue Name 


Rel. Exp.(%) 
Ag4169, Run 
221035855 


Adipose 


0.1 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.7 


Melanoma* 
Hs688(B).T 


0.4 


Gastric ca. (liver met.) NCI-N87 


2.7 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.2 


ividaJiuiiia 

LOXIMVI 


0.2 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


1.4 


Colon ca. SW480 


1.1 


Squamous cell 
carcinoma SCC-4 


0.3 


Colon ca.* (SW480 met) SW620 


0.6 


Testis Pool 


2.4 


Colon ca. HT29 


0.6 


Prostate ca * (bone 
met) PC-3 


0.0 


Colon ca.HCT-1 16 


0.2 | 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


4.6 


Placenta 


0.1 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.2 


Ovarian ca. 
OVCAR-3 


1.6 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


100.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.8 


Small Intestine Pool 


0.2 
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Lung 



Fetal Lung 



0.1 



Lung ca. NCI-N417 



Lungca. LX-1 



0.2 



0.5 



CNS cancer (glio/astro) U87-MG 



CNS cancer (glio/astro) U-l 1 8-MG 



CNS cancer (neuro;met) SK-N-AS 



CNS cancer (astro) SF-539 



0.0 



0.3 



2.3 



CNS cancer (astro) SNB-75 



0.0 



0.4 



Lung ca. NCI-H146 



0.2 



Lung ca. SHP-77 



1.6 



Lungca. NCI-H522 



Liver 



0.0 



CNS cancer (glio) SNB-19 



CNS cancer (glio) SF-295 




0.0 



0.0 



Brain (Substa ntia nigra) Pool 
jBrain (Thalamus) Poof 



0.0 




JPancreas Pool 
Table EC . Panel 4 Lir7 



Tissue Name 



Rel. 
Exp.i 
Ag4169, 
Run 



Tissue Name 



Rel. Exp.(%) 
Ag4169,Run 
173333241 
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]l73333241 | 1 


Secondary Thl act 


|0.0 


jHUVEC IL-1 beta 


|0.4 


Secondary Th2 act 


jo.o 


jHUVEC IFN gamma 


jo.o 


Secondary Trl act 


jo.o 


jHUVEC TNF alpha + IFN gamma 


Jo.o 


Secondary Thl rest 


|o.o 


jHUVEC TNF alpha + IL4 


jo.o 


Secondary Th2 rest 


jo.o 


jHUVEC 1L- 11 


jo.o 


Secondary Trl rest 


jo.o 


iLune Microvascular EC none 


!o 4 


Primary inl act 


I ' 
0.0 


ILune Microvascular EC TNFalnha 

1 <2 v **ww uiui u\> 1 111 CX I LJ J 1 CI 

j+IL-lbeta 


0.0 


Primary Th2 act 


0. 0 

1 _ 


iMicrovascular Dermal FC none 


v.yj 


Primary Trl act 


jo.o 


jMicrosvasular Dermal EC 
TNFalpha + IL-1 beta 


0.0 


Primary Thl rest 


0.4 


Bronchial epithelium TNFalpha + 
ILlbeta 


0.0 


Primary Th2 rest 


0.5 


Small airwav enithelinm nnnp 


fl 0 


Primary Trl rest 


0.3 


Small airway epithelium TNFalpha 
+ IL-1 beta 


0.0 


CD45RA CD4 
lymphocyte act 


0.5 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.7 


iCoronerv arterv SMC TNFalnha + 
jIL-lbeta 


0.0 


CDS lymphocyte act 


0.4 


lAstrocvtes rest 




Secondary CD8 
lymphocyte rest 


0.8 


Astrocytes TNFalpha + IL- 1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


— . 

0.3 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) PMA/ionomycin 


0.4 


2iyThl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD 1 1 06 (Keratinocytes) none 


0.0 


i^AK. cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL- 1 beta 


0.0 


LAX cells IL-2 


0.0 


Liver cirrhosis 


0 0 


LAK cells IL-2+IL- 12 


0.0 


NCI-H292 none 




LAK cells IL-2+IFN 
gamma 


0 0 


vrn hoot n a 


1.3 


LAK cells IL-2+ IL- 
18 


0.7 


NCI-H292 IL-9 


4.1 


LAK cells 
PMA/ionomycin 


D.O 


NCI-H292 IL-13 I 


3.8 


NK Cells IL-2 rest 


3.0 


NCI-H292 IFN gamma < 


3.8 


Two Way MLR 3 day i 


3.0 ] 


HPAEC none ( 


3.3 
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i wo way mjlk z> day 


0.0 


HPAEC TNF alpha + IL- 1 beta 0.0 


i wo way mlk / day 


0.0 


Lung fibroblast none Jo.O 


PBMC rest 


0.4 


Lung fibroblast TNF alpha + IL-1 1 
beta 00 

J 


IPRMr* PW1V/T 
jrDiviv^ r WlVl 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 Jo.O 


Ramos (B cell) none 


fo.o 


Lung fibroblast IL- 1 3 j0.4 


Ramos (B cell) 
ionomycin 


ft ft 


Lung fibroblast I FN gamma !o.4 j 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest 


0.8 


B lymphocytes 
CD40L and IL-4 


ft ft 


Dermal fibroblast CCD 1070 TNF 
alpha 


0.0 


EOL-1 dbcAMP 


0.4 


Dermal fibroblast CCD1070 IL-1 
beta 


i 

0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast I FN gamma 


1.5 


Dendritic cells none 


0.4 


Dermal fibroblast IL-4 


0.9 I 


{Dendritic cells LPS 

(Dendritic cells anti- 
;CD40 j 


0 0 jDemial Fibroblasts rest 1.5 ~ 
0.0 JNeutrophils TNFa+LPS jl.3 


{Monocytes rest jfj.O (Neutrophils rest IQ4 " ! 


Monocles LPS (0.0 


Colon si 7 j 


Macrophages rest 1 


D.O ' 


Lung 


3.0 


Macrophages LPS { 


).0 


rhymus 


7.6 


HUVECnone ( 


).o {] 


<idney 


100.0 


HUVEC starved ( 


).0 
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CG100222-01 gene is low/undetectable (CTs > 34.5) across all of the samples on this 
panel (data not shown). 

General_screening_panel_vl.4 Summary: Ag4169 Highest expression of the 
CGI 00222-01 gene is detected in ovarian cancer SK-OV-3 cell line (CT=28). In addition, 
low but significant expression of this gene is seen in CNS cancer, colon cancer, lung 
cancer, ovarian cancer and melanoma cell lines. Thus, expression of this gene could be 
used as diagnostic marker in detection of these cancers. Therapeutic modulation of this 
gene product through the use of antibodies or small molecule drugs, may be beneficial in 
the treatment of these cancers, [mpattu, 25-Mar-02] 

Panel 4.1D Summary: Ag4169 Highest expression of the CG100222-01 gene is 
detected in Kidney (CT=29). Thus, expression of this gene can be used to distinguish the 
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kidney sample from other samples used in this panel. In addition, low but significant 
expression of this gene is also seen in lung and thymus. Therefore, therapeutic modulation 
of this gene can be beneficial in the autoimmune and inflammatory diseases that affect 
lung and kidney. 

Panel 5 Islet Summary: Ag4169 Expression of the CGI 00222-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4169 Results from one 
experiment with the CGI 00222-01 gene are not included. The amp plot indicates that 
there were experimental difficulties with this run. 

F. CG100266-01 and CG100266-02: Cyclophilin A Like Gene 

Expression of gene CGI 00266-01 and full length clone CGI 00266-02 was 
assessed using the primer-probe set Ag4 1 74 ? described in Table FA. Results of the RTQ- 
PCR runs are shown in Tables FB. FC and FD. 



Table FA . Probe Name Ag4174 



i 

.Primers 

! 


Sequences 


Length 


Start 
Position 


SEQ ID ! 

No j 


j Forward 


5 ' -gggagaaatttgatgacaagaa-3 • 


22 


371 


218 { 


Probe 


TET-5 1 -cttcatcctaaagcacgcaggtcctg-3 1 -TAMRA 


26 


393 


219 


Reverse 


5 ' -actgtttgtgttgggtccagta-3 ■ 


22 


438 


220 



Table FB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4174, Run 
221034287 


Tissue Name 


Rel. Exp.(%)Ag4174, 
Run 221034287 


Adipose 


1.6 


Renal ca. TK- 10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) NC1-N87 


3.2 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


2.4 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 met) SW620 


0.0 


Testis Pool 


14.0 


Colon ca. HT29 


0.0 
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Prostate ca.* (bone 
met) PC-3 


2.9 


Colon ca. HCT-116 


5.3 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Poo! 


0.0 


Colon ca. SW1116 


0.0 


jOvariaii ca. 
jOVCAR-3 


7.1 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


! 

|0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 




III. . * «-x i 

Small Intestine Pool 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


2.3 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool • 


1.6 


Ovary 


In n 


Fetal Heart 


lo.o 


! Breast ca MTF-7 


r - - * - ■ - ■ 


Heart Pool JOjD j 


iBreast ca MDA. 
:MB-23 1 


r — - 

'o.o 


Lymph Node Pool 


j - 1 

jo.o | 


jBreast ca. BT 549 

r" " 


0.0 


Fetal Skeletal Muscle 


3.7 


i Breast ca T47D 


J.J 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


3.5 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


28.3 


CNS cancer (glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U-l 18- 
MG 


5.5 


Fetal Lung 


62 o l CNS cancer (neuro;met) SK-N- 
JAS 


0.0 


Lung ca. NCI- 
N417 


0 0 jCNS cancer (astro) SF-53 9 


3.3 


Lungca. LX-1 < 


3 -0 |CNS cancer (astro) SNB-75 


0.0 


Lung ca. NCI- 
H146 


100.0 |( 


:NS cancer (glio) SNB-19 


0.0 


Lung ca. SHP-77 ( 


).o ~p 


INS cancer (glio)SF-295 ( 


[).0 


T .lino ca A^AO i 


)° jBrain (Amygdala) Pool : 


5.2 


Lung ca. NCI- 
H526 ( 


).o Jf 


irain (cerebellum) ( 


).0 


Lung ca. NCI-H23 ( 


).0 j £ 


irain (fetal) t 


i.O 


Lung ca. NCI- 
H460 C 


>-0 |E 


irain (Hippocampus) Pool 1 


4.8 
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Lung ca. HOP-62 


2.5 


Cerebral Cortex Pool 


1.8 


Lung ca. NCI- 
H522 


0.0 


Brain (Substantia nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


1.2 


Fetal Liver 


0.0 


Brain (whole) 


4.4 


Liver ca. HepG2 


2.2 


Spinal Cord Pool j3.6 


Kidney Pool |0.9 


Adrenal Gland o.O 


Fetal Kidney 


1 8.6 [Pituitary gland Pool 


2.3 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.0 


Renal ca. UO-3 1 


0.0 


Pancreas Pool 


2.7 




Table FC Panel 4.1 D 




Tissue Name 

j 


Rel. Exp.(%) 
Ag4174, Run 
173507624 


Tissue Name 


(Rel. Exp.(%) 
Ag4174, Run 
173507624 


{Secondary Th 1 act jO.O 


HUVEC IL-lbeta 


0.0 


jSecondary Th2 act jO.O 


HUVEC 1FN gamma jo.O 


{Secondary Trl act 
jSecondary Thl rest 


10.0 

b~o" 


HUVEC TNF alpha + IFN gamma 
HUVEC TNF alpha + 1L4 


jo.o ~~ ' 

|„_ 


JSecondary Th2 rest 


0 0 jHUVEC IL- 1 1 " 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.5 


Primary Thl rest 


0.0 


Bronchial epithelium TNFalpha + 
ILlbeta 


0.0 


Primary Th2 rest I 


3.0 


Small airway epithelium none 


0.0 


Primary Trl rest ( 


).0 ! 


Small airway epithelium TNFalpha 
*- IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act * 


).0 < 


Doronery artery SMC rest 


0.0 


CD45RO CD4 1 
lymphocyte act 


1.0 < 

I 


:oronery artery SMC TNFalpha + 
L-lbeta 


0.0 


CD8 lymphocyte 
act C 


1.0 / 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


.0 / 


Astrocytes TNFalpha + IL- 1 beta 1 


3.0 


Secondary CD8 0 


0 k 


:U-812 (Basophil) rest 


3.0 
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lymphocyte act 








\*uh lympnocyie 
none 


0.6 


KU-812 (Basophil) 
PMA/ionomvrM n 


0.0 


2ry 

ThlHWTrl anti- 
CD95 CH11 


0.0 


,vyv^i_/i i\J\J ^rvci allilUCyTCS ) IlOnC 


A A 
0-0 

t 


lai\. cells rest 


0.0 


CCD1 106 fKerafinnrvt^cA 

TNFalpha + IL-lbeta 


1 

jo.o 


LAK cells IL-2 


0.0 


Liver cirrhotic 


A A 


LAK cells IL- 
2+IL-12 


0.0 


NCI-H292 none 


0.0 


LAK cells 1L- 
2+IFN gamma 


0.0 


NCI-H292 IL-4 


1.0 


LAK cells IL-2+ 
IL-18 


0.0 


! " 1 ■ 

NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-13 


jo.o 


NK Cells IL-2 rest jO.O 


jN^i-nzvz lrfN gamma 


!n r 

0.6 


Two Way MLR 3 L „ 
day j 00 


HPAEC none 


'•0.0 


Two Way MLR 5 ] 
day j 00 


HPAEC TNF alpha + IL-i beta 


- — i 

o.o ! 

j 


Two Way MLR 7 
day 


0.6 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL- 1 
beta 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.5 


[ line* fihmhlaet II O 


0.0 


Ramos (B cell) 
none 


0.0 


Lung fibroblast IL-13 


0.0 


Ramos (B cell) 
ionomycin 


5.0 


Lung fibroblast IFN gamma I 


3.0 


B lymphocytes 
PWM ' ' 


).0 ] 


Dermal fibroblast CCD 1 070 rest ( 


).0 


B lymphocytes 
CD40LandIL-4 ( 


).0 ] 


Dermal fibroblast CCD 1070 TNF 


).2 


EOL-1 dbcAMP 


j 


Dermal fibroblast CCD 1070 IL-1 
>eta ( 


).0 


EOL-1 dbcAMP 
PMA/ionomycin 


1.4 [ 


)ermal fibroblast IFN gamma C 


1.0 


Dendritic cells 
none 


1.0 • I 


)ermal fibroblast IL-4 1 


.4 


Dendritic cells LPS 0 


•3 L 


)ermal Fibroblasts rest 2 


.7 
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Dendritic cells 


0.0 


Neutrophils TNFa+LPS 


0.0 


lviuiiocyics rcsi 


n n 
u.u 


Neutrophils rest 


1.0 


Monocytes LPS 


iO.O 


Colon 


1.8 


Macrophages rest 


0.5 


Lung 


4.0 


Macrophages LPS 


0.0 


Thymus 


21.9 


HUVEC none 


0.0 


Kidney 


100.0 


jHUVEC starved 


0.0 




Table FD. General oncologv screening panel v 7A 


Tissue Name 


Rel. Exp.(%) 
As4174 Run 
268624376 


Tiqciip Nqidp 

1 IMUC lidlllt 


Rel. Exp.(%) 

A CI A 1 1A Dim 

268624376 


Colon cancer 1 


0.0 


Bladder cancer NAT 2 


4.3 


Colon cancer NAT 
1 


0.0 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


0.0 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 


0.0 


Adenocarcinoma of the prostate 1 


I 

'4.2 ; 


jColon cancer 3 


0.0 


Adenocarcinoma of the prostate 2 


0.0 j 


Colon cancer NAT 


0.0 


auci lucdi n iiui 1 id ui uic prostate J 


J.O j 


Colon malignant 
cancer 4 


0.0 


/AUCUU^dl LlllUlild UI IIIC Dj OS Lute i 


fi n 
u.u 


Colon normal 
adjacent tissue 4 


0.0 


nuiidic cancer i j 


ft n 
u.u 


Lung cancer ] 


4.9 


Adenocarcinoma of the prostate 6 


0.0 


Lung NAT 1 


14.2 


Adenocarcinoma of the prostate 7 


4.0 


Lung cancer 2 


5.2 


Adenocarcinoma of the prostate 8 


0.0 


Lung NAT 2 


8.8 


Adenocarcinoma of the prostate 9 


0.0 j 


Squamous cell 
carcinoma 3 


5.2 


Prostate cancer NAT 10 


0.0 


Lung NAT 3 


9.7 


Kidney cancer 1 


0.0 


metastatic 
melanoma 1 


100.0 


KidneyNAT 1 


0.0 


Melanoma 2 


4.2 


Kidney cancer 2 


0.0 


Melanoma 3 


0.0 


Kidney NAT 2 


0.0 


metastatic 
melanoma 4 


0.0 


Kidney cancer 3 


0.0 


metastatic 
melanoma 5 


18.6 


Kidney NAT 3 


0.0 


Bladder cancer 1 \ 


a.o 


Kidney cancer 4 


9.0 
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Bladder cancer 
NAT 1 


0.0 


Kidney NAT 4 


0.0 


Bladder cancer 2 


0.0 







with the CGI 00266-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

Gencrai_screcning_panei_vl.4 Summary: Ag4174 Highest expression of the 
CGI 00266-01 gene is seen in a lung cancer cell line (CT=30.2). Significant levels of 
expression are also seen in fetal lung tissue, which is more proliferative than the adult 
counterpart. Thus, expression of this gene, a cyclophilin A homog. could be used to 
differentiate between fetal lung and adult lung and as a marker of lung cancer. This is in 
agreement with published reports that a novel cyclophilin like molecule is co-expressed in 
small cell lung cancer cell lines (Kim JO. Oncogene 1998 Aug 27; 1 7(8): 101 9-26). 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
useful in the treatment of lung cancer. 

Low but significant levels of expression are also seen in the amygdala, 
hippocampus and whole and fetal brain. Cyclophilin A has been implicated in neuronal 
differentiation and human embryonic brain cells (Nahreini P, Cell Mol Neurobiol 2001 
Feb;21(l):65-79). Based on the expression of this cyclophilin homolog in the CNS, 
modualation of the expression of this gene may be useful in the treatment of 
neurodegenerative disorders, including Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Panel 4.1D Summary: Ag41 74 Expression of the CG100266-01 gene is restricted 
to normal kidney, lung and thymus (CTs=30-34). This pattern of expression suggests that 
this gene product may be involved in the normal homeostasis of these tissues. Therapeutic 
modulation of the expression or function of this gene may be useful in maintaining or 
restoring function to these organs during inflammation. 

General oncology screening panel_v_2.4 Summary: Ag4174 Expression of the 
CGI 00266-01 gene is restricted to a melanoma (CT=33.8). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of melanoma. Furthermore, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of melanoma. 
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G. CG100456-01: CoA Transferase-Like 

Expression of gene CG I 00456-01 was assessed using the primer-probe set 
Ag4179, described in Table GA. Results of the RTQ-PCR runs are shown in Tables GB, 
GC and GD. 

Tabic GA . Probe Name Ag4179 



Primers 


Sequences 


Length 


p St * rt iSEQ ID No 
Position | ^ 


Forward 


5 ' -ccaggtgtggaaatgaagaag-3 ■ 


21 


1030 | 221 


Probe 


TET-5 • -atcactgacacccacacgccgtct-3 1 -TAMRA 


24 


1051 | 222 


Reverse 


5 ' - gca tgtaatagagcgtcaggt t - 3 ' 


22 


1087 223 



Table GB , General screening panel_vl.4 



Tissue l^Iame 


Kel. bxp.(%) 
Ag4179, Run 
221117873 


Tissue Name 


Rel. Exp.(%) 
Ag4179, Run 
221117873 

s 


Adipose 


2.4 


Renal ca. TK.-10 


j43.2 


Melanoma* 
Hs688(A).T 


jlS.8 


Bladder 


■r" — — ■* 

8.4 


Melanoma* 
|Hs688(B)T 


20.2 


Gastric ca. (liver met.) NCI-N87 '23.5 j 


jMelanoma* Ml 4 


{23.0 


Gastric ca. KATOllI 


:55.5 j 


Melanoma* 
LOXIMVJ 


8.9 


Colon ca. SW-948 


jl9.8 


Melanoma* SK- 
MEL-5 


22.7 


Colon ca. SW480 


28.5 


Squamous cell 
carcinoma SCC-4 


22.1 


Colon ca* (SW480 met) 
SW620 


15.3 


Testis Pool 


9.2 


Colon ca. HT29 


18.0 


Prostate ca.* (bone 
met) PC-3 


24.0 


Colon ca. HCT-116 


38.2 


Prostate Pool 


3.3 


Colon ca. CaCo-2 


24.1 


Placenta 


15.6 


Colon cancer tissue 


13.1 


Uterus Pool 


1.6 


Colon ca. SW1 116 


12.0 


Ovarian ca. 
OVCAR-3 


28.7 


Colon ca. Colo-205 


16.4 


Ovarian ca. SK- 
OV-3 


36.3 


Colon ca. SW-48 


12.0 


Ovarian ca. 
OVCAR-4 


20.6 


Colon Pool 


8.0 


Ovarian ca. 
OVCAR-5 


20.3 


Small Intestine Pool 


6.0 
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Ovarian ca. 
IGROV-1 


17.3 


Stomach Pool 


4.1 


Ovarian ca. 
OVCAR-8 


12.7 


Bone Marrow Pool 


2.6 


Ovary 

r — — 


6.5 


Fetal Heart 


5.6 


.Breast ca. MCF-7 


27.7 


Heart Pool 


4.1 


; Breast ca. MDA- 
lMB-231 


39.5 


Lymph Node Pool Jq.2 

i 


Breast ca. BT 549 


100.0 


Fetal Skeletal Muscle |4.8 


Breast ca. T47D 


36.6 


Skeletal Muscle Pool ;9.5 


Breast ca. MDA-N 


16.6 


Spleen Pool 


5.1 


Breast Pool 


7.5 


Thymus Pool 


6.6 


Trachea 

j- 


10.6 


CNS cancer (glio/astro) U87- 
MG 


16.0 


jLung 


0.8 


CNS cancer (glio/astro) U- 1 18- 
MG 


34.6 


'Fetal Lung 


9.6 


CNS cancer (neuro;met) SK-N- 
AS 


124.7 


jLung ca. NCI- 
!N417 


15.2 


CNS cancer (astro) SF-539 ;28.3 ! 


jLung ca. LX-] 


21.0 


CNS cancer (astro) SNB-75 153.6 


|Lung ca. NC1- 
|H 146 


7.5 


CNS cancer (glio) SNB- 19 jl3.6 


jLung ca. SHP-77 


1 7 - 8 jCNS cancer (glio) SF-295 |40 6 


Lung ca. A549 


14.6 


Brain (Amygdala) Pool 


5.1 


Lungca.NCI- 
H526 


4.8 


Brain (cerebellum) 


14.2 


Lung ca. NCI-H23 


18.0 


Brain (fetal) 


6.7 


Lung ca. NCI- 
H460 


8.4 |] 


Brain (Hippocampus) Pool 


4.9 


Lung ca. HOP-62 


14.3 < 


"erebral Cortex Pool 


5.1 


Lung ca. NCI- 
H522 


12.1 1 


3rain (Substantia nigra) Pool < 


5.0 


Liver t 


>.0 I 


3rain (Thalamus) Pool 


7.0 


Fetal Liver ] 


0.2 I 


Jrain (whole) 


7.5 


Liver ca. HepG2 1 


'0.7 j 


>pinal Cord Pool 


7.6 


Kidney Pool 1 


1.5 / 


Vdrenal Gland c 


>.3 


Fetal Kidney 4 


■9 f 


•ituitary gland Pool 3 


..3 


Renal ca. 786-0 2 


0.3 S 


•alivary Gland ] 


0.1 


Renal ca.A498 6 


•1 1 


"hyroid (female) 6 


.2 


Renal ca. ACHN |6.7 |Pancreat«c ca. CAPAN2 ~ 2 


1.9 
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Renal ca.UO-31 13.9 Pancreas.Pool 10.7 


Table GC. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4179, Run 
173507602 


Tissue Name 


Rel. Exp.(%) 
Ag4179, Run 
1 73507602 


Secondary Th 1 act 


56.3 


HUVEC IL-1 beta J57.8 


Secondary Th2 act 


50.0 


HUVEC IFN gamma |61.6 


Secondary Trl act 


3.8 


HUVEC TNF alpha + IFN j 

gamma 1 J 1 


Secondary Thl 
rest 




— i 

HUVEC TNF alpha + IL4 jsO.O 


Secondary Th2 
rest 




, . — , — ^_ 

HUVECIL-I1 |41.5 


Secondary Trl rest 


25.5 


Lung Microvascular EC none 76.8 


Primarv Th 1 opt 




Lung Microvascular EC 1 
TNFalpha + 1L-] beta f*'" 


Primary Th2 act 

i 

jPrimary Trl act 

i 


39.8 
33.7 


[Microvascular Dermal EC none ;54.3 

Microsvasular Dermal EC I i 
TNFalpha* IL-1 beta f 9 '* \ 


!PrimaryThl rest 


19.5 


Bronchial epithelium TNFalpha ! t/x , i 
+ ILlbeta ; 40 " 6 


1 

jPrimary Th2 rest 


18.8 


Small airway epithelium none 24. 1 


Primary Trl rest 


24.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


43.5 


CD45RA CD4 
lymphocyte act 


55.5 


Coronery artery SMC rest 


49.7 


CD45ROCD4 
lymphocyte act 


35.4 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


51.1 


CD8 lymphocyte 
act 


40.9 


Astrocytes rest 


46.7 


Secondary CDS 
lymphocyte rest 


35.8 


Astrocytes TNFalpha + IL- 1 beta 


24.0 


Seenndarv PHR 

lymphocyte act 


29.1 


KU-812 (Basophil) rest 


39.2 


CD4 lymphocyte 
none 


12.9 


KU-812 (Basophil) 
PMA/ionomycin 


54.7 


2ry 

Thl/Th2/Trl anti- : 
CD95CH11 


34.9 


CCD1 1 06 (Keratinocytes) none 


50.0 


LAK cells rest < 


*6.3 : 


CCD1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


41,5 


LAK cells IL-2 ■* 


tt>.9 ] 


Liver cirrhosis 


16.8 
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LAK cells IL- 
2+IL-12 


43.5 


NCI-H292none 


17.4 


LAK cells IL- 
[2+IFN gamma 




NCI-H292 IL-4 


23.2 


|LAK cells IL-2+ 
JlL-18 


40 T 


NCI-H292 IL-9 


28.1 


|LAK cells 
jPMA/ionomycin 


10 £ 
i \j.o 


NCI-H292 IL-I3 


130.6 


NK Cells IL-2 resl 


t45.4 


NCI-H29'' IFN oamma 




25.0 


Two Way MLR 3 
day 


44.1 


HPAEC none 


i 

,44.1 

1 


Two Way MLR 5 
day 


JO.Z 


HPAEC TNF alpha -ML- 1 beta 

l 


j49.7 


Two Way MLR 7 
day 

1 — ' — ' — 


40.6 


Lung fibroblast none 


|39.8 

i 


PBMC rest 


22.2 


Lung fibroblast TNF alpha + IL 

\ UCld 


. i 

51.1 


PBMC PWM 


34.6 


Lung fibroblast IL-4 


147.6 j 


PBMC PHA-L 


42.0 


jLung fibroblast IL-9 


'50.7 j 


Ramos (B cell) 
'none 


63.7 


Luns fibroblast \\ -1 ^ 


140.6 ! 


Ramos (B cell) 
ionomycin 


100.0 


Lun«i fibroblast IFN «amma 


k i 1 
p.1 | 


B lymphocytes 
PWM 


27.4 


Dermal fibroblast CCD] Old r»ct 


i ! 56.3 


B lymphocytes 
CD40L and IL-4 


48.0 


Dermal fibroblast CCD 1 070 

TNF alnha 
* * ^ i at Lji i<x 


! 

59.5 


EOL-1 dbcAMP - 


42.0 ' 


Dermal fiKroKlact r'pni ata tt 

1 beta 


45.1 


EOL-1 dbcAMP , 
PMA/ionomycin * 


31.0 ] 


Dermal fibroblast IFN gamma 


31.6 


Dendritic cells \ t 
none ' 




Dermal fibroblast IL-4 


40.1 


Dendritic cells L 
LPS r 


19.2 i 


>ermal Fibroblasts rest 


30.6 


Dendritir r*»llc 

anti-CD40 6 


•1.1 > 


Neutrophils TNFa+LPS 


12.2 


Monocytes rest 8 


8.9 > 


Neutrophils rest j 


36.1 


Monocytes LPS j4 


0.3 c 


'olon 


42.6 


Macrophages rest 5 


0.3 L 


,ung !; 


32.8 


Macrophages LPS 3 


7.1 T 


hymus ; 


>7.4 


HUVEC none 4 


1.5 K 


Jdney < 


58.2 
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HUVEC starved 53.2 



Table GD . General oncology screening panel_v_2.4 



Tissue Name 


Rel. Exp.(%) 
Ag4179, Run 
268695203 


Tissue Name 


Rel. Exp.(%) 
Ag4179, Run 
268695203 


Colon cancer 1 


34.4 


Bladder cancer NAT 2 


0.0 


(Colon NAT 1 


18.9 


Bladder cancer NAT 3 


0.2 

I 


Colon cancer 2 


36.1 


Bladder cancer NAT 4 


7.9" " " ~ 


Colon cancer NAT 
2 




Adenocarcinoma of the prostate 
1 


16.4 


Colon cancer 3 


76.3 


Adenocarcinoma of the prostate j 

2 \ 3 


Colon cancer NAT 
3 


j>4.2 


Adenocarcinoma of the prostate ^ 

3 i 


Colon malignant 
cancer 4 


t AA A 

100.0 


Adenocarcinoma of the prostate j _ Q ! 

4 i 1 ' - 9 ! 


Colon normal 
adjacent tissue 4 


1 "» A 

12.9 


Prostate cancer NAT 5 b.O 


Lung cancer 1 
Lung NAT 1 


1 C A 

15.4 



i.i 


Adenocarcinoma of the prostate u 7 i 

6 ? J \ 


Adenocarcinoma of the prostate U \ 
7 ' 1^-2 i 


Lung cancer 2 


25.0 


Adenocarcinoma of the prostate 
8 


i 

1.2 


Lung NAT 2 


1.5 


Adenocarcinoma of the prostate 
9 


21.8 


Squamous cell 
carcinoma 3 


24.1 


Prostate cancer NAT 10 


0.7 


Lung NAT 3 


0.4 


Kidney cancer 1 


18.9 


metastatic 
melanoma 1 


15.6 


KidneyNAT 1 


6.8 


Melanoma 2 


3.8 


Kidney cancer 2 


51.1 


Melanoma 3 


1 c 
1 .6 


Kidney NAT 2 


19.1 


metastatic 
melanoma 4 


39.2 


Kidney cancer 3 


16.6 


metastatic 
melanoma 5 


36.6 


Kidney NAT 3 


3.8 


Bladder cancer 1 


0.5 


Kidney cancer 4 


21.9 


Bladder cancer 
NAT 1 


0.0 


Kidney NAT 4 


21.5 


Bladder cancer 2 


2.5 
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General_screening_panel_vl.4 Summary: Ag4179 Highest expression of the 
CGI 00456-01 gene is detected in a breast cancer BT 549 cell line (CT=24). High 
expression of this gene is seen in cluster of breast, ovarian, colon, gastric, renal, lung, 
pancreatic, CNS, hepatic, prostate cancer cell lines and melanoma cell lines. Thus, 
therapeutic modulation of this gene product could be beneficial in the treatment of these 
cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4179 Highest expression of the CG100456-01 gene is 
detected in ionomycin treated Ramos B cells (CT=28). This gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_vl.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 
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Genera! oncology screening panel_v_2.4 Summary: Ag4179 Highest 
expression of the CGI 00456-01 gene is detected in malignant colon cancer (CT=26), with 
significant expression also seen in metastatic melanoma, colon, lung, bladder, prostate and 
kidney cancers. In addition, expression of this gene is higher in the cancers than in the 
normal adjacent tissue. Therefore, expression of this gene could be as a marker to detect 
the presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of these cancers. 



10 



H. CG100466-01: Adenine Nucleotide Translocator 2 (ANT 2) (ADP/ATP 
Translocase 2) 

Expression of gene CG100466-01 was assessed using the primer-probe set 
Ag4177. described in Table HA. Results of the RTQ-PCR runs are shown in Tables HB, 
HC and HD. 

Table HA . Probe Name Ag4177 



(Primers Sequences 

I 

{Forward 5 * -tgtgggtaaagctgaagctg-3 • 


~ r 

Lengthj 

i. 

20 j 


Start 
Position 

452™" 


SEQID 
No 

224 


jProbe TET-5 1 -aggcctctgtgactgcctggttaaga-3 » -TAMRA 


26 j 


485. 


225 


[Reverse |5 1 -ttggtacaggcccttaatcc-3 • 


20 | 


526 


226 



15 



Table HB . CNS_neurodegeneration_vl.0 



Tissue Name 


Rel.Exp.(%)Ag4177, 
Run 215539616 


Tissue Name 


Rel. Exp.(%) 
Ag4177, Run 
215539616 


AD 1 Hippo 


11.4 


Control (Path) 3 Temporal Ctx 


9.3 


AD 2 Hippo 


31.2 


Control (Path) 4 Temporal Ctx 


30.1 


AD 3 Hippo 


4.1 


AD 1 Occipital Ctx 


18.2 


AD 4 Hippo 


5.4 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


96.6 


AD 3 Occipital Ctx 


4.6 


AD 6 Hippo 


60.3 


AD 4 Occipital Ctx 


20.2 1 


Control 2 Hippo 


42.6 


AD 5 Occipital Ctx 


59.0 


Control 4 Hippo 


11.0 


AD 6 Occipital Ctx 


47.6 ; 


Control (Path) 3 
Hippo 


5.4 


Control 1 Occipital Ctx 


3.7 


AD 1 Temporal 
Ctx 


11.2 


Control 2 Occipital Ctx 


100.0 


AD 2 Temporal 
Ctx 


28.1 


Control 3 Occipital Ctx 


16.7 
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AD 3 Temporal 
Ctx 


3.2 


Control 4 Occipital Ctx 


7.2 


AD 4 Temporal 
Ctx 


27.5 


Control (Path) 1 Occipital Ctx 


95.3 


AD 5 Inf 
Temporal Ctx 


93.3 


Control (Path) 2 Occipital Ctx 


8.6 


AD 5 Sup 
Temporal Ctx 


34.4 


Control (Path) 3 Occipital Ctx 


6.5 


AD 6 Inf 
Temporal Ctx 


42.9 


Control (Path) 4 Occipital Ctx 


15.3 


AD 6 Sup 
Temporal Ctx 


49.0 


Control 1 Parietal Ctx 


5.8 


Control 1 
Temporal Ctx 


5.5 


Control 2 Parietal Ctx 


31.6 


Control 2 
Temporal Ctx 


47.6 


Control 3 Parietal Ctx 


18.9 


Control 3 

Temporal Ctx j 


16.6 


Control (Path) 1 Parietal Ctx j8 1 .2 


Control 3 
Temporal Ctx 


15.1 j 


Control (Path) 2 Parietal Ctx |32.8 1 

— J ... i 



jControl (Path) 
jTemporal Ctx 



i44.8 



Control (Path) 2 
jTemporal Ctx 



jContro! (Path) 3 Parietal Ctx ^5.0 



42.9 



-Control (Path) 4 Parietal Ctx (48.3 



Table HC General_screening_panel_vl.4 



Tissue Name 


Rel.Exp.(%) Ag4177, 
Run 221036658 


Tissue Name 


Rel. Exp.(%) 
Ag4177, Run 
221036658 


Adipose 


2.4 


Renal ca.TK-10 


18.7 


Melanoma* 
Hs688(A).T 


5.2 


Bladder 


12.9 


Melanoma* 
Hs688(B).T 


6.5 


Gastric ca. (liver met.) NCI- 
N87 


17.8 


Melanoma* M14 


53.2 


Gastric ca. KATO III 


100.0 


Melanoma* 
LOXIMVI 


33.7 


Colon ca. SW-948 


19.5 


Melanoma* SK- 
MEL-5 


58.6 


Colon ca. SW480 


81.2 


Squamous cell 
carcinoma SCC-4 


21.5 


Colon ca.* (SW480 met) 
SW620 


57.0 


Testis Pool 


2.3 


Colon ca. HT29 


35.4 


Prostate ca.* 
(bone met) PC-3 


37.1 


Colon ca. HCT-116 


52.9 
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Prostate Pool 


1.5 


Colon ca. CaCo-2 


23.7 


Placenta 


3.4 


Colon cancer tissue 


27.0 


Uterus Pool 


0.6 


Colon ca. SWM16 


8 5 


Ovarian ca. 
OVCAR-3 


34.2 


Colon ca. Colo-205 


17.7 


Ovarian ca. SK- 
IOV-3 


27.0 


Colon ca. SW-48 


19.8 


jOvarian ca. 
OVCAR-4 


32.1 




Colon Pool 

I 


3.3 


Ovarian ca. 
OVCAR-5 


19.3 


Small Intestine Pool 




1.9 


Ovarian ca. 
IGROV-1 


19.2 


Stomach Pool 


2.8 


Ovarian ca. 
OVCAR-8 


13.7 


Dune ividrrow rooi 


i n 
1 .U 


Ovary 


3.4 


Fetal Heart 


5.2 


Breast ca. MCF-7 


39.5 


Heart Pool 


1.7 


jBreast ca. MDA- 
|MB-231 


37.6 

• 


— — — _ _ — 
Lymph Node Pool 


j.O , 

f 

1 


IBreast ca. BT 549 


60.7 


Fetal Skeletal Muscle 


1.3 ! 


jBreast ca. T47D 


34.4 ISkeletal Muscle Pool 


!2- — — i 


; Breast ca. MDA- 
N 


65.1 


Spleen Pool 


2.5 


Breast Pool 


3.2 


Thymus Pool 


J.J 


Trachea 


4.7 


CNS cancer (glio/astro) U87- 
MG 


25.2 


Lung 


0.9 


CNS cancer (glio/astro) U-l 18- 
MG 


26.2 


Fetal Lung 


4.8 


CNS cancer (neuro;met) SK- 
N-AS 


29.1 


Lung ca. NCI- 
N417 


13.8 


CNS cancer (astro) SF-539 


ZO.J 


Lungca. LX-1 


46.3 


CNS cancer (astro) SNB-75 


43.8 


Lung ca. NCI- 
H146 


13.1 


CNS cancer (glio) SNB-19 


15.7 


Lung ca. SHP-77 


28.5 


CNS cancer (glio) SF-295 


18.8 


Lungca. A549 


48.3 1 


Brain (Amygdala) Pool 


3.7 


Lung ca. NCI- 
H526 


10.2 


Brain (cerebellum) 


7.3 


Lung ca. NCI- 
H23 


24.7 I 


Brain (fetal) < 


3.2 


Lung ca. NCI- 


13.7 1 


Brain (Hippocampus) Pool 


3.5 
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H460 








Lung ca. HOP-62 


8.9 


Cerebral Cortex Pool 


5.1 


Lung ca. NCI- 
H322 


25.7 


Brain (Substantia nigra) Pool 


4.9 


jLiver 


2.0 


Brain (Thalamus) Pool 


6.4 


Fetal Liver 


10.2 


Brain (whole) 


4.1 


Liver ca. HepG2 


23.0 


Spinal Cord Pool 


2.9 1 


Kidnev Pool 


j .j- 


Adrenal Gland 


3.9 "j 


Fetal Kidney 


4.5 


Pituitary gland Pool Jo.8 


iRenal ca. 786-0 


17.8 


Salivary Gland J4.O 


jRenal ca. A498 


6.5 


Thyroid (female) 


2.1 


! Renal ca. ACHN 


14.5 


Pancreatic ca. CAPAN2 


29.3 


iRenal ca. UO-3 1 


11.9 


Pancreas Pool j4.8 j 



Table HP. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) Ag4177, 
Run ! 73333376" 


Tissue Name 


Rel. E\p.(%) 
Ag4177, Run 
U73333376 


Secondary Th 1 act 


72.2 


HUVECIL-lbeta 


M i 


Secondary Th2 act 


51.8 


HUVEC IFN oamma 


: JH,V 


Secondary Trl act 


47.6 


I^vicl^?'alp17a~ IFN 
gamma 


1 * ! 

123.5 

i 


Secondary Th 1 
jrest 


17.2 


HUVEC TNF alpha + IL4 


r~-- 1 

30.6 


Secondary Th2 
rest 


17.4 


HUVEC IL-11 


22.4 


Secondary Trl rest 


17.8 


Lung Microvascular EC none 


39.0 


Primary Th] act 


52.1 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


25.0 


Primary Th2 act 


60.3 


Microvascular Dermal EC 
none 


25.2 


Primary Trl act 


80.1 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


24.0 


Primary Thl rest 


25.5 


Bronchial epithelium 
TNFalpha + ILlbeta 


40.6 


Primary Th2 rest 


19.6 


Small airway epithelium none 


16.3 


Primary Trl rest 


37.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


44.8 


CD45RA CD4 
lymphocyte act 


61.1 


Coronery artery SMC rest 


16.5 


CD45RO CD4 
lymphocyte act 


74.7 


Coronery artery SMC 
TNFalpha + IL-lbeta 


16.6 
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CD8 lymphocyte 
act 


96.6 


Astrocytes rest 


11.2 


Secondary CD8 
lymphocyte rest 


57.0 


Astrocytes TNFalpha + IL- 
lbeta 


11.6 


Secondary CD8 
lymphocyte act 


38.4 


KU-8 12 (Basophil) rest 


27.2 


CD4 lymphocyte 
none 


6.2 


KU-8 12 (Basophil) 
PMA/ionomycin 


41.2 


2ry 

lni/inz/iri anti- 
CD95 CH11 




CCD1 106 (Keratinocytes) 
none 




LAK cells rest 


31.6 


CLDl luo (tveratinocytes) 
TNFalpha + IL-1 beta 


57.0 


laiv cens XL,- A 




Liver cirrhosis 


7.0 


LAK cells IL- 


25.9 


NCI-H292 none 




56.6 


LAK Cells IL- 

2+1 FN gamma 




32.8 


NC1-H292 IL-4 


61.1 


LArv Cells lL-Z-r 

IL-18 


36.3 


NCI-H292 IL-9 


100.0 


LAK. cells 
PMA/ionomycin 


56.3 


NCI-H292IL-13 


51.1 


ink. Cells 1L-2 rest 


43.4 


x r s~* i n^m ir\i _ 

NL1-H292 IrN gamma 


4.7 


Two Way MLR 3 
day 


25.0 


HPAEC none 


24.8 


i wo way mlk j 
day 


49.3 


HPAEC TNF alpha + IL-1 beta 


29.7 


Two Way MLR 7 
day 


42.6 


Lung fibroblast none 


14.8 


PBMC rest 


11.0 


Lung fibroblast TNF alpha + 
IL-1 beta 


11.4 


PBMC PWM 


50.0 


Lung fibroblast IL-4 


17.2 


DD\/fp PLI A 1 




Lung iibroblast IL-v 


lV.5 


Ramos (B cell) 
none 


67.4 


Lung fibroblast IL-1 3 


15.3 


Ramos (B cell) 

HJiiuuiydii 


82.4 


Lung fibroblast IFN gamma 


21.2 


B lymphocytes 
PWM 


48.6 


Dermal fibroblast CCD 1070 
rest 


33.7 


B lymphocytes 
CD40LandIL-4 . 


54.0 


Dermal fibroblast CCD 1070 
TNF alpha 


57.4 


EOL-1 dbcAMP 


44.1 


Dermal fibroblast CCD 1070 
IL-1 beta 


25.2 
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EOL-] dbcAMP 
PMA/ionomvcin 

* »* « Vllwlij V Will 


31.9 


Dermal fibroblast IFN eamma 


20 3 


Dendritic cell*; 
none 


45.1 


Dermal fibroblast 1L-4 


28.3 


Dendritic cells 
LPS 


J 1 .2 


Dermal Fibroblasts rest 


15.1 


Dendritic cells 
out! r^r\A/\ 


36.1 


i 


j.O 


iviuinjcyies rest 


24.1 


Neutrophils rest 


4.3 


Monocytes LPS 


13.4 


Colon 


29.9 


Macrophages rest 


50.3 


Lung 


12.9 


Macrophages LPS 


15.4 


Thymus 


29.1 


HUVEC none j 


32.8 


Kidney 


51.4 


HUVEC starved 


37.9 







— *f O .... . vvuiiuiio Lilt 

expression of the CG100466-01 gene at low levels in the brains of an independent group 
of individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

GeneraI_ S creening_paneI_vl.4 Summary: Ag41 77 Highest expression of the 
CGI 00466-01 gene is detected in a gastric cancer KATO III cell line (CT=22). High 
expression of this gene is seen in cluster of breast, ovarian, colon, gastric, renal, lung, 
pancreatic, CNS, hepatic, prostate cancer cell lines and melanoma cell lines. Thus, 
therapeutic modulation of this gene product could be beneficial in the treatment of these 



cancers. 



Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

The CGI 00466-01 gene codes for adenine nucleotide translocator (ANT 2) 
homologue. Dysfunctioning of the ANT2 have been shown to to induce myopathies in 
mouse and in humans (Fiore et al., 2001, Clin Chim Acta 3 1 1(2): 125-35, PMID: 
1 1566172). ANT has a role in mtDNA maintenance and mutation in ANT has been 
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implicated in autosomal dominant progressive external ophthalmoplegia and other 
mitochondrial diseases (Kaukonen et aL, 2000, Science 289(5480):782-5, PMID: 
10926541). Mice deficient in the heart/muscle specific isoform of the ANT1 exhibit many 
of the hallmarks of human oxidative phosphorylation (OXPHOS) disease, including a 
5 dramatic proliferation of skeletal muscle mitochondria (Murdoch et aL, 1999, J Biol Chem 
274(20): 14429-33, PMID: 10318868). Therefore, therapeutic modulation of the ANT 
protein encoded by the CGI 00466-01 gene through the use of small molecule drug could 
be useful in the treatment mitochondria related diseases including autosomal dominant 
progressive external ophthalmoplegia and cancers. 
10 In addition, this gene is expressed at high levels in all regions of the central 

nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 
15 Panel 4.1D Summary: Ag4167 Highest expression of the CG100466-01 gene is 

detected in IL-9 treated NCI-H292 cell line (CT-25). This gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
20 and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening _panel_vl .4 and also suggests a role for the gene product in cell 
25 survival and proliferation. Therefore, modulation of the gene product with a functional 

therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 
30 General oncology screening panel_v_2.4 Summary: Ag4177 Results from one 

experiment with the CG100466-01 gene are not included. The amp plot indicates that 
there were experimental difficulties with this run. 
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I. CG100609-01: Glutathione S-Transferase 

Expression of gene CG100609-01 was assessed using the primer-probe set 
Ag4182, described in Table IA. Results of the RTQ-PCR runs are shown in Tables IB IC 
and ID. 

Table IA . Probe Name Ag4182 



Primers 



Forward 
Probe 



Reverse 



Sequences 



5 ' -ctctacatggacctgctgtca-3 1 



TET- 5 1 -ccgtgccgtctacatcttctcgaag-3 » -TAMRA 



-agttgaactggatgtcatgctt-3 1 



Length 



Start 
Position 



SEQ ID 
No 



21 



73 



227 



25 



102 



228 



9? 



Table IB. CNS_neurodegenerationvl.O 



127 



229 



Tissue Name 


Rel. Exp.(%)Ag4I82, 
Rim 215539692 


Tissue Name 


Rel. Exp.(%) 
Ag4182.Run 
215539692 


AD 1 Hippo 
1 


0.0 


Control (Path) 3 Temporal 
(Ctx 


So.o 

i 


jAD 2 Hippo 


0.0 


Control (Path) 4 Temporal 
Ctx 


1 

! 
I 


23.8 


AD 3 Hippo 


0.0 


AD 1 Occipital Ctx 


jo.o ! 


AD 4 Hippo 


r~ 

0.0 


AD 2 Occipital Ctx 
(Missing) 


|2.0 


AD 5 hippo 


0.0 


AD 3 Occipital Ctx 


Jo.o 


AD 6 Hippo 


323 


AD 4 Occipital Ctx 


"loo 


Control 2 Hippo 


0.0 


AD 5 Occipital Ctx 


jo.o 


Control 4 Hippo 


22.7 


AD 6 Occipital Ctx 


jo.o 


Control (Path) 3 
Hippo 


0.0 


Control 1 Occipital Ctx 


0.0 


AD 1 Temporal 
Ctx 


78.5 


Control 2 Occipital Ctx 


0.0 


AD 2 Temporal 
Ctx 


0.0 


Control 3 Occipital Ctx 


0.0 


AD 3 Temporal 
Ctx 1 


o.o 


Control 4 Occipital Ctx 


22.2 


AD 4 Temporal 
Ctx < 


53.3 ( 
< 


Control (Path) 1 Occipital 
Ctx 


25.0 


AD 5 Inf " 
Temporal Ctx 1 


00.0 ( 

( 


Control (Path) 2 Occipital 

:tx 


0.0 


AD 5 

SupTemporal Ctx C 


).o < 

( 


:ontrol (Path) 3 Occipital 

:tx 


0.0 
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AD6Inf 
Temporal Ctx 


23.7 


Control (Path) 4 Occipital 
Ctx 


0.0 


AD 6 Sup 
Temporal Ctx 


27.4 


Control 1 Parietal Ctx 


21.2 


Control 1 
Temporal Ctx 


0.0 


Control 2 Parietal Ctx 


0.0 


Control 2 
Temporal Ctx 


0.0 


Control 3 Parietal Ctx 


0.0 


Control 3 
Temporal Ctx 


0.0 


Control (Path) 1 Parietal 
Ctx 


o.o j 


Control 4 
Temporal Ctx 


0.0 


Control (Path) 2 Parietal 
Ctx 


0.0 


Control (Path) 1 
Temporal Ctx 


0.0 


Control (Path) 3 Parietal 
Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


0.0 


Control (Path) 4 Parietal 
Ctx 


0.0 



Table IC . General_screening_panel_vl.4 



Adipose 



Tissue Name 



Melanoma* 



■Rel. 

;Run 

j 

0.0 

i 

•0.0 



Exp.(%) A»4IS2. 
2211 18504" 



[Tissue Name 

J. _.; 

(Renal ca. TK-10 

Bladder 



!Rel. Exp.(%) 
,Ag4182, Run 
1221118504 



jO.O 

.. 

|0.6 



Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI- 
N87 


0.8 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


1.5 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca* (SW480 met) 
SW620 


0.0 


Testis Pool 


100.0 


Colon ca. HT29 


0.0 


Prostate ca.* 
(bone met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


2.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca.SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


9.1 


Colon ca. CoIo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.6 


Colon ca. SW-48 


0.0 
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Ovarian ca. 
OVCAR-4 


0.6 


Colon Pool 


0.4 


Ovarian ca. 
OVCAR-5 


0.0 


bmall Intestine Pool 


0.7 


Ovarianxa. 
IGROV-l 


1.0 


Stomach Pool 


0.5 


Ovarian ca. 
OVCAR-R 


0.0 


Bone Marrow Pool 


0.4 


Ovary 




Fetal Heart 


0.0 


Breast ca. MCF-7 


0.6 


Heart Pool 


0.0 


Breast ca. MDA- 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


1.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA- 
N 


o n 

t 


bpleen Pool 


0.4 


Breast Pool 


0.5 


Thymus Pool 


0.0 


Trachea 


1.6 


CNS cancer (glio/astro) 
U87-MG 


1 

0.0 ! 


Luns 


0 0 


CNS cancer (glio/astro) U- 
118-MG 


0.0 


Fetal Liint* 


0 ft 
xj.vj 


CNS cancer (neuro;met) 
SK-N-AS 


i 

0.0 j 


Lung ca. NCI- 
N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


5.4 


CNS cancer (astro) SNB-75 


0.8 


Lung ca. NCI- 
H146 


0.0 


CNS cancer (glio) SNB : 1 9 


0.6 


Lung ca. SHP-77 |0.0 


CNS cancer (glio) SF-295 


2.3 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI- 
H526 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. NCI- 
H23 


O.O 


Brain (tetal) 


0.3 


Lungca.NCI- 
H460 1 


[).0 


Brain (Hippocampus) Pool 


0.0 


Lung ca. HOP-62 I 


3.0 < 




J.I 


Lung ca. NCI- 
H522 ; 


3.0 1 

] 


Brain (Substantia nigra) 
Pool ~ ( 


3.8 


Liver ( 


).0 ] 


Srain (Thalamus) Pool ( 


).0 


Fetal Liver ( 


).0 | 


3rain (whole) ( 


).0 


Liver ca. HepG2 0.7 < 


spinal Cord Pool ( 


).0 
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Kidney Pool 


0.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.7 


Salivary Gland 


0.8 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.6 


Renal ca. UO-3 1 


0.0 


Pancreas Pool j0.5 



Table ID . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) Ag4l82, 
Run 173607859 


Tissue Name 


Rel. Exp.(%) 
Ag4182, Run 
173607859 


Secondary Th 1 act 


0.0 


HUVEClL-lbeta 


0.0 


Secondary Tli2 act 


o;o 


HUVEC I FN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + I FN 
gamma 


0.0 


Secondary Th 1 
rest 


0.0 


HUVEC TNF alpha + IL4 ;0.0 

i 


Secondary Th2 
rest 


,0.0 


jHUVEClL-ll 0.0 


Secondary Trl rest;0.9 


Lung Microvascular EC none 0.0 


Primary Thl act io.9 jLung Microvascular EC 

jTNFalpha + IL-lbeta. ! 


Primary Th2 act jO.O 


Microvascular Dermal EC none jO.O 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


7 ** 

0.0 


Primary Thl rest 


0.0 


Bronchial epithelium TNFalpha 
+ ILlbeta 


0.6 


Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 i 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


0.0 


CD8 lymphocyte 
act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 j 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL-] beta 


0.0 


Secondary CD8 
lymphocyte act 


O.O | 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte 
none 


5.0 j 


OJ-812 (Basophil) 
^MA/ionomycin 


D.O 


2ry ( 


3.0 ( 


XDl 106 (Keratinocytes) none jo.O 
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Thl/Th2/Trl anti- 
CD95CH11 








LAK cells rest 


0 0 

yj ,\j 


CCD1106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


1.7 


LAK cells IL- 
2+IL-12 


0.0 


NCI-H292 none 


0.0 


LAK cells IL- 
2+IFN gamma 


0 0 


NL1-H292 IL-4 


0.0 


LAK cells IL-2+ 
IL-18 


0 0 


NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ionomycin 


0 0 


NCI-H292 IL-13 


0.0 


NK Cells IL-2 rest 


0.5 


NCI-H292 I FN gamma 


0.0 } 


Two Way MLR 3 
day 


0.0 


HPAEC none 


0.0 


Two Way MLR 5 
day 


0.0 


HPAEC TNF alpha + IL-1 beta jo.O 


jf wo Way MLR 7 
jday 


0.0 


Lung fibroblast none jo.O 


■PBMCrest il.O 

! ! 


Lung fibroblast TNF alpha + IL- L « 
I beta I 0 0 


jPBMC PWM |0.8 


Lung fibroblast IL-4 lo.O 


PBMC PHA-L jo.O 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
none 


0.0 


Lung fibroblast IL-13 


1.5 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast I FN gamma 


0.0 


B lymphocytes 
PWM 


0.0 


Dermal fibroblast CCD 1070 rest 


1.5 


B lymphocytes 
CD40L and IL-4 


1.0 


Dermal fibroblast CCD 1 070 
TNF alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL- 
1 beta 


0.0 


EOL-1 dbcAMP j 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


D.O 


Dendritic cells 
none 


3.0 


Dermal fibroblast IL-4 1 


3.0 


Dendritic cells 
LPS ( 


3.0 ] 


Dermal Fibroblasts rest l 


3.7 


Dendritic cells 
anti-CD40 ( 


).0 1 


Neutrophils TNFa+LPS 


1.1 


Monocytes rest ( 


).0 i 


Neutrophils rest 


1.6 
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10 



15 



20 



25 



Monocytes LPS 



Macrophages rest 



0.0 



Macrophages LPS 



HUVEC none 



HUVEC starved 



0.0 



0.0 



0.7 



0.0 



Colon 



Lung 



0.8 



Thymus 



Kidney 



0.8 



7.9 



100.0 



CNS_neurodegeneration_vl.O Summary:" Ag4182 Ti^anefconfirms the"" 
expression of the CGI 00609-01 gene at very low levels in the brain in an independent 
group of individuals. This gene is found to be upregulated in the temporal cortex of 
Alzheimer's disease patients. Therefore, therapeutic modulation of the expression or 
function of this gene may decrease neuronal death and be of use in the treatment of this 
disease. 

General_screening_panel_vl.4 Summary: Ag4182 Highest expression of the 
CGI 00609-01 gene is detected in testis (CT=29). Thus, expression of this gene can be 
used to distinguish testis from other samples in this panel. In addition, therapeutic 
modulation of this gene can be useful in the treatment of testis related diseases such as 
fertility and hypogonadism. 

Low levels of expression of this gene is also associated with a CNS cancer, colon 
cancer, lung cancer, and an ovarian cancer cell lines. Therefore, therapeutic modulation of 
this gene can be beneficial in the treatments of these cancers. 

Panel 4.1D Summary: Ag4182 Highest expression of the CG100609-01 gene is 
detected exclusively in kidney (CT=30.6). In addition low expression of this gene is also 
seen in thymus. Thus, expression of this gene can be used to distinguish these two samples 
from other samples in this panel. Furthermore, small molecule therapies designed with the 
protein encoded for by this gene could modulate kidney function and be important in the 
treatment of inflammatory or autoimmune diseases that affect the kidney, including lupus 
and glomerulonephritis 

Panel CNS_1 Summary: Ag4182 Expression of the CG100609-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4182 Expression of the 
CG100609-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 



J. CG100710-01: AAA (ATPase Associated with Various Activities) 
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. . Expression of gene CG 1 007 1 0-0 1 was assessed using the primer-probe set 
Ag4249, described in Table JA. 

Table JA . Probe Name Ag4249 



•Primers 


Sequences 


Length 


Start Position 


SEQID No 


Forward 


5 ' -cacaggagcggctgtca-3 ' 


17 


213 


230 


Probe 


TET-5 ' -tgccagccctgagcaagtgcc-3 ' -TAMRA 


21 


235 


23, | 


Reverse 


5 1 -ctgcagagcacagcactca-3 1 


19 


262 


232 I 



CNS_neurodegenerationjvl.O Summary: Ag4249 Expression of the 
CGI 007 10-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 



General_screcningjpanel_vL4 Summary: Ag4249 Expression of the 
CG100710-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

Panel 4.1D Summary: Ag4249 Expression of the CG100710-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening pancl_v_2.4 Summary: Ag4249 Expression of the 
CG100710-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

K. CG100730-01: Exoribonuclease 

Expression of gene CGI 0073 0-01 was assessed using the primer-probe set 
Ag4187, described in Table KA. Results of the RTQ-PCR runs are shown in Tables KB, 
KC, KD and KE. 



Table KA . Probe Name Ag4187 



Primers 


Sequences 


• jLength; 


Start 
Position 


SEQID 
No 


Forward 


5 1 -gcaaatggctctgctgtaatac-3 1 


j 22 


201 


233 


Probe 


TET-5 • -taatggccacagccgtcagtaaaaca-3 • 


- TAMRA j 26 ! 


241 


234 


Reverse 


5 » -ta.aactgggaaggggaaggt-3 ' 


1 20 


269 


235 



Table KB . CNS_neurodegenerationjvl.O 



Tissue Name 


Rel.Exp.(%) Ag4l87, 
Run 215539701 


Tissue Name 


Rel. Exp.(%) 
Ag4187, Run 
215539701 


AD 1 Hippo 


0.0 


Control (Path) 3 Temporal 
Ctx 


7.5 
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AD 2 Hippo 


0.0 


Control (Path) 4 Temporal 


0.4 


/\u j rlippo 


u.u 


AU 1 WCCIpiial L.TX 


u.u 


AD 4 Hippo 


0.0 


AD 2 Occipital Ctx (Missing) 


100.0 


AD 5 Hippo 


1.9 


AD 3 Occipital Ctx 


.... — 

0.0 


IAD 6 Hippo 


3.0 


AD 4 Occipital Ctx 


il-3 


jContro! 2 Hippo 


0.0 


AD 5 Occipital Ctx 


1.1 


r ■"" - - 
{Control 4 Hippo 


0.0 


AD 6 Occipital Ctx 


2.6 


{Control (Path) 3 
HioDO 


0.3 


Control 1 Occipital Ctx 


0.0 


AD 1 Temporal 
Ctx 


0.0 


Control 2 Occipital Ctx 


0.0 


jAD 2 Temporal 
Ctx 


0.0 


Control 3 Occipital Ctx 


0.0 


IAD 3 Temporal 

iCtx 


0.0 


Control 4 Occipital Ctx 


0.0 


AD 4 Temporal 
Ctx 


0.9 

m 


Control (Path) 1 Occipital Ctx 


1.2 


;AD 5 Inf 
'•Temporal Ctx 


2.9 


Control (Path) 2 Occipital Ctx 


0.0 


jAD 5 Sup 
! Temporal Ctx 


2.1 


Control (Path) 3 Occipital Ctx 


0.0 


AD 6 Inf 
Temporal Ctx 


9.9 


Control (Path) 4 Occipital Ctx 


1.0 


AD 6 Sup 
Temporal Ctx 


7-5 


Control 1 Parietal Ctx 


0.0 


Control 1 
Temporal Ctx 


0.0 


Control 2 Parietal Ctx 


12.8 


Control 2 
Temporal Ctx 


0.0 


Control 3 Parietal Ctx 


0.0 


Control 3. 
Temporal Ctx 


0.0 


Control (Path) 1 Parietal Ctx 


0.1 


Control 3 
Temporal Ctx 


0.0 


Control (Path) 2 Parietal Ctx 


5.8 


Control (Path) 1 i 
Temporal Ctx 


0.0 


Control (Path) 3 Parietal Ctx 


6.7 


Control (Path) 2 
Temporal Ctx 


0.0 


Control (Path) 4 Parietal Ctx 


0.4 


Table KC. General screening panel vl:4 


Tissue Name 


Rel. Exp.(%) Ag4187, 
Run 221154082 


Tissue Name 


Rel. Exp.(%) 
Ag4187, Run 
221154082 
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Adipose 


13.2 


Renal ca.TK-10 


41.5 


Melanoma* 
Hs688(A).T 


2.2 


Bladder 


16.4 


Melanoma* 
Hs688(B).T 


5.7 


Gastric ca. (liver met.) NCI- 
N87 


0.0 


Melanoma* M14 


0.0 


Gastric ca K.ATO III 

! 


0 0 

v/.V/ 


jMelanoma* 
iLOXIMVI 


0.0 


Colon ca. SW-948 


1 

0.0 j 


iMelanoma* SK- 

;mel-5 


i 

4.4 


Colon ca. SW480 


i 

0.0 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca * (SW480 met) 
SW620 


Q 7 


Testis Pool 


10.7 


Colon ca. HT29 


1.0 


j Prostate ca.* 
!(bone met) PC-3 


9.9 


Pnlnn HPT- 1 1 (\ 


J .z 


Prostate Pool 


19.8 


Colon ca. CaCo-2 


0.0 


(Placenta 

i 


4.1 


Colon cancer tissue 


18.6 


Uterus Pool 


6.1 


Colon ca SW1 116 


1 0 


iOvarian ca. 
iOVCAR-3 


6.9 


Colon ca. Colo-205 


0.0 j 


iOvarian ca. SK- 
OV-3 


31.0 


Colon ca. SW-48 


o.o : 


iOvarian ca. 
;OVCAR-4 


0.0 


Colon Pool 


9.6 


Ovarian ca. 
OVCAR-5 


55.1 


Small Intestine Pool 


33.0 


Ovarian ca. 
IGROV-1 


1.6 


Stomach Pool 


21.9 


Ovarian ca. 
OVCAR-8 


12.9 


ouiic iviurruw rooi 


i 

U.o 


Ovary 


10.5 


Fetal Heart 


0.0 


Breast ca. MCF-7 


17.3 


Heart Pool 


14.7 


Breast ca. MDA- 
MB-231 


9.1 


L^yiiipii lNuue rooi 


i O.J 


Breast ca. BT 549 


24.8 


Fetal Skeletal Muscle 


7,2 


Breast ca. T47D 


75.8 


Skeletal Muscle Pool 


24.3 


Breast ca. MDA- 
N 


0.8 


Spleen Pool 


26.4 


Breast Pool 


23.3 


Thymus Pool 


26.2 


Trachea 


25.2 


CNS cancer (glio/astro) U87- 
MG 


8.5 


Lung 


3.4 


CNS cancer (glio/astro) U- 


3.7 
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118-MG 




Fetal Lung 


A A 
0.0 


CNS cancer (neuro;met) SK- 
N-AS 


0.7 


Lung ca. NCI- 
N417 


0.0 


CNS cancer (astro) SF-539 


4.9 


Lung ca. LX-1 


29.3 


CNS cancer (astro) SNB-75 


100.0 


Lung ca. NCI- 
H146 


0.0 


CNS cancer (glio) SNB-I9 


5.8 


Lung ca. SHP-77 


2.9 


CNS cancer (glio) SF-295 


76.3 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


13.8 


Lung ca. NCI- 
H526 


1 .0 


Brain (cerebellum) 


0.0 


Lung ca. NCI- 
H23 


0.0 


Brain (fetal) 


49.7 


Lung ca. NCI- 
H460 


41.8 


Brain (Hippocampus) Pool 


36.9 


Lung ca. HOP-62 


25.7 


Cerebral Cortex Pool 


18.9 


Lung ca. NCI- 
H522 


2.4 


Brain (Substantia nigra) Pool 


7.7 


Liver Jo.O 


Brain (Thalamus) Pool 




Fetal Liver 


3.7 |Brain (whole) 


1 s 5J 

l J.O 


Liver ca. HepG2 


4.7 


Spinal Cord Pool 


I D.J 


Kidney Pool 


45.7 


Adrenal Gland 


2.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


1.9 


Renal ca. 786-0 


2.3 


Salivary Gland 


1.3 


Renal ca. A498 


3.6 


Thyroid (female) 


2.5 


Renal ca. ACHN 


21.0 


Pancreatic ca. CAPAN2 


92.7 


Renal ca. UO-3 1 


3.9 


Pancreas Pool 


10.8 



Table KD. Panel 4.1D 



Tissue Name 


Rel.Exp.(%)Ag4187, 
Run 182086758 


Tissue Name 


Rel. Exp.(%) 
Ag4187,Run 
182086758 


Secondary Th 1 act 


2.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


1.8 


HUVEC IFN gamma 


2.4 


Secondary Trl act 


3.0 


HUVEC TNF alpha + IFN gamma 


0.5 


Secondary Thl 
rest 


3.1 


HUVEC TNF alpha + IL4 


0.2 


Secondary Th2 
rest 


0.0 


HUVEC 1L-1 1 


4.3 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.4 


Primary Thl act 


0.0 


Lung Microvascular EC TNFalpha: 


1.8 
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+- IL-I beta 




Primary Th2 act 


0.0 


Microvascular Dermal EC none 


1.8 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-l beta 




Primary Thl rest 


7.2 


Bronchial epithelium TNFalpha + 
ILlbeta 


1 "7 

1./ 


Primary Th2 rest 


4.0 


Small airway epithelium none 


0.0 


Primary Trl rest 


5.5 


Small airway epithelium 
TNFalpha + IL-I beta 


1.3 


CD45RA CD4 
lymphocyte act 


4.8 


Coronery artery SMC rest 


0.4 


CD45RO CD4 
lymphocyte act 


7.6 


Coronery artery SMC TNFalpha + 
IL-lbeta 


0.0 


CD8 lymphocyte 
act 


6.7 


Astrocytes rest 


0.8 


Secondary CD8 
lymphocyte rest 


3.4 


Astrocytes TNFalpha + IL-l beta 


0.0 


Secondary CD8 
lymphocyte act 


5.3 


KU-8 12 (Basophil) rest 


,.0 


|CD4 lymphocyte 
jnone 


6.6 


PMA/ionomycin 


1 

'l 6.4 ! 

! 


2ry 

Thl/Th2/Trl_anti- 
CD95 CH11 


4.5 


CCD 1 106 (Keratinocytes) none 


! 

2.3 


LAK cells rest 


10.4 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-I beta 


1 C\ 

1.9 


LAK cells IL-2 


5.0 


Liver cirrhosis 


0.7 


LAK cells IL- 
2+IL-12 


3.1 


NCI-H292 none 


0.0 


LAK cells IL- 
2+IFN gamma 


6.7 


NPI-H9Q? TI -A 


ft ft 

o.o 


LAK ceIls IL-2+ 
IL-18 


7.7 




0.2 


LAK cells 
PMA/ionomycin 


3.8 


MPT 14909 TT 17 


ft c 

0.5 


NK Cells IL-2 rest 


3.4 


NCI-H292 IFN gamma 


0.4 


Two Way MLR 3 

uay 


15.9 


HPAEC none 


1.4 


Two Way MLR 5 . 
day 


3.1 


HPAEC TNF alpha + IL-l beta 


3.2 


Two Way MLR 7 
day ( 


3.5 


Lung fibroblast none 


7.0 


PBMC rest 


1.1 ] 


Lung fibroblast TNF alpha + IL-l 


1.8 
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beta 




rrJMC rWM 


4.1 


Lung fibroblast IL-4 


1.7 


PBMC PHA-L 


3.3 


Lung fibroblast IL-9 


2.8 


Ramos (B cell) 
none 


2.8 


Lung fibroblast IL-13 


0.3 


Ramos (B cell) 
jionomycin 


,0.3 


Lung fibroblast IFN gamma 


0.5 


B lymphocytes 
PWM 


, t 

I 0 0 . Dermal fibroblast CCD 1070 rest 


3.4 


B lymphocytes 
CD40L and IL-4 


0.3 


Dermal fibroblast CCD1070 TNF 
alpha 


6.0 


EOL-1 dbcAMP 


9.9 


Dermal fibroblast CCD1 070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


2.7 


Dermal fibroblast IFN gamma 


0.9 


Dendritic cells 
none 


10.7 


Dermal fibroblast IL-4 


1.9 


Dendritic cells 
LPS \ 


4.7 \ 


Dermal Fibroblasts rest 


1.1 


Dendritic cells ' 
anti-CD40 r J 


Neutrophils TNFa+LPS 


4.3 


Monocytes rest j 1 9,2 


Neutrophils rest I7.4 


Monocytes LPS \\ 


2.7 | 


Colon 


0.0 


Macrophages rest < 


3.5 ] 


Lung 


3.7 


Macrophages LPS ( 


).0 


Thymus 


17.9 


HUVEC none ]( 


).8 ] 


Cidney 


100.0 


HUVEC starved ] 


.0 







Table KE. General oncology screening panel_v_2.4 



Tissue Name 


Rel.Exp.(%)Ag4187, 
Run 268689531 


Tissue Name 


Rel. Exp.(%) 
Ag4187, Run 
268689531 


Colon cancer 1 


7.8 


Bladder cancer NAT 2 


1.5 


Colon cancer NAT 
1 


0.0 


Bladder cancer NAT 3 


2.4 


Colon cancer 2 


0.0 


Bladder cancer NAT 4 


1.6 


Colon cancer NAT 

2 


0.0 


Adenocarcinoma of the 
prostate 1 


86.5 


Colon cancer 3 


0.0 


Adenocarcinoma of the 
prostate 2 


2.1 


Colon cancer NAT 
3 


8.8 


Adenocarcinoma of the 
prostate 3 


0.0 


Colon malignant 


2.3 


Adenocarcinoma of the 


6.2 
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cancer 4 




prostate 4 




Colon normal 
adjacent tissue 4 


0.0 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


0.0 


Adenocarcinoma of the 
prostate 6 


15.2 


Lung NAT 1 


0.0 


Adenocarcinoma of the 
prostate 7 


14.7 


Lung cancer 2 


100.0 


Adenocarcinoma of the 
prostate 8 


1 .V 

! 


Lung NAT 2 


3.9 


Adenocarcinoma of the 
prostate 9 


ion 
1 5.0 


Squamous cell 
carcinoma 3 


0.0 


Prostate cancer NAT 1 0 


0.0 


Lung NAT 3 


0.0 


Kidney cancer 1 


3.3 


metastatic 
melanoma 1 


26.8 


KidneyNAT 1 


13.4 


Melanoma 2 10.0 


Kidney cancer 2 


71.2 


Melanoma 3 16^8 

metastatic i rt 
melanoma 4 r * 


Kidney NAT 2 
r 

rvianey cancer «> 

< 


23.7 
79.0 


metastatic j 
melanoma 5 j 


Kidney NAT 3 


7.5 


Bladder cancer 1 jo.0 


Kidney cancer 4 


0.0 


Bladder cancer 
NAT 1 


0.0 


Kidney NAT 4 


0.0 


Bladder cancer 2 


3.5 







CNS_neurodegeneration_vl.O Summary: Ag4187 Results from one experiment 
with the CG100730-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 



General_screening_panel_vl.4 Summary: Ag4187 Highest expression of the 
CG100730-01 gene is detected in CNS cancer SNB-75 cell line (CT=32). In addition high 
expression of this gene is seen in pancreatic cancer, CNS cancer, renal, lung, breast and 
ovarian cancer cell line. Therefore, therapeutic modulation of this gene through the use of 
small molecule drugs can be useful in the treatment of these cancers. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, and cerebral cortex. Therefore, this gene may play a role in central nervous 
system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple 
sclerosis, schizophrenia and depression. 
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Panel 4.1D Summary: Ag4187 Highest expression of the CG100730-01 gene is 
detected in kidney (CT=32.7). Thus, expression of this gene can be used to distinguish 
kidney samples from other samples in this panel. Furthermore, small molecule therapies 
designed with the protein encoded for by this gene could modulate kidney function and be 
important in the treatment of inflammatory or autoimmune diseases that affect the kidney, 
including lupus and glomerulonephritis. 

In addition, low expression of this gene is seen in resting primary Thl and Trl 
cells, activated CD45RO CD4 lymphocyte, 3 day Two Way MLR, eosinophils, 
monocytes, macrophages, dendritic cells, ionomycin treated basophils and thymus. 
Therefore, therapeutic modulation of this gene may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4187 Highest 
expression of the CG100730-01 gene is detected in lung cancer sample (CT=33.7). with 
significant expression also seen in metastatic melanoma, lung, prostate and kidney 
cancers. In addition, expression of this gene is higher in the cancers than in the normal 
adjacent tissue. Therefore, expression of this gene could be as a marker to detect the 
presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of these cancers. 

L. CG100819-01: Polynucleotide Phosphorylase 

Expression of gene CGI 008 19-01 was assessed using the primer-probe set 
Ag4195, described in Table LA. Results of the RTQ-PCR runs are shown in Tables LB, 
LC and LD. 



Table LA . Probe Name Ag4195 



Primers 


Sequences 


Length 


Start 
Position 


SEQ 
ID 
No 


Forward 


5 ' -atcgttcaattagaccgctctt-3 • 


22 


430 \ 


236 


Probe 


TET-5 ■ -tccagctggctacttctatgatacacagg-3 ' -TAMRA 


29 


452 


237 


Reverse 


5 ' -acatcaggctcatttacaccat-3 ' 


22 ; 


505 


238 



Table LB . General screening panel vl.4 
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Tissue Name 


Rel. Exp.(%) Ag4195, 
Run 221157738 


Tissue Name 


Rel. Exp.(%) 
Ag4195,Run 
221157738 


Adipose 


3.4 


Renal ca.TK-10 


16.8 


Melanoma* . 

T I„/OOZ A \ TP 

Hs688(A).T 


5.7 


Bladder 


13.3 


Melanoma* 
Hs6a8(B).T 


5.6 


Gastric ca. (liver met.) NCI- i 1ftA . A 
N87 I 1000 


jMelanoma* M 1 4 


16.8 


Gastric ca. KATOIII 


58.2 


Melanoma* 
LOXIMVI 


25.3 


Colon ca. SW-948 


7.1 


Melanoma* SK- 

TV <f t~< T C 

MEL-5 


25,9 


Colon ca. SW480 


35.8 


Squamous cell 
carcinoma SCC-4 


10.1 


Colon ca.* (SW480 met) 
SW620 


19.2 


Testis Pool 


5.1 


Colon ca. HT29 


9.0 


Prostate ca.* (bone 
met) PC-3 


! 

20.4 


Colon ca. HCT-116 


27.5 


Prostate Pool ;2.1 


Colon ca. CaCo-2 


30.8 | 


Placenta j0.6 


Colon cancer tissue |8.8 5 


Uterus Pool ;2.3 


Colon ca. SW1116 


3.0 1 


Ovarian ca. ! 
OVCAR-3 j 10 ' 6 


Colon ca. Colo-205 


! 


Ovarian ca. SK- 
OV-3 


17.6 


Colon ca. SW-48 


7.3 


Ovarian ca. 
OVCAR-4 


5.8 


Colon Pool 


4.9 


Ovarian ca. 
OVCAR-5 


12.8 


Small Intestine Pool 


3.2 


Ovarian ca. 
IGROV-l 


5.8 


Stomach Pool 


2.7 


Ovarian ca. 
OVCAR-8 


3.1 


Bone Marrow Pool 


1.8 


Ovary 


2.5 


Fetal Heart 


3.7 


Breast ca.MCF-7 14.3 


Heart Pool 


3.4 


Breast ca. MDA- 
MB-231 


15.1 


Lymph Node Pool 


6.3 


Breast ca. BT 549 J; 


26.8 


Fetal Skeletal Muscle 


1.8 


Breast ca. T47D ; 


>1.0 


Skeletal Muscle Pool 


7.7 


Breast ca. MDA-N 


11.3 


Spleen Pool j. 


5.0 


Breast Pool i 


5.6 


rhymus Pool 


4.4 


Trachea : 


5-1 < 


"NS cancer (glio/astro) U87- 


14.0 
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MG 




Lung 


0.9 


CNS cancer (glio/astro) U-1I8- 
MG 




Fetal Lung 


8.6 


CNS cancer (neurojmet) SK-N- 


22.1 


Lung ca.NCI-N417 


3.5 


v^ino cdiicei ^asiro ) orOjV 


.3.1 


Lung ca. LX-1 


20.0 


v^ino cdiiccr ^asiroj oinjd-o 


\\lA 


Lung ca. NCI-HI 46 


6.7 


^lNO CdllCCr ^gnoj oiNn-t y 


5.J 


Lung ca. SHP-77 


39.0 


^ino cdiiLcr ^giio^ jr-/Vj 


20.4 


Lung ca. A549 


23.0 


Did in ^/\rnygaaia^ IOOI 


j.9 


Lung ca. NCI-H526 


4.4 


ordHi ^cereoenum^ 


2.2 


Lung ca. NCI-H23 


21.3 


Brain (fetal) 


3.8 


Lung ca. NCI-H460 


16.8 


Brain (Hippocampus) Pool 


3.7 


Lung ca. HOP-62 


6.5 


Cerebral Cortex Pool 


5.0 


jLungca. NCI-H522 jl2.3 


Brain (Substantia nigra) Pool 


3.4 


Liver 0.4 


Brain (Thalamus) Pool 


7.2 


Fetal Liver |6.3 jBrain (whole) i3 5 


Liver ca. HepG2 


S.2 


Spinal Cord Pool 


4.2 


K.dney Pool |6.2 (Adrenal Gland 


2.8 
1.3 


Fetal Kidney (5.4 [Pituitary aland Pool ! 


Renal ca. 786-0 j 


5 -3 [Salivary Gland jo.6 


Renal ca. A498 |3. 1 


Thyroid (female) 


1.8 


Renal ca. ACHN 


6.4 


Pancreatic ca. CAPAN2 


8.5 


Renal ca. UO-3 1 1 


9.9 


Pancreas Pool 


5.4 ■ 


Table LC. Panel 4.1D 




Tissue Name 


Rel. Exp.(%) 1 
Ag4195,Run 
174255656 j 


Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
174255656 


Secondary Thl act 


54.3 J] 


HUVEC IL-lbeta 


20.6 


Secondary Th2 act 


1 00.0 jHUVEC IFN gamma 


0.0 


Secondary Trl act 


287 HUVECTNF alpha + IFN 
[gamma 


26.4 


Secondary Thl rest 


7.5 1 


-IUVEC TNF alpha + IL4 


15.6 


Secondary Th2 rest |i 


6.6 J 


iUVECIL-11 jn.4 


Secondary Trl rest ' 


7.5 I 


^ung Microvascular EC none : 


>0.7 


Primary Thl act 


57.0 * 


„ung Microvascular EC ^ 
rNFalpha + IL-Ibeta 1 


5.9 


Primary Th2 act u 


J7.0 ft 


Microvascular Dermal EC none ] 


9.8 


Primary Trl act 5L4 ft 


/Iicrosvasular Dermal EC 1 


2.0 
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TNFalpha + IL-lbeta 




Primary Thl rest 




Bronchial epithelium TNFalpha 
+ ILlbeta 


10.5 


Primary Th2 rest 


3.0 


Small airway epithelium none 


5.7 


Primary Tr I rest 




Small airway epithelium 
TNFalpha + IL-lbeta 


12.4 


CD45RA CD4 
j lymphocyte act 


46.7 


Coronery artery SMC rest 


1 1.7 


ICD45RO CD4 
j lymphocyte act 


62.0 


Coronery artery SMC TNFalpha 
+ 1L-Ibeta 


10.0 


jCDS lymphocyte act 


37.1 


Astrocytes rest 


4.8 


Secondary CD8 
lymphocyte rest 


41.2 


Astrocytes TNFalpha + IL-lbeta 


6.1 


Secondary CD8 
lymphocyte act 


14.8 


KU-812 (Basophil) rest 


29.1 


;CD4 lymphocyte • 
none j 


KU-8 12 (Basophil) 
PMA/ionomycin 


49.3 


2ry j ~I 







iThl/Th2/Trl_anti- ;8.1 
!CD95 CH11 



|LAK cells rest [12.3 



CCD 1 1 06 (Keratinocytes) none |24.0 



CCD 1 106 (Keratinocytes) 
•TNFalpha + IL-lbeta' 



24.5 



ILAK cells IL-2+IL- 
i 12 


28.9 


NC1-H292 none 


J" ' 

13.1 


LAK cells IL-2+IFN 
gamma 


25.3 


NCI-H292 IL-4 


29.5 


LAK cells IL-2+ IL- 
18 


33.4 


NCI-H292 IL-9 


30.1 


LAK cells 
PMA/ionomycin 


31.2 


NCI-H292 IL-13 


31.6 


NK Cells 1L-2 rest 


24.7 


NCI-H292 I FN gamma 


47.6 


Two Way MLR 3 
day 


15.5 


HPAEC none 


14.2 


Two Way MLR 5 
day 


24.1 


HPAEC TNF alpha + IL-1 beta 


27.9 


Two Way MLR 7 
day 


13.5 


Lung fibroblast none 


13.1 


PBMC rest 


4.6 


Lung fibroblast TNF alpha + IL- 
1 beta 


22.5 


PBMC PWM 


29.5 


Lung fibroblast IL-4 


16.2 


PBMC PHA-L 


18.4 


Lung fibroblast IL-9 


21.2 


Ramos (B cell) none 


50.7 


Lung fibroblast IL-13 


15.8 
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Ramos (B cell) 
ionomycin 


56.3 


Lung fibroblast IFN gamma 


20.2 


B lymphocytes 
PWM 


37.4 


Dermal fibroblast CCD1070 rest 


23.7 


B lymphocytes 
CD40L andIL-4 


17 7 


Dermal fibroblast CCD1070 
TNF alpha 


26.8 


EOL-1 dbcAMP 


22.7 


Dermal fibroblast CCD 1 070 IL- 
1 beta 


16.0 


EOL-1 dbcAMP 
rjvi/A/junornycin 


7.6 


Dermal fibroblast IFN gamma 


7.0 


uenaruic cens none 


7.6- 


Dermal fibroblast IL-4 


10.6 


uenaruic ceils Lro 


27.9 


Dermal Fibroblasts rest 


5.7 


L/enanuc ceus anti- 
CD40 


10.5 


Neutrophils TNFa+L PS 


0.9 


Monocytes rest 


8.2 


Neutrophils rest 


2.5 


Monocytes LPS 


41-5 iColon 


3.1 


Macrophages rest 


9.9 


Lung 


6.8 


jMacrophages LPS j 


30.4 ; 

L 


Thymus 


10.4 


HUVEC none 


12.5 'Kidney 7.4 


HUVEC starved 


14.0 j T 



Table LP . General oncology screening panel_v_2.4 



Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
268689535 


i 

Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
268689535 


Colon cancer 1 


23.2 


Bladder cancer NAT 2 


0.7 


Colon cancer NAT 1 


7.3 


Bladder cancer NAT 3 


1.6 


Colon cancer 2 


45.4 


Bladder cancer NAT 4 


4.0 


Colon cancer NAT 2 


19.6 


Adenocarcinoma of the prostate 
1 


26.2 


Colon cancer 3 


95.9 


Adenocarcinoma of the prostate 
2 


2.0 


Colon cancer NAT 3 


18.7 


Adenocarcinoma of the prostate 
3 


15.5 


Colon malignant cancer 4 


72.7 


Adenocarcinoma of the prostate 
4 


30.6 


Colon normal adjacent tissue 4 


8.0 


Prostate cancer NAT 5 


3.3 


Lung cancer 1 


18.0 


Adenocarcinoma of the prostate 
6 


4.6 


Lung NAT 1 


2.2 


Adenocarcinoma of the prostate 
7 


5.8 


Lung cancer 2 


100.0 


Adenocarcinoma of the prostate 
8 


1.8 
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Lung NAT 2 


3.4 


Adenocarcinoma of the prostate 
9 


18.3 


Squamous cell carcinoma 3 


39.2 


Prostate cancer NAT 1 0 


2.1 


Lung NAT 3 


0.6 


Kidney cancer I 


17.1 


metastatic melanoma 1 


22.7 


KidneyNAT 1 


7.7 


jMelanoma 2 


2.0 


Kidney cancer 2 


38.4 


jMelanoma 3 


2.5 


Kidney NAT 2 


15.5 


metastatic melanoma 4 


31.0 


Kidney cancer 3 


10.2 


f ; — 

{metastatic melanoma 5 


44.4 


Kidney NAT 3 


7.2 


Bladder cancer I 


2.5 


Kidney cancer 4 


18.7 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


7.7 


Bladder cancer 2 


3.8 







CNS_neurodegencration_vl.O Summary: Ag4195 Results from one experiment 
with the CG100819-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 



General jscreening_panel_vl .4 Summary: Ag4195 Highest expression of the 
5 CG 1 008 1 9-0 1 gene is detected in a gastric cancer NCI-N87 cell line (CT=25.3). High 
expression of this gene is seen in cluster of CNS cancer, colon, gastric, renal, lung, breast, 
ovarian, pancreatic, prostate, squamous cell carcinoma cell lines and melanoma. 
Therefore, therapeutic modulation of this gene could be beneficial in the treatment of these 
cancers. 

1 0 Among tissues with metabolic or endocrine function, this gene is expressed at high 

to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

1 5 Interestingly, expression of this gene is higher in fetal lung and liver (CTs=29) as 

compared to the corresponding adult tissues (CTs=32-33). Therefore, expression of this 
gene could be useful in distinguishing the fetal lung and liver from the corresponding adult 
tissues. 

In addition, this gene is expressed at high levels in all regions of the central 
20 nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
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central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4195 Highest expression of the CG100819-01 gene is 
detected in activated secondary Th2 cells (CT=27). This gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_vl.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

General oncology screening pancl_vJ2.4 Summary: Ag4195 Highest 
expression of the CGI 008 19-01 gene is detected in lung cancer (CT=28.7), with 
significant expression also seen in metastatic melanoma, colon, lung, prostate and kidney 
cancers. In addition, expression of this gene is higher in the cancers than in the normal 
adjacent tissue. Therefore, expression of this gene could be as a marker to detect the 
presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of these cancers. 

M. CG100872-01: Protein-Arginine Deiminase 

Expression of gene CG100872-01 was assessed using the primer-probe sets 
Ag4197 and Ag4299, described in Tables MA and MB. Results of the RTQ-PCR runs are 
shown in Table MC. 



Table MA . Probe Name Ag4197 



Primers Sequences 


| Length 


Start 
Position 


SEQID 
No 


Forwardjs 1 -gtcccagatgactctgaatgtc-3 1 


i 22 


539 


239 
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Probe 


TET-5 • -caaggccccagctgtatcttaaagaa-3 ' -TAMRA 


26 


561 




Reverse 


5 1 -ccttggaggtatggaggactag-3 • 


22 


1 594 






Table MB. Probe Name Ag4299 




Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -gtcccagatgactctgaatgtc-3 ' 


22 


539 


242 


Probe 




TET-5 » -caaggccccagctgtatcttaaagaa-3 ' -TAMRA 


26 


561 


243 


Reverse 


5 1 -ttcgactcttccttggaggtat-3 » 


22 


604 


244 



Table MC . Panel 4. ID 



Tissue Name 


Rel. 

Exp.(%) 
Ag4l97, 
Run 

174255682 


Rel. Exp.(%) 
Ag4299, Run 
183712613 


Tissue Name 


Rel. 

Exp.(%) 
Ag4197, 
Run 

174255682 


Rel. 

Exp.(%) 
Ag4299. 
Run 

183712613 


iSecondary Thl act 


o.o jo.o 


HUVEC I L-l beta 


0.0 


0.0 


(Secondary Th2 act 


o.o jo.o 


nuvtt iriN gamma 


0.0 


0.0 


i 

-Secondary Tr I act 

j ■*** 


! 

0.5 ,29.1 

i 


HUVEC TNF alpha 
+ 1 FN gamma 


0.0 


0.0 


'Secondary Th 1 
irest 


o.o io.o 

i 


HUVEC TNF alpha 

4- II A 


0.0 


0.0 

: 


Secondary Th2 
rest 


o.o Io.o 

i 


HUVEC IL-11 


0.0 


1 

0.0 


Secondary Trl rest 


0.0 


0.0 


Lung Microvascular 
EC none 




u.u 


Primarv Th 1 art 


0.0 


0.0 


Lung Microvascular 
EC TNFalpha + IL- 
Ibeta 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


0.0 


Primary Trl act 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Primary Thl rest 


0.0 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


0.0 


Primary Th2 rest 


0.0 


0.0 


Small airway 
epithelium none 


0.0 


0.0 


Primary Trl rest 


0.0 


0.0 


Small airway 
epithelium 
TNFalpha +IL- 
lbeta 


0.0 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC rest 


0.0 


o.o 
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CD45ROCD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalnha + 
IL-lbeta 




A ft 


CD8 lymphocyte 
act 


0.0 


0.0 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


0.0 

i 


Astrocytes 

m lull \j\+y lvj»3 

TNFalpha + IL- 
lbeta 


0.0 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


jo.o 


KU-812 CBasonhih 
rest 


|o.o 


0.0 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


0.0 


ivy 

Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


! 
1 

lo.o 


CCD1 106 

(Keratinocytes) none 


ft ft 


u.u 


LAK cells rest 


0.0 


! 

i 

;0.0 

i 

1 


CCD] 106 
(Keratinocytes) 
TNFa)pha+ IL- 
lbeta 


0.0 


0.0 


LAK cells IL-2 


0.0 


=0.0 


Liver cirrhosis 


0.0 


|0.0 i 


LAK cells IL- 
'2+1L-12 


0.0 


jo.o 


NCI-H292 none 


0.0 


"i 

0.0 


LAK cells IL- 
2+1 FN gamma 


0.0 


1 

0.0 

i 


NCI-H292 IL-4 


0.0 


0.0 | 


LAK cells IL-2+ 
1L-18 


0.0 


0.0 


NCI-H292 IL-9 


0.0 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


0.0 


NCI-H292 IL-13 


0.0 


0.0 


NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292 I FN 
gamma 


0.0 


0.0 


Two Way MLR 3 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 


Two Way MLR 5 
day 


0.0 


0 0 


HPAEC TNF alpha 
+ IL-1 beta 




0.0 


Two Way MLR 7 
day 


0.0 


0.0 


Lung fibroblast none 


3.0 


0.0 


PBMC rest I 


3.0 


0.0 

1 


Lung fibroblast TNF 
alpha + IL-1 beta 


3.0 


3.0 




J.O 


O.O ] 


Lung fibroblast IL-4 


1.1 1 


3.0 


PBMC PHA-L < 


).0 l< 


3.0 1 


Lung fibroblast IL-9 ( 


).0 1 


3.0 


Ramos (B cell) 
none 


).0 ( 


3.0 1 


^ung fibroblast IL- , 
13 


>.5 ( 


3.0 


Ramos (B cell) 
ionomycin 


).0 ( 


).0 1 

i 


^ung fibroblast IFN 
;amma 


).0 ( 


).0 
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B lymphocytes 
PWM 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


0.0 


Dermal fibroblast 
CCD J 070 TNF 
alpha 


0.0 


0.0 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1 070 I L- 1 beta 


0.0 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
I FN gamma 


1.4 


0.0 


Dendritic cells 
none 


0.0 


0.0 


Dermal fibroblast 
IL-4 




A A 


Dendritic cells 
LPS 


0.0 


0.0 


Dermal Fibroblasts 
rest 


0.0 


0.0 


Dendritic cells 
anti-CD40 


0.0 


0.0 


Neutrophils 
TNFa+LPS 


2.8 


0.0 


Monocytes rest 


0.0 


0.0 


Neutrophils rest 


0.0 


0.0 


Monocytes LPS 


0.0 ! 0.0 

1 


Colon 


1.0 


30.4 


Macrophages rest 


0.0 


0.0 


Lung 


0.0 


0.0 


[Macrophages LPS 


0.0 


0.0 (Thymus 


10.8 


0.0 


JhUVEC none 


0.0 | 


0.0 jKidney 


100.0 


100.0 | 


|HUVEC starved 


0.0 

1 


0.0 ] 


I 
1 




i 
i 

« 



1 — ! i i | 

CNS_neurodegeneration_vl.O Summary: Ag4197/Ag4299 Expression of the 
CG100872-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 



(data not shown). 

General_screeningjpanel_vl.4 Summary: Ag41 97 Expression of the 
CGI 00872-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

Panel 4.1D Summary: Ag4197/Ag4299 Low expression of the CG100872-01 
gene is seen in kidney (CTs=32-36). Therefore, expression of this gene can be used to 
distinguish this sample from other samples used in this panel. In addition, therapeutic 
modulation of this gene can be useful in the treatment of autoimmune and inflammatory 
diseases that affect the kidney including lupus and glomerulonephritis. 

General oncology screening panel_v_2.4 Summary: Ag4197 Expression of the 
CG100872-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

N. CG100980-01: Protein-Arginine Deiminase Type III 
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Expression of gene CGI 00980-01 was assessed using the primer-probe set 
Ag4200, described in Table NA. Results of the RTQ-PCR runs are shown in Tables NB 
andNC. 

Table NA . Probe Name Ag4200 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 ' -actgcaccttcattgatgactt-3 ' 


22 


1881 


245 


Probe 


TET-5 » -actccataccacatgctgcatggg-3 ' -TAMRA 


24 


1904 


246 


Reverse 


5 1 -cacttgaaagagaagggctttc-3 ' 


22 


1956 


247 



Table NB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4200, Run 
221178493 


Tissue Name 

- - 


Rel. Exp.(%) 
Ag4200, Run 
221178493 


Adipose 


0.0 


Renal ca. TK-10 


jo.i 


Melanoma* 
Hs688(A)T 


0.0 


Bladder 


jo.o 


Melanoma* 
Hs688(B).T 


0.0 

i 


"Gastric ca. (liver met.) NCI- 
jN87 


i0.8 

1 


Melanoma* M14 jO.O 

. , , i 


Gastric ca. KATO HI 


0.1 


Melanoma* 
LOXIMVI 


i 

jo.o 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


2.1 


Squamous cell 
carcinoma SCC-4 


2.4 


Colon ca.* (SW480 met) 
SW620 


0.0 


Testis Pool 


0.1 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.1 


Colon ca. HCT-1 16 


0.1 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV- 

3 


0.4 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.1 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


3.6 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
1 


0.2 


Stomach Pool 


0.0 
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Ovarian ca. 
OVCAR-8 


8.5 


Bone Marrow Pool 


0.0 


Ovary 


0 0 


Fetal Heart jo.O 


Breast ca. MCF-7 


0 9 


Heart Pool jo.O 


Breast ca MDA- 
MB-231 


0.0 


Lymph Node Pool jo.O 


Breast ca BT 549 


0 2 


Fetal Skeletal Muscle 


jo.o 


{Breast ca. T47D 


8.1 


Skeletal Muscle Pool 


0.0 


(Breast ca. MDA-N 


0.0 


Spleen Pool jo.O 


(Breast Pool 


0.0 


Thymus Pool jo.O 


Trachea 


0.0 


CNS cancer (glio/astro) U87- 
MG 


6.0 


Lung 


0.0 


CNS cancer (glio/astro) U-l 18- 
MG 


0.0 


jpetal Lung 


0.0 


CNS cancer (neuro:met) SK-N- 
AS 


0.0 


[Lung ca.NCT-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


jLung ca. LX-I 


0.2 


CNS cancer (astro) SNB-75 :0.4 


(Lung ca.NCI-H 146 jO.O 


CNS cancer (g! io) SN B- 1 9 jo.3 


jLung ca. SHP-77 jo.O 


CNS cancer (glio) SF-295 ;2.4 


jLung ca. A549 jo.5 


Brain (Amygdala) Pool 


o.o ! 


Lungca.NCI-H526 |o.O 


Brain (cerebellum) |0.0 


Lung ca. NCI-H23 jo.l 


Brain (fetal) 


0.0 


Lungca. NCI-H460 jO.O 


Brain (Hippocampus) Pool 


0.0 


Lungca. HOP-62 


0.2 


Cerebral Cortex Pool 


0.0 


Lungca. NCI-H522 


0.0 


Brain (Substantia nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.6 


Thyroid (female) 


0.0 


Renal ca. ACHN 


100.0 


Pancreatic ca. CAPAN2 


0.1 


Renal ca. UO-3 1 i 


3.2 


Pancreas Pool 


0.0 


Table NC. General oncology screening p*n*l v 7 d 


] 

Tissue Name , 


Rel. Exp.(%) I 
^g4200, Run 
>68695263 


rissue Name 

: 


Rel. Exp.(%) 
Ag4200, Run 
268695263 
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Colon cancer 1 


0.2 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


0.0 


Bladder cancer NAT 3 


6.2 


Colon cancer 2 


2.1 


Bladder cancer NAT 4 


0.2 


Colon cancer NAT 2 


0.2 


Adenocarcinoma of the prostate 
1 


4.9 


Colon cancer 3 


0.2 


Adenocarcinoma of the prostate 
2 


0.0 


Colon cancer NAT 3 


0 2 iAdenocarcinoma of the prostate 


0.2 


Colon malignant 
cancer 4 


0.0 


Adenocarcinoma of the prostate 
4 


11.7 


Colon normal 
adjacent tissue 4 


0.0 


Prostate cancer NAT 5 


0.8 


Lung cancer 1 


q j iAdenocarcinoma of the prostate 


0.0 


Lung NAT 1 


n n ; Adenocarcinoma of the prostate 

o.o j7 


0.2 


Lung cancer 2 j0.4 1 Adenocarcinoma of the prostate | Q Q 

; :8 j 


Lung NAT 2 jo 0 1 Adenocarcinoma of the prostate 

i ; 9 


0.0 


Squamous cell 
carcinoma 3 


7.9 'Prostate cancer NAT 10 


1.0 


Lung NAT 3 


0.0 


Kidney cancer 1 


0.2 


metastatic 
melanoma 1 


0.0 


KidneyNAT 1 


0.0 


Melanoma 2 


0.9 


Kidney cancer 2 


0.6 


Melanoma 3 


0.4 


Kidney NAT 2 


0.0 


metastatic 
melanoma 4 


0.2 


Kidney cancer 3 


0.0 


metastatic 
melanoma 5 


0.0 


Kidney NAT 3 


0.0 


Bladder cancer 1 


0.2 


Kidney cancer 4 


0.2 


Bladder cancer NAT 
1 


D.O 


Kidney NAT 4 


0.0 


Bladder cancer 2 


100.0 







CNS_neurodegeneration_vl.O Summary: Ag4200 Expression of the 
CGI 00980-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 



General_screening_panel_vl.4 Summary: Ag4200 Highest expression of the 
CGI 00980-01 gene is detected in renal cancer ACHN cell line (CT=23.6). Significant 
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expression of this gene is seen exclusively in cluster of CNS, pancreatic, colon, renal, 
lung, breast, ovarian, squamous cell carcinoma and prostate cancers cell lines. Therefore, 
expression of this gene can be used as diagnostic marker for these cancers and therapeutic 
modulation through the use of small molecule target could be useful in the treatments of 
5 these cancers. 

Panel 4.1 D Summary: Ag4200 Expression of the CGI 00980-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 5 Islet Summary: Ag4200 Expression of the CG100980-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

10 General oncology screening panel v 2.4 Summary: Ag4200 Highest 

expression of the CGI 00980-01 gene is detected in bladder cancer (CT=28.7), with low 
but significant expression of this gene in squamous cell carcinoma, colon and prostate 
cancers. In addition, expression of this gene is higher in the cancers than in the normal 
adjacent tissue. Therefore, expression of this gene may be useful as a marker to detect the 

1 5 presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of these cancers. 

O. CG56763-01: GPCR 

Expression of gene CG56763-01 was assessed using the primer-probe set Ag3012, 
20 described in Table OA. Results of the RTQ-PCR runs are shown in Tables OB and OC. 

Table OA . Probe Name Ag3012 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 1 -ctctttgtcctggtggagaac-3 ' 


21 


162 


248 


Probe 


TET-5 1 -acctccctccacaggcccatgtacta-3 ' -TAMRA 


26 


213 


249 


Reverse i 


5 ' -gaaagacatggagctcagaaag-3 * \ 


22 


239 


250 



Table OB. Panel 13D 



Tissue Name 


Rel. Exp.(%) 
Ag3012, Run 
167810404 


Tissue Name 


Rel. Exp.(%)Ag3012, 
Run 167810404 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


1.6 


Renal ca. A498 


O.O 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 
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Salivary gland 


2.8 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


12.6 


(Liver ca. (hepatoblast) 
|HepG2 


0.0 


Brain (cerebellum) 


0.0 jLung 


0.0 


Brain (hippocampus) 


5.5 ;Lung (fetal) 


0.0 


Brain (substantia nigra) 


3.0 jLung ca. (small cell) LX-l 


1.4 


Brain (thalamus) 


i — ........ 

, jLung ca. (small cell) NCI- 
|H69 


0.0 


Cerebral Cortex 


fl 0 ILung ca. (sxell var.) SHP- 
j77 


2.4 


Spinal cord 


, ftn A iLung ca. (large ceII)NCI- 
,00 ° 1H460 


0.0 


glio/astro U87-MG 


n n ILung ca. (non-sm. cell) 
U *° !A549 


0.0 


glio/astro U-l 18-MG 


0.0 


Lung ca. (non-s.cell) NCI- 

;H23 


0.0 


astrocytoma SW I78j 


0.0 


Lung ca. (non-s.cell) HOP- \ n n 

62 ^ j 


neuro*; metSK-N-AS 


0.0 


Lung ca. (non-s.cl) NCI- L n 
H522 | 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) SW 900 


0.0 


astrocytoma SNB-75 


0.0 


Lung ca. (squam.) NCl- 
H596 


0.0 


glioma SNB-19 


3.2 


Mammary gland 


2.9 


glioma U25l 


0.0 jBreastca.* (pl.ef) MCF-7 


3.5 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) MDA- 
MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) T47D 


0.0 


Heart 


2.6 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


I2.5 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR-3 


3.0 


Thymus 


9.5 


Ovarian ca. OVCAR-4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR-5 


3.5 


Lymph node 


0.0 


Ovarian ca. OVCAR-8 


0.0 


Colorectal 


4.3 


Ovarian ca. IGROV- 1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) SK- 
OV-3 


0.0 
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Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


35.4 


Colon ca. CaCo-2 


0.0 


Melanoma Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


6.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX 1MV1 


0.0 


Trachea 


0.0 


Melanoma* (met) SK- 
MEL-5 


0.0 


Kidney 


0.0 


Adipose 


0.0 


Table OC. Panel 4D 


|Rel. Exp.(%) 
Tissue Name j Ag30 1 2, Run 
1 164404080 


Tissue Name 


Rel. Exp.(%) 
Ag3012, Run 
164404080 


Secondary Thl act 


0.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC 1FN gamma 


0.0 


Secondary Trl act 


0.0 


HUVECTNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVECTNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.0 


Primary Th 1 act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + lLlbeta 


0.0 


Primary Th2 rest 


8.0 


Small airway epithelium none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 
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CD45RO CD4 
lymphocyte act 


u.u 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astroc}4es rest 


7.2 


Secondary CDS 
lymphocyte rest 


7.9 


Astrocytes TNFalpha + IL- 
1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


ICD4 Ivmnhorvtp nrm<» 

i 

j 




KU-8 12 (Basophil) 
PMA/ionomycin 


i 

7.7 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCDU06(Keralinocytes) 
none 


0.0 


LAK cells rest 


10.7 


CCDlI06(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


9.4 


NCI-H292 none 


0.0 

1 


LAK cells IL-2+IL- 18 


14.1 

1 

0.0 


NCI-H292 IL-4 io.O ^ 

\ i 


LAK cells 
PMA/ionomycin 


NCI-H292 IL-9 


• 

18.9 | 


NK Cells IL-2 rest "0.0 


NCI-H292IL-13 


0.0 1 


(Two Way MLR 3 day |4.0 


NCI-H292 IFN gamma 


j 

0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0:0 


HPAECTNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


8.1 


Lung fibroblast TNF alpha + 
IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 |] 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


o.o ~p 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes CD40L 
andIL-4 


19.6 1 
ji 


Dermal fibroblast CCD1070 
•est 


0.0 


EOL-1 dbcAMP 


16.7 || 


Dermal fibroblast CCD 1 070 
rNF alpha • { 


o.o 


EOL-1 dbcAMP 
PMA/ionomycin 


u 


Dermal fibroblast CCD 1 070 
L-lbeta * 


3.0 


Dendritic cells none ( 


).0 jDermal fibroblast I FN gamma ( 


).0 


Dendritic cells LPS ^ 


1.5 (Dermal fibroblast IL-4 ( 


).0 


Dendritic cells anti- ( 


>.0 IBD Colitis 2 ( 


).0 
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CD40 








Monocytes rest 


0.0 


IBD Crohn's 


57.0 


Monocytes LPS 


0.0 


Colon 


56.6 


Macrophages rest 


0.0 


Lung 


65.5 


Macrophages LPS 


0.0 


Thymus 


14.9 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved lo.O 







CNS_neurodegeneration_vl.O Summary: Ag3012~Expression of this gene is 
low/undetectable (CTs > 34.5) across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag3012 This gene is expressed at low levels in the 
samples derived from spinal cord (CT = 32.1 ) and testis (CT=33.6). Thus, the expression 
5 of this gene could be used to distinguish these samples from the other samples in the 

panel. Furthermore, therapeutic modulation of the expression or function of this gene may 
be effective in the treatment of CNS disorders, fertility and hypogonadism. 

Panel 4D Summary: Ag3012 This gene is expressed at low levels in a sample 
derived from liver cirrhosis (CT = 33.4). In addition, expression of this gene is not 
1 0 detected in normal liver in Panel 1 .3D, suggesting that its expression is unique to liver 
cirrhosis. This gene encodes a putative GPCR; therefore, antibodies or small molecule 
therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. This gene is also 
expressed at low levels in the lung and colon. 

General oncology screening panel_v_2.4 Summary: Ag301 2 Expression of this 
1 5 gene is low/undetectable (CTs > 34.5) across all of the samples on this panel (data not 
shown). 

P. CG56777-01: Prostaglandin-F Synthase 1 

Expression of gene CG56777-01 was assessed using the primer-probe set Ag3017, 
20 described in Table PA. 

Table PA. Probe Name Ag3017 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 • -agaaacttggaccggactatgt-3 1 


22 


349 


251 


Probe 


TET-5 ' -tcatgtaccatttgctatgaagcctg-3 1 -TAMRA 


26 


386 


252 


Reverse 


5 ■ -tcctttggcagtaattctttcc-3 ' 


22 


412 


253 
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CNS_neurodegeneration_yl.O Summary: Ag3017 Results from one experiment 
with the CG56777-01 gene are hot included. The amp plot indicates that there were 
experimental difficulties with this run. 

Panel 1.3D Summary: Ag3017 Expression of the CG56777-01 gene is 
5 Iow/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3017 Expression of the CG56777-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag30 17 Expression of the 
CG56777-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
10 (data not shown). 

Q. CG56941-01: >ptnr:SPTREMBL-ACC:O60523 Ribonuclease H Type II 

Expression of gene CG56941-01 was assessed using the primer-probe set Ag3096. 
described in Table QA. Results of the RTQ-PCR runs are shown in Tables QB, QC ? QD 
15 andQE. 

Table OA . Probe Name Ag3096 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 ' -gtttcagaagggcaggaaaa-3 ' 


20 


321 


254 


Probe 


TET-5 • -caacatggacaagaatcggagacgaa-3 ' -TAMRA 


26 j 


342 


255 


Reverse 


5 ' -ttcatctccatctccatcca-3 ' 


20 


394 


256 



Table OB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3096, Run 
208976837 


Tissue Name 


Rel. Exp.(%) Ag3096, 
Run 208976837 


AD 1 Hippo 


18.9 


Control (Path) 3 Temporal 
Ctx 


11.3 


AD 2 Hippo 


17.1 


Control (Path) 4 Temporal 
Ctx 


43.2 


AD 3 Hippo 


7.1 


AD 1 Occipital Ctx 


46.7 


AD 4 Hippo 


5.1 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


6.7 


AD 6 Hippo 


34.6 


AD 4 Occipital Ctx 


29.3 


Control 2 Hippo 


13.8 


AD 5 Occipital Ctx 


20.3 
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Control 4 Hippo 


8.0 |AD 6 Occipital Ctx 


27.2 


Control (Path) 3 
Hippo 


6.0 


Control 1 Occipital Ctx 


1 1 5 


AD 1 Temporal Ctx 


16.2 


Control 2 Occipital Ctx 


4.1 


AD 2 Temporal Ctx 


25.5 




49.0 


AD 3 Temporal Ctx 


3.4 


Control 4 Occipital Ctx 


4.0 


AD 4 Temporal Ctx 


22.5 


Control (Path) 1 Occipital 
Ctx 


61.6 

i 


AD 5 Inf Temporal 
Ctx 


55.5 


Control (Path) 2 Occipital 
Ctx 


18.2 


AD 5 Sup Temporal 
Ctx 


33.7 


Control (Path) 3 Occipital 
Ctx 


5.5 


AD 6 Inf Temporal 
Ctx 


45.1 


Control (Path) 4 Occipital 
Ctx 


79.6 


AD 6 Sup Temporal 
Ctx 


61.1 


Control I Parietal Ctx 


9.6 


Control 1 Temporal 
Ctx 


8.9 


Control 2 Parietal Ctx 


58.2 


Control 2 Temporal 
Ctx 


20.4 


Control 3 Parietal Ctx 


15.9 

■ 


Control 3 Temporal 
Ctx 


18.8 


Control (Path) 1 Parietal Ctx 


37.9 1 


Control 3 Temporal 
Ctx 


12.9 


Control (Path) 2 Parietal Ctx 


18.3 


Control (Path) 1 
Temporal Ctx 


46.0 


Control (Path) 3 Parietal Ctx 


2.2 


Control (Path) 2 
Temporal Ctx 


18.6 


Control (Path) 4 Parietal Ctx 


59.0 


Table OC Panel 1.3D 


Tissue Name 


Rel. Exp.(%) 
Ag3096, Run 
167985249 


Tissue Name 


Re!. Exp.(%) Ag3096, 
Run 167985249 


Liver 

adenocarcinoma 


0.5 


Kidney (fetal) 


0.2 


Pancreas 


0.0 


Renal ca. 786-0 


0.2 


Pancreatic ca. 
CAPAN 2 


0.1 


Renal ca. A498 


0.2 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.1 


Thyroid 


0.0 


Renal ca. ACHN 


0.1 


Salivary gland 


0.1 


Renal ca. UO-3 1 


0.2 


Pituitary gland 


0.5 


Renal ca. TK-10 


0.1 


Brain (fetal) 


83.5 


Liver 


0.0 
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Brain (whole) 


0.7 


Liver (fetal) 


0.0 


Brain (amygdala) 


1.1 


Liver ca. (hepatoblast) 
HepG2 


3.7 


Brain (cerebellum) 


100.0 


Lung 


0.0 


Brain 

(hippocampus) 


1.1 


Lung (fetal) 


0.1 


Brain (substantia 
nigra) 


1 

0.0 jLung ca. (small cell) LX- ! 

■ 


0.3 


Brain (thalamus) 


rt 0 iLuns ca. (small cell) NCI- 
0J !H69" 


0.1 


Cerebral Cortex 


1.0 


Lung ca. (s.cell var.) SHP- 
77 


1.4 


Spinal cord 


0.1 


Lung ca. (large cell)NCl- 
H460 


0.4 


glio/astro U87-MG 


n jLung ca. (non-sm. cell) 
U:> ;A549 


1.2 


gIio/astroU-118- 
MG 


'Ltinaca. (non-s.cell) NCI- L , 
0.1 j H23 ~ |0.3 


astrocytoma 
SW1783 


n 7 Lung ca. (non-s.cell) HOP- L . 

:62 " ! 


neuro*; met SK-N- 
AS 


. . Luna ca. (non-s.cl) NCI- ; n 
H522 i 


astrocytoma SF-539 


0.1 :Lung ca. (squam.) S\V 900 jO.l 


astrocytoma SNB- 
75 


0.4 


Lung ca. (squam.) NCI- 
H596 


0.3 


glioma SNB-19 


0.1 


Mammary gland 


0.0 


glioma U251 


0.4 


Breast ca.* (pl.ef) MCF-7 


0.3 


glioma SF-295 


0.3 


Breast ca.* (pl.ef) MDA- 
MB-231 


0.1 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) T47D 


0.1 


Heart 


0.1 


Breast ca. BT-549 


0.2 


Skeletal muscle 
(fetal) 


0.0 


Breast ca. MDA-N 


0.2 


Skeletal muscle 


0.1 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR-3 


4.4 


Thymus 


0.0 


Ovarian ca. OVCAR-4 


0.1 


Spleen 


0.0 


Ovarian ca. OVCAR-5 


0.4 


Lymph node 


0.1 


Ovarian ca. OVCAR-8 


0.5 


Colorectal 


0.0 


Ovarian ca. IGROV-1 


0.5 


Stomach 


0.1 


Ovarian ca.* (ascites) SK- 
OV-3 


0.6 


Small intestine 


0.0 


Uterus 


0.1 
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Colon ca. SW480 


0.4 


Placenta 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.8 


Prostate 


0.0 


Colon ca. HT29 


0.1 


Prostate ca.* (bone met)PC- 

3 


0.3 


Colon ca. HCT-116 


0.3 


Testis 


1.2 i 


Colon ca. CaCo-2 


0.5 


Melanoma Hs688(A)T 


0.0 


Colon ca. 
tissue(OD03866) 


0.1 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC- 
2998 


0.3 


Melanoma UACC-62 


0.1 


Gastric ca.* (liver 
met) NCI-N87 


0.6 


Melanoma M14 


0.1 


Bladder 


2.0 


Melanoma LOX 1MVI 


0.3 


Trachea 


ft n Melanoma* (met) SK- 
U '° jMEL-5 


0.1 


Kidney 


0.1 jAdipose 


0.1 


Tabic OD. Panel 4.1D 


jReL Exp.(%) 
Tissue Name |Ag3096, Run 
i 169990855 


i ! 

iRel. Exp.(%)A°3096J 
Tissue Name j Ru n 169990855 j 

i » 


Secondary Th 1 act 


41.2 


HUVEC iL-i beta 


45.4 


Secondary Th2 act 


70.7 


HUVEC I FN gamma 


35.1 


Secondary Trl act 


65.5 


HUVEC TNF alpha + IFN 
gamma 


38.7 


Secondary Thl rest 


14.4 


HUVEC TNF alpha + IL4 


33.4 


Secondary Th2 rest 


46.0 


HUVEC IL-11 


18.4 


Secondary Trl rest 


34.2 


Lung Microvascular EC 
none 


46.0 


Primary Thl act 


54.7 


Lung Microvascular EC 
TNFalpha + lL-lbeta 


39.5 


Primary Th2 act 


57.0 


Microvascular Dermal EC 
none 


100.0 


Primary Trl act 


44.1 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


40.9 


Primary Thl rest 


12.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


17 J 


Primary Th2 rest 


18.2 


Small airway epithelium 
none 


21.6 


Primary Trl rest 


17.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


34.4 
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CD45RA CD4 
lymphocyte act 


67.4 


v^oroiicry aricry diviv^ rest 


77 1 


CD45RO CD4 
lymphocyte act 


60.3 


Coronery artery SMC 
TNFalpha -ML- 1 beta 


97 ^ 


CD8 lymphocyte act 


55.1 


Astrocytes rest 


20.2 


Secondary CDS 
lymphocyte rest 


46.7 


Astrocytes TNFalpha + IL- 
lbeta 


26.1 


Secondary CD8 
lymphocyte act 


42.0 


:KU-812 (Basophil) rest 


i 

1 1 8.2 


CD4 lymphocyte 
none 


26.4 


KU-8 1 2 (Basophil) 
PMA/ionomycin 


31.2 


2ry 

Thl/Th2/Trl_anti- 
CD95 CH11 


17.9 


CCD! 106 (Keratinocytes) 


62.4 

i 


LAK cells rest 


31.4 


CCD 11 06 (Keratinocytes) 
TNFalpha + IL-1 beta 


9^ 9 

i 


LAK cells IL-2 


41.5 


|Liver cirrhosis 


iii.8 


LAK cells IL-2+IL- 
12 


53.6 


! 

'NC1-H292 none 


23.3 


LAK cells IL-2+IFN 
gamma 


59.5 


1 


90 Q 


LAK cells IL-2+IL- 
IS 


36.6 




4 j .4 


LAK cells 
PMA/ionomycin 


36.3 




HO.O 


NK Cells IL-2 rest 


42.9 


NCI-H292 I FN gamma 


48.0 


Two Way MLR 3 
day 


69.3 


HPAEC none 


28.5 


Two Way MLR 5 
day 


79.0 


HPAECTNF alpha +IL-I 
beta 


nO.U 


Two Way MLR 7 
dav 


21.5 


Lung fibroblast none 


26.4 


PBMC rest 


13.7 


Lung fibroblast TNF alpha 
t* lb- 1 oeta 


30.6 


PBMC PWM 


46.0 


Lung fibroblast IL-4 


34.4 


PBMC PHA-L 


18.3 


Lung fibroblast IL-9 


62.9 


Ramos (B cell) none 


59.0 


Lung fibroblast I L- 13 


62.0 


Ramos (B cell) 
ionomycin 


49.3 


Lung fibroblast IFN gamma 


28.5 


B lymphocytes 
PWM • 


54.0 


Dermal fibroblast ' 
CCDl 070 rest 


58.6 


3 lymphocytes 
CD40LandIL-4 


57.6 ; 


Dermal fibroblast 

CCD! 070 TNF alpha 1 


57.4 
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EOL-1 dbcAMP 


22.8 


Dermal fibroblast 
CCD1070IL-1 beta 


33.7 


EOL-1 dbcAMP 
PM A/ i on o my c i n 


20.2 


Dermal fibroblast IFN 
gamma 


12.9 


Dendritic cells none 


81.8 


Dermal fibroblast IL-4 


35.6 


Dendritic cells LPS 


15.5 


Dermal Fibroblasts rest 


22.8 


Dendritic cells anti- 
CD40 


14.9 |Neutrophils TNFa+LPS 

i 


0.0 


Monocytes rest 


25.9 


Neutrophils rest 


1. A 


Monocytes LPS j37.l 


Colon 


3.5 


Macrophages rest 


36.9 


Lung 


11.2 


Macrophages LPS (14.8 


Thymus 


16.8 


HUVEC none 147.6 


Kidney 


14.6 


HUVEC starved 128.3 






Table OE. General oncology screening panpl v ? d 


jRel. Exp.(%) 
Tissue Name j Ag3 096, Run 
1267920136 


Tissue Name 

i 


Kei. bxp.(/o) Agj 1)96,1 
Run 267920136 j 


Colon cancer 1 : 1 1 .8 jBladder cancer NAT 2 


0.0 j 


Colon cancer NAT 1 ;4.8 jBladder cancer NAT 3 


; , 

0.0 i 


Colon cancer 2 ;7.2 ^Bladder cancer NAT 4 


3.3 j 


Colon cancer NAT 2 


7.8 


Adenocarcinoma of the 
prostate 1 


i 

31.2 


Colon cancer 3 


22.7 


Adenocarcinoma of the 
prostate 2 


0.0 


Colon cancer NAT 3 


12.2 


Adenocarcinoma of the 
prostate 3 


3-2 


Colon malignant 
cancer 4 


6.4 


Adenocarcinoma of the 
prostate 4 


12.0 


Colon normal 
adjacent tissue 4 


5.0 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


10.8 


Adenocarcinoma of the 
prostate 6 


2.0 


Lung NAT 1 


0.0 


Adenocarcinoma of the 
prostate 7 


9.0 


Lung cancer 2 


100.0 


Adenocarcinoma of the 
prostate 8 


0.0 


Lung NAT 2 


2.6 


Adenocarcinoma of the 
prostate 9 


11.4 


Squamous cell 
carcinoma 3 


17.2 


Prostate cancer NAT 10 


0.0 


Lung NAT 3 


0.0 


Kidney cancer 1 


9.3 
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metastatic 
melanoma 1 


4.6 


KidneyNAT 1 


10.5 


Melanoma 2 


0.0 


Kidney cancer 2 


40.6 


Melanoma 3 


2.6 


Kidney NAT 2 


7.7 


metastatic 
melanoma 4 


12.8 


If i r! n c*\! r* q nfpr "2 
iviuiicy CallLcr .? 


^ A 
J A 


metastatic 
melanoma 5 


20.0 


Kidney NAT 3 


0.0 

1 


Bladder cancer 1 


0.0 


Kidney cancer 4 


10.4 


Bladder cancer NAT 
1 


0.0 


Kidney NAT 4 


5.4 


Bladder cancer 2 


0.0 







expression of the CG56941-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

Panel 1.3 D Summary: Ag3096 Highest expression of the CG56941-01 gene is 
detected in brain (cerebellum) (CT=27.7). In addition, high to moderate expression of this 
gene is also detected in fetal brain and other regions of central nervous systems, including 
amygdala, hippocampus, substantia nigra, thalamus, cerebellum, cerebral cortex, and 
spinal cord. Therefore, therapeutic modulation of this gene product may be useful in the 
treatment of central nervous system disorders such as Alzheimer's disease, Parkinson's 
disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Low levels of expression of this gene is also seen in a ovarian cancer, two lung 
cancer, liver cancer, a colon cancer and a CNS cancer cell line. Therefore, therapeutic 
modulation of this gene could be useful in the treatment of these cancers. 

Panel 4.1D Summary: Ag3 096 Highest expression of the CG5 694 1-01 gene is 
detected in microvascular dermal endothelial cells (CT=34.4). In addition, low levels of 
expression of this gene is also seen in dendritic cells, and two way MLR. Therefore, 
therapeutic modulation of this gene could be useful in the treatment of autoimmune and 
. inflammatory diseases that involve these cells, such as lupus erythematosus, asthma, 
emphysema, Crohn's disease, ulcerative colitis, rheumatoid arthritis, osteoarthritis, and 
psoriasis. 
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General oncology screening panel_v_2.4 Summary: Ag3096 Highest 
expression of the CG56941-01 gene is detected in lung cancer (CT=34.3). Expression of 
this gene is higher in the cancer sample than in the corresponding adjacent control sample 
(CT=39.6). Thus, expression of this gene may be useful as diagnostic marker for detection 
5 of lung cancer and therapeutic modulation of this gene may be useful in the treatment of 
this cancer. 

R. CG57109-01 and CG57109-02 and CG57109-03 and CG57109-04 and CG57109-05 
and CG57109-06: Doublecortin/CAMKinase 

10 Expression of gene CG57109-01, variants CG57109-02, CG57109-03, CG57109- 

04, CG57 109-06 and full length physical clone CG57 109-05 was assessed using the 
primer-probe sets Agl 1 37, Agl 1 50, Agl 860, Ag3 1 1 2 and Ag428 1 , described in Tables 
RA, RB, RC, RD and RE. Results of the RTQ-PCR runs are shown in Tables RF, RG, RH, 
RI, RJ, RK, RL, RM and RN. Please note that the variants CG57109-03, CG571 09-04, 

15 CG57109-06 correspond to the probe and primer sets Agl50 and Ag31 12 only. CG57109- 
05 represents a full-length physical clone of the CG57109-01 gene, validating the 
prediction of the gene sequence. 



20 



Table RA. Probe Name Agl 137 


Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 1 -gacatggtggacagtgagatct-3 ' 


22 


1338 


257 


Probe 


TET-5 * -cctctctoaccccaacatcgtgaaat-3 1 -TAMRA 


26 


1373 


258 


Reverse 


5 1 -tctgtttcgtagacttcatgca-3 • 


22 


1399 


259 


Table RB. Probe Name Asll50 




Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 1 -gaaattggctgattttggactt-3 « 


22 


1634 


260 


Probe 


TET-5 ' -cctatatttactgtgtgtgggacccca-3 • -TAMRA 


27 


1674 


261 


Reverse 


5 ' -agaatttcgggagctacgtaag-3 ' 


22 


1702 


262 




Table RC. Probe Name Agl. 


860 




Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


S ' -gacatggtggacagtgagatct-3 ' 


22 


1338 


263 


Probe 


TET-5 ' -cctctctcaccccaacatcgtgaaat-3 ' -TAMRA 


26 


1373 


264 


Reverse 


5 • -tctgtttcgtagacttcatgca-3 ' 


22 


1399 


265 
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Table RD . Probe Name Ag3112 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 • -gaaattggctgattttggactt-3 • 


22 j 


1634 


266 


Probe 


TET-5 ■ -cctatatttactgtgtgtgggacccca-3 1 -TAMRA 


27 


1674 


267 


Reverse 


5 • -agaatttcgggagctacgtaag-3 • 


22 


1702 


268 



Table RE . Probe Name Ag4281 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 ' -gacatggtggacagtgagatct- 3 1 


22 


1338 


269 


Probe 


TET-5 ' -atccagagcctctctcaccccaacat-3 ' -TAMRA 


26 


1365 


270 


Reverse 


5 ' -tcgtagacttcatgcaatttca-3 • 


22 


1393 


271 



Table RF . Al comprehensive panelvl.O 



1 — , — 

Tissue Name 


Rel. Exp.(%) 
Agl860, Run 
225404259 


{"Tissue Name 


Rel. Exp.(%) Agl860, 
Run 225404259 


110967 COPD-F 


3.7 


112427 Match Control 
Psoriasis-F 


7.9 


110980 COPD-F 


0.0 


1 12418 Psoriasis-M 


13.1 


110968 COPD-M 


20.0 


112723 Match Control j 

Psoriasis-M I 17,9 

.. i 


110977 COPD-M 


3.0 


112419 Psoriasis-M 126.8 


1 10989 Emphysema- 
F 


26.2 


112424 Match Control 
Psoriasis-M 


19.5 


11 0992 Emphysema- . 
F j 4 * 2 


1 12420 Psoriasis-M 


36.9 


110993 Emphysema- j 
F 


11.0 


112425 Match Control 
Psoriasis-M 


23.3 


110994Emphysema- 
F 


19.8 


104689 (MF) OA Bone- 
Backus 


13.0 


1 10995 Emphysema- 
F 


8.5 


1 04690 (MF)Adj 
"Normal" Bone-Backus 


3.9 


110996Emphysema- 
F 


0.0 


104691 (MF) OA 
Synovium-Backus 


29.5 


110997 Asthma-M 


17.2 


1 04692 (BA) OA 
Cartilage-Backus 


0.0 


111001 Asthma-F 


38.4 


1 04694 (BA) OA Bone- 
Backus 


17.3 


111002 Asthma-F 


27.9 


1 04695 (BA)Adj 
'Normal" Bone-Backus 


0.0 
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111003 Atopic 
Asthma-F 


50.7 


104696 (BA) OA 
Synovium-Backus 


93.3 


111004 Atopic 
Asthma-F 


68.3 


104700 (SS)OA Bone- 
Backus 


11.7 


111005 Atopic 
Asthma-F 


56.6 


104701 (SS) Adj 
"Normal" Bone-Backus 


15.3 


111006 Atopic 
Asthma-F 


17.9 


1 04702 (SS) OA 
Synovium-Backus 


100.0 


111417 Allergy-M 


31.0 


1 17093 OA Cartilage 
Rep7 


19.1 


112347 Allergy-M 


8.3 


112672 OA BoneS 


6.8 


112349 Normal 
Lung-F 


2.5 


112673 OA Synoviums 


4.8 


112357 Normal 
Lung-F 


4.4 


1 12674 OA Synovial Fluid 
cellsS 


13.9 


112354 Normal 
Lung-M 


0.0 


1 17100 OA Cartilage 
Rep 14 


4.5 


112374 Crohns-F 


27.5 


1 12756 OA Bone9 


63.7 


112389 Match 
Control Crohns-F 


9.0 


112757 OA Synovium9 


17.7 


1 12375 Crohns-F 


36.6 


1 12758 OA Synovial Fluid 
Cells9 


11.7 


112732 Match 
Control Crohns-F 


0.0 


117125 RA Cartilage 
Rep2 


27.5 


112725 Crohns-M 


4.4 


113492 Bone2RA 


20.0 


112387 Match 
Control Crohns-M 


17.1 


113493 Synovium2RA 


6.8 


112378 Crohns-M 


4.2 


113494 Syn Fluid Cells 
RA 


8.8 


112390 Match 
Control Crohns-M 


49.3 


1 13499 Cartilage4RA 


4.7 


112726 Crohns-M 


30.1 


113500 Bone4 RA 


8.8 


112731 Match 
Control Crohns-M 


0.0 


113501 Synovium4RA 


3.5 


112380 Ulcer Col-F 


37.9 


113502 Syn Fluid Cells4 
RA 


4.0 


112734 Match 
Control Ulcer Col-F 


17.9 


1 13495 Cartilage3 RA 

. . . , 


10.4 


112384 Ulcer Col-F 


29.7 


113496 Bone3 RA 


4.2 


112737 Match 
Control Ulcer Col-F 


9.0 . 


113497 Synovium3 RA 


0.0 


112386 Ulcer Col-F 


5.0 


113498 Syn Fluid Cells3 
RA 


7.9 
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112738 Match 
Control Ulcer Col-F 




1 17106 Normal Cartilage 
Rep20 


n ft 


112381 Ulcer Col-M 


0.0 


1 13663 Bone3 Normal 


0.0 


1 12735 Match 
Control Ulcer Col-M 


28 7 


1 13664 Synovium3 
Normal 


ft ft 


1 12382 Ulcer Col-M 


10 0 


1 13665 Syn Fluid Cells3 
Normal 


n ft 


112394 Match 
Control Ulcer Col-M 


0.0 


1 17107 Normal Cartilage 
Rep22 


0.0 


112383 Ulcer Col-M 


35.4 


1 13667 Bone4 Normal 

. 


13.3 


112736 Match 
Control Ulcer Col-M 


6.3 


1 13668 Synovium4 
Normal 


0.0 


112423 Psoriasis-F 


21.5 


113669 Syn Fluid Cells4 
Normal 


9.4 



Table RG . CNS_neuro degeneration^ 1.0 



Tissue Name 


Rel. . 
Exp.(%) 
Agl860, 
Run 

207807786 


Rel. 

Exp.(%) 
A23I12, 
Run 

1208976863 

! 


Rel. 

Exp.(%) 
Ag428l, 
Run 

2240752 

|94 

! 


! 
! 

iTissue 
jName 

i 
s 

i 


Re!. 

Exp.(% 

) 

Agl860 
, Run 
207807 
786 


Rel. 

Exp.(%) 
Ag3112, 
Run 

208976863 

i 


Rel. 

Exp.(%) 
Ag4281, 
Run 

22407529 
4 


AD 1 Hippo 


s 

i 

7.6 


! 

j 

2.7 


9.7 


Control 
(Path) 3 
Temporal 
Ctx 


0.5 


! 
i 

2.9 


3.1 


AD 2 Hippo 


29.1 


21.8 


24.5 


Control 
(Path) 4 
Temporal 
Ctx 


28.7 


23.0 


27.9 


AD 3 Hippo 


1.5 


3.2 


7.1 


AD 1 

Occipital 

Ctx 


17.6 


9.8 


27.4 


AD 4 Hippo 


4.2 


1.1 


5.6 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


0.0 


AD 5 hippo 


100.0 


100.0 


95.3 


AD 3 

Occipital 

Ctx 


4.1 


3.0 


1.3 


AD 6 Hippo 


19.9 


12.2 


26.1 


AD 4 

Occipital 

Ctx 


7.9 


4.8 


8.5 


Control 2 
Hippo 


27.2 


15.9 


39.0 


AD 5 

Occipital 

Ctx 


13.8 


44.1 


50.0 
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Control 4 
Hippo 


3.1 


2.1 


7.2 


AD 6 

Occipital 

Ctx 


33.4 


8.3 


14.8 


Control (Path) 
3 Hippo 


2.1 


0.0 


5.0 


Control 1 
Occipital 
Ctx 


0.0 


0.0 


2.0 


AD 1 

Temporal Ctx 


2.9 


4.1 


9.3 


Control 2 
Occipital 
Ctx 


62.0 


81.2 


100.0 


AD 2 

Temporal Ctx 


21.6 


27.0 


23.5 


Control 3 
Occipital 
Ctx 


5.0 


2.2 


15.5 


AD 3 

Temporal Ctx 


2.7 


0.0 


6.3 


Control 4 
Occipital 
Ctx 


0.5 


0.0 


2.4 


AD 4 

Temporal Ctx 


14.6 


15.6 


12.8 


Control 
(Path) 1 
Occipital 
Ctx 


34.6 


42.6 


43.8 


AD 5 Inf 
Temporal Ctx 


38.2 


■ 

48.3 


54.7 


Control 
(Path) 2 
Occipital 
Ctx 


1 

< 

6.2 |7.1 

! 


6.7 


AD i 

SupTemporal 
Ctx 


26.8 


26.2 


27.0 


Control 
(Path) 3 
Occipital 
Ctx 


0.0 


0.0 


0.6 


AD 6 Inf 
Temporal Ctx 


12.6 


11.7 


25.2 


Control 
(Path) 4 
Occipital 
Ctx 


15.8 


4.7 


25.7 


AD 6 Sup 
Temporal Ctx 


17.7 


10.9 


17.4 


Control 1 

Parietal 

Ctx 


3.7 


1.0 


1.2 


Control 1 
Temporal Ctx 


4.0 


1.1 


4.6 


Control 2 

Parietal 

Ctx 


18.8 


12.9 


19.9 


Control 2 
Temporal Ctx 


22.5 


12.6 


23.0 


Control 3 

Parietal 

Ctx 


8.1 


2.7 


7.9 


Control 3 
Temporal Ctx 


7.0 


4.8 


4.8 


Control 
(Path) 1 
Parietal 
Ctx 


33.4 


44.1 


42.9 


Control 4 
Temporal Ctx 


4.8 


1.9 


6.5 


Control 
(Path) 2 
Parietal 


17.4 


11.0 


13.6 
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Ctx 








Control (Path) 
1 Temporal 

Ctx 


37.6 


34.9 


47.0 


Control 
(Path) 3 
Parietal 
Ctx 


1.7 


0.0 


0.7 


Control (Path) 
2 Temporal 

Ctx 


28.1 


13.6 


27.9 


Control 
(Path) 4 
Parietal 
Ctx 


19.9 


6.9 


24.8 



Table RH . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4281, Run 

22218j233 


Tissue Name 


Rel. Exp.(%) Ag4281, 
Run 222183233 


Adipose 


U.5 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


5.4 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* M14 


0.0 


Gastric ca. KATO 111 


0.0 


Melanoma* 
LOX1MVI 


0.0 


Colon ca. SW-948 


0.0 


ivteianoma ojs>- 
MEL-5 


0.0 


Colon ca. SW480 


0.2 


OLJUalllUUb CCil 

carcinoma SCC-4 


0.0 


v^oion ca. W4ou met) 
SW620 


0.0 


Tpctic Pnnl 




colon ca. ti 1 zy 


U.U 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.9 


Colon ca. CaCo-2 


2.6 


Placenta 


2.0 


Colon cancer tissue 


5.6 


Uterus Pool 


0.8 


Colon ca.SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.7 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV- 
3 


0.6 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


4.0 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.5 


Ovarian ca. IGROV- 
1 


0.0 


Stomach Pool 


0.4 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


1.1 
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vvaiy 


ft £ 


Fetal Heart 


4.5 


jl>I vdol \*ci. VvkK^r 1 


ft ft 


Heart Pool 


2.7 


JJl Caol ta, ivi l*Jt\~ 

MB-231 


0.0 


Lymph Node Pool 


5.9 


DICaol td. DI J t t7 


ft ft 

u.u 


Fetal Skeletal Muscle 


22.1 


Breast ca. T47D 


1.0 


Skeletal Muscle Pool 


9.3 


Breast ca. MDA-N 


0.0 


jSpleen Pool 


2.4 


Breast Pool 


3.1 


Thymus Pool 


3.1 


Trachea 


1 .7 


CNS cancer (glio/astro) 
U87-MG 


A A 
0.0 


Lun<* 


0 0 

v.V 


CNS cancer (glio/astro) U- 
118-MG 


A A 

0.0 


Fetal Lung 


14.4 


CNS cancer (neuro;met) 

pi/ \l AO 

SK-N-AS 


0.8 


Lung ca. inv^i-in'H / 


ft n 
U.U 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-l 


0.0 


CNS cancer (astro) SNB- 
75 


0.0 


ILung ca. NCI-H146 


0.9 


CNS cancer (alio) SNB-19 


0.0 


|Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF-295 


0.0 


jLung ca. A549 


0.0 


Brain (Amygdala) Pool 


25.5 


luung ca. iNV_i-rl.)Zo 

1 — 


A A 

0.0 


Brain (cerebellum) 


5.4 


IT nno rt\ "MP* I LI 01 
<L>Ung Ca. iNV-/l-nZJ 


Z. I 


Brain (fetal) 


100.0 


j^ung ca. i\d-M4oU 


A A 
0.0 


Brain (Hippocampus) Pool 


59.0 


T uncr net HOP fJ~) 

i^ung ca. nUr-oz 


ft A 

U.U 


Cerebral Cortex Pool 


14.1 


Lung ca. NCI-H522 


0.7 


Brain (Substantia nigra) 
Poo! 


15.1 


Liver 


0.3 


Brain (Thalamus) Pool 


28.9 


Fetal Liver 


1.0 


Brain (whole) 


33.9 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


25.2 


Kidney Pool 


4.2 


Adrenal Gland 


2.3 


Fetal Kidney 


4.6 


Pituitary gland Pool 


36.6 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.4 


Renal ca. A498 


0.0 


Thyroid (female) 


0.4 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


2.1 



Table RI. Panel 13D 



Tissue Name 


Rel. Exp.(%) 
Agl860, Run 
165981809 


Rel. Exp.(%) 
Ag3112, Run 
167985258 


Tissue Name 


Rel. Exp.(%) 
Agl860,Run 
165981809 


Rel. 

Exp.(%) 
Ag3112, 
Run 
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167985258 


Liver 

f% /A £k n r"\ r* o »*/■» t n Am r* 

auenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


0.8 


1.4 


rancreas 


U.U 


0.0 


Renal ca. 786-0 


0.0 


0.0 


r aiiCrcmlC Ca. 

CAPAN 2 


0.0 


0.0 


Renal ca. A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 393 


0.0 


0 0 


Thyroid 


0.0 


0.0 


Renal ca. ACHN 


0.0 


0.0 


Salivary gland 


0.0 


0.0 


Renal ca. UO-3 1 


0.0 


0.0 


Pituitary gland 


12.6 


6.7 


Renal ca. TK-10 


0.0 


0.0 


Brain (fetal) 


26.6 


99.3 


Liver 


0.0 


0.0 


Brain (whole) (7.0 


8.3 


Liver (fetal) 


0.0 


0.0 


Brain (amygdala) 


35.6 


19.3 


L^ivcr ca. 

(hepatoblast) 
HepG2 


0.0 


0.0 


Brain 

(cerebellum) 


1.2 


1.3 


Lung 


b.o 


0.0 


Brain 

(hippocampus) 


7.0 


19.5 


Lung (fetal) 


1.0 


2.4 


Brain (substantia 
nigra) 


0.4 |6.7 


Lung ca. (small j 
cell)LX-l | 00 


0.0 


Brain (thalamus) 


5.8 


0.6 


Lung ca. (small 
cell)NCl-H69 


0.0 


0.0 


v^ereorai cortex 


13.6 


9.3 


Lung ca. (s.cell 
var.) SHP-77 


0.0 


0.0 


Spinal cord 


100.0 


100.0 


Lung ca. (large 
ceII)NCI-H460 


0.0 • 


0.0 


glio/astro U87- 
MG 


0.0 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


1.3 


glio/astro U- 118- 
MG 


0.0 


0.0 


Lung ca. (non- 
s.cell)NCI-H23 


2.1 


0.0 


astrocytoma 
SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.0 


0.0 


neuro*; met SK- 
N-AS 


0.0 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


0.0 


astrocytoma SF- 
539 1 


0.0 


».6 


Lung ca. (squam.) 
SW 900 


0.0 


0.0 


astrocytoma 
SNB-75 ' 


3.0 


3.0 


Lung ca. (squam.) 
MCI-H596 


0.0 


0.0 


glioma SNB-19 ( 


3.0 j( 


3.0 1 


Vlammary gland 1 


3.0 1 


3.0 


glioma U251 ( 


).0 ( 


,o : 


Breast ca.* (pl.ef) 
VICF-7 


3.0 1 


3.0 


glioma SF-295 ( 


).0 ( 


>.0 Breast ca.* (pl.ef) ( 


3.0 ( 


3.0 
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|MDA-MB-231 






Heart (fetal) 


0.0 


0.0 


Breast ca * (xA eft 
jT47D 


0.0 


0.0 


Heart 


10.7 


4.5 


jBreast ca. BT-549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


A fx 


19.2 


jBreast ca. MDA-N 


0.0 


0.0 


Skeletal muscle 


4.9 


8.1 


(Ovary 


0.0 


0.0 


Bone marrow 


0.0 


0.0 


lOvarian ca. 
iOVCAR-3 


0.0 


0.0 


Thymus 


1.1 


2.2 


Ovarian ca. 
|OVCAR-4 


0.0 


0.0 


Spleen 


0.0 


0.0 


jOvarian ca. 
jOVCAR-5 


0.0 


0.0 


L»yiupii iiuue 


U. / 


1.3 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


2.8 


0.3 


Ovarian ca. 
IGROV-I 


0.0 


0.0 




u.u 


1.1 


Ovarian ca.* 
(ascites^ 


0.0 


1.7 


jSmall intestine 


1.1 


2.0 


Uterus 


0.6 


0.0 


Colon ca. SW480 


0.0 


0.0 


Placenta 


2.8 


0.0 


SW620(SW480 
met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


42.6 


36.1 


Colon ca. CaCo-2 


0.7 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03 866); 


2.0 


4.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma UACC- 
62 


0.0 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


u.u 


0.0 


Melanoma M14 


0.0 


0.0 


Bladder 


4.0 


1.3 


Melanoma LOX 
IMVI 


0.0 


0.0 


Trachea 


3.0 


1.4 


Melanoma* (met) 
SK-MEL-5 


0.0 


0.0 


Kidney 


0.0 


0.0 


Adipose 


1.3 


2.6 


Tab 


e RJ. Panel 2.2 





Tissue Name 



Rel. Exp.(%) 



Tissue Name 
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Agl860, Run 
174229165 




Ael860 Run 

JxtZ.1 1\UI1 

174229165 


Normal Colon 


20.4 


Kidney Margin (OD04348) 


0.0 


Colon cancer 
(OD06064) 


0.0 


Kidney malignant cancer 
(OD06204B) 


28.9 


Colon Margin 
(OD06064) 


12.2 


Kidney normal adjacent 
tissue (OD06204E) 


0.0 


Colon cancer 
(OD06159) 


0.0 


Kidney Cancer (OD04450- 
01) 


0.0 


Colon Margin 
(OD06159) 


12.6 


Kidney Margin (OD04450- 
03) 


0.0 


Colon cancer 
(OD06297-04) 


37.6 


Kidney Cancer 8120613 


20.0 


Colon Margin 
(OD06297-05) 


15.0 


Kidney Margin 8120614 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


45.4 


Kidney Canqer 9010320 


17.7 


CC Margin (OD03921) 


35.1 


Kidnev Marsin 9010321 


21.0 


jColon cancer metastasis 
j(OD06104) 


0.0 




Kidneyfcancer 8120607 


16.5 I 

i 


ILung Margin 
(OD06104) 


19.1 


Kidney Margin 8120608 


0.0 i 


Colon mets to lung 
(OD04451-01) 


51.8 


Normal Uterus 


27.4 | 


Lung Margin 
(OD04451-02) 


0.0 


Uterine Cancer 064011 


0.0 


Normal Prostate 


0.0 


Normal Thyroid 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Thyroid Cancer 064010 


28.9 


Prostate Margin 
(OD04410) 


0.0 


Thyroid Cancer A302152 


33.9 


Normal Ovary 


0.0 


Thyroid Margin A302153 


0.0 


Ovarian cancer 
(OD06283-03) 


0.0 


Normal Breast 


28.3 


Ovarian Margin 
(OD06283-07) 


0.0 


Breast Cancer (OD04566) 


56.3 


Ovarian Cancer 064008 


30.4 


Breast Cancer 1024 


55.5 


Ovarian cancer 
(OD06145) 


0.0 . 


Breast Cancer (OD04590- 
01) 


94.0' 


Ovarian Margin 
(OD06145) 


35.1 


Breast Cancer Mets 
(OD04590-03) 


36.9 


Ovarian cancer 
(OD06455-03) 


0.0 


Breast Cancer Metastasis 
(OD04655-05) 


40.3 
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Ovarian Margin 
(OD06455-07) 


0.0 


breast Cancer 064006 


0.0 


Normal Lung 


13.3 


Breast Cancer 9 1 00266 


0.0 


Invasive poor diff. lung 
adeno (ODO4945-0I 


0.0 


Breast Margin 9100265 


0.0 


jLung Margin 
](ODO4945-03) 


20.3 


Breast Cancer A209073 


37.4 


jLung Malignant Cancer 
{(OD03I26) 


26.2 


Breast Margin A2090734 


23.2 


jLung Margin 
j(OD03126) 


0.0 


Breast cancer (OD06083) 


19.3 


|Lung Cancer 
{(OD05014A) 


0.0 


Breast cancer node 
metastasis (OD06083) 


60.3 


iLung Margin 
|(OD05014B) 


0.0 


Normal Liver 


15.5 


Lung cancer 
(OD06081) 


9.4 


Liver Cancer 1 026 


16.5 


Lung Margin 
(OD06081) 


0.0 


Stiver Cancer 1025 

i 


f 


Lung Cancer 
(OD04237-01) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Margin 
(OD04237-02) 


100 0 

1 UU.V 


Liver I issue 6004-N 


... 

0.0 

• 


Ocular Melanoma 
Metastasis 


o n 


Liver Lancer 6005-T 


0.0 


Ocular Melanoma 
Margin (Liver) 


0.0 


Liver Tissue 6005-N 


0.0 


Melanoma Metastasis 


0.0 


Liver Cancer 064003 


0.0 


Melanoma Margin 
(Lung) 


0 0 


Normal Bladder 


0.0 


Normal Kidney 


0.0 


Bladder Cancer 1023 


27.2 


Kidney Ca; Nuclear 
grade2(OD04338) 


0.0 


Bladder Cancer A302173 


0.0 


Kidney Margin 
(OD04338) ( 


5.0 


Normal Stomach 


17.3 


Kidney Ca Nuclear 
grade 1/2(OD04339) 


).0 ( 


3astric Cancer 9060397 : 


20.0 


Kidney Margin 
(OD04339) ( 


).0 ! 


Stomach Margin 9060396 i 


59.0 


Kidney Ca, Clear cell 
type (OD04340) ( 


).0 ( 


3astr ic Cancer 90603 95 1 


4.4 


Kidney Margin 
(OD04340) C 


M> 5 


Stomach Margin 9060394 ( 


59.7 
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Kidney Ca, Nuclear 
grade 3 (OD04348) 


0.0 


Gastric Cancer 064005 


48.0 


Table RK. Panel 3D 




Tissue Name 


Rel. Exp.(%) 
Ag3 1 1 2, Run 
182114339 


Tissue Name 


RpI Pvn 
Ag3U2, Run 
182114339 


Daoy- 

|MeduIloblastoma 


0.0 


Ca Ski- Cervical epidermoid 
icarcinoma (metastasis) 


0.0 


TE67 1 - 

Medulloblastoma 


0 q !pS-2- Ovarian clear cell 
jcarcinoma 


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 

i 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.0 


|SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


.SF-268- Glioblastoma 


0 0 jDaudi- Burkitt's lymphoma jO.O 


jT98G- Glioblastoma jo.O 


U266- B-cell plasmac>toma jo.O 


jSK-N-SH- 

jNeuroblastoma 

j(metastasis) 


i — • - 
! 

0-0 |CA46- Burkitt's lymphoma 

i 

i 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-ceil 
lymphoma 


0.0 


Cerebellum 


6.5 


JM 1 - pre-B-cell lymphoma 


0.0 


Cerebellum 


2.4 


Jurkat- Tcell leukemia 


7.3 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


0.0 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell 
lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell 
lung cancer 


100.0 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


).0 


CU-8 1 2- Myelogenous 
eukemia 


0.O 


NCI-H526- Small cell 
lung cancer 


).0 

( 


769-P- Clear cell renal 
carcinoma 


3.0 


NCI-N417- Small cell 
lung cancer 


>.0 < 


I^aki-2- Clear cell renal 
:arcinoma 


).0 


NCI-H82- Small cell 
lung cancer 


c 


>W 839- Clear cell renal 
carcinoma 


).0 


NCI-HI 57- Squamous |0.0 < 


3401- Wilms' tumor ( 


).0 
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cell lung cancer 
(metastasis) 








NCI-H1155- Large cell 
lung cancer 


3.6 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


iNCi-nlzyy- Large cell 
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


|nC1-H727- Lung 
carcinoid 


0.0 


SU86.86- Pancreatic 
[carcinoma (liver metastasis) 


0.0 


NCI-UMC-11-Lung 
carcinoid 


0 o jB.\PC-3- Pancreatic 
^adenocarcinoma 


0.0 


LX-1- Small cell lung 
cancer 


0 0 IHPAC- Pancreatic 
Jadenocarcinoma 


0.0 


Colo-205- Colon 
cancer 


q q |MIA PaCa-2- Pancreatic 
|carcinoma 


0.0 


KM 12- Colon cancer 


0 0 jcFPAC-1- Pancreatic ductal 
jadenocarcinoma 


0.0 


KM20L2- Colon 
cancer 


0 0 jPANC-1- Pancreatic 

'epithelioid ductal carcinoma 


0.0 


NCI-H716- Colon 
cancer 


0 o |T24- Bladder carcinma 
j(transitional cell) 


i - 

0.0 1 

1 


SW-48- Colon 
adenocarcinoma 


0.0 j563 7- B ladder care inoma 


0.0 


SW1116- Colon 
adenocarcinoma 


0.0 


HT-1 197- Bladder carcinoma 


0.0 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1 080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO m- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


0.0 


NCI-SNU-16- Gastric 
carcinoma 


5.0 


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.0 


NCI-SNU-1- Gastric 


3.0 


A43 1 - Epidermoid carcinoma I 


3.0 


RF-1- Gastric 
adenocarcinoma 


).0 


WM266-4- Melanoma < 


3.0 


RF -48- Gastric 
adenocarcinoma 


).0 

I 


3U 1 45- Prostate carcinoma 
[brain metastasis) 


).0 


MKN-45- Gastric ( 


).0 ] 


MDA-MB-468- Breast ( 


).0 
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carcinoma 




adenocarcinoma 




NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 

! 


SCC-I5- Squamous cell 
carcinoma of tongue 


0.0 


HelaSS- Cervical 
adenocarcinoma 


-j — .. 

! 0.0 

i 


•CAL 27- Squamous cell 
jcarcinoma of tongue 


;».» . 


Table RL. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4281,Run 
181080824 


Tissue Name 


Rel. Exp.(%) 

Aa4?81 Run 

181080824 


Secondary Th 1 act 


10.0 


|HUVEC IL-lbeta 


17 4 


Secondary Th2 act 


jo.o 


HUVEC 1FN gamma 


0.0 


Secondary Trl act 


jo.o 


jHUVEC TNF alpha + [FN 
| gam ma 


0.0 


oeconaary i n t rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


;o.o 


[HUVEC IL-11 


0.0 j 


[Secondary Trl rest 


0.0 


'Lung Microvascular EC none 


0.0 j 


1 

j Primary Th 1 act 


.0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


8.5 


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


1.6 


Primary Th I rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


2.8 


Coronery artery SMC rest 


5.7 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


25.7 


CD8 lymphocyte act 


0.0 


Astrocytes rest 1 


3.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL- 
lbeta 


1.1 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest ( 


).0 


CD4 lymphocyte none 


13 


KU-8 12 (Basophil) 
PMA/ionomycin 


).0 
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2ryThl/Th2/Trl anti- 
CD95CH11 


2.9 


CCD1 1 06 (Keratinocytes) 
none 


0.0 


LAK cells rest 


0.0 


CCD 1 106 (Keratinocytes) 
TNFa!pha + IL-lbeta 


n n 
u.u 


LAK cells IL-2 


0.0 


Liver cirrhosis 


1.4 


LAK cells IL-2+1L-12 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 

i 




n o 


LAK cells IL-2+ IL-18 |0.0 


NCI-H292 1L-9 


I.2 _ 


LAK cells 
PMA/ionomycin 


0.0 


KIP! W?Q? M 1 1 


A Q 


NK Cells IL-2 rest 


1.8 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 3 day 


1.1 


HPAEC none 


2.9 


Two Way MLR 5 day 


1.3 


HPAEC TNF alpha + IL-1 
beta 


1 f\f\ ft 


Two Way MLR 7 day ;0.0 


Lung fibroblast none 


r 2.9 : 


PBMC rest 


i 

b.o 


Lung fibroblast TNF alpha + 
IL-fbeta 


34 ! 


PBMC PWM 


io~.'o 


Lung fibroblast IL-4 


0.0 ! 


PBMC PHA-L 


!0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 




I nno fibroblast II -1 "3 

1 lUi UU Idol 1 1_j I J 


1 Q i 


Ramos (B cell) 
ionorhycin 


0.0 


Lung fibroblast I FN gamma 




B lymphocytes PWM 


0.0 


Dermal fibroblast CCD 1070 
rest 


5.4 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast CCD 1070 
TNF alpha 


11.3 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1070 
IL-1 beta 


49.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


O. / 


Dendritic cells none 


4.9 


Dermal fibroblast IL-4 


0.6 


Dendritic cells LPS 


4.3 


Dermal Fibroblasts rest 


3.5 


Dendritic cells anti- 
CD40 


8.7 


Neutrophils TNFa+LPS 


0.9 


Monocytes rest 


0.0 


NIp*ntrY\T^ntlc r&ct 
lectin v/Uilllo ICol 


v.U 


Monocytes LPS 


3.9 


Colon 


0.9 


Macrophages rest 


0.0 


Lung 


27.4 


Macrophages LPS 


4.7 


Thymus 


17.0 


HUVEC none 


0.0 


Kidney 


81.8 


HUVEC starved 


0.0 
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Table RM. Panel 4D 



Tissue Name 


Rel Exp.(%; 
Ael860 Rui 
165828919 


) Rel. Exp.(%) 

1 Aoll 1? Run 

164526081 


1 iroim A.T j-t inn r-i_ 

i issue Name 


Rel. 

Exp.(%) 
Agl860, 
Run 

165828919 


Rel. 

Exp.(%) 
Ag3112, 
Run 
1 164526081 


Secondary Th 1 acl 


t 0.0 


jO.O 


jHUVEC JL-lbeta 


6.5 


2.2 


Secondary Th2 act 


0.0 


1 

•0.0 

i 


inu v izk, Ir in 
j gam ma 


0.0 


i 

0.0 ! 


Secondary Trl act 


0.8 


! 
I 

0.0 


iHI IVPP TKir oUKo 

jnuvcL l iNr aipna 
|+ IFN gamma 


0.6 


1 

0.0 


Secondary Th 1 
rest 


0.0 


1.7 


HU VEC TNF alpha 

4- II A 

' JLr4 


0.6 


0.0 


Secondary Th2 
rest 


0.0 


0.0 


HUVEC IHI 


0.0 


0.0 


Secondary Trl rest 


0.6 


| 

lo.o 

I 


Lung Microvascular 
EC none 


0.0 


0.0 


Primary Thl act 


0.0 

L 


I 

;o.o 


jLung Microvascular 
jEC TNFalpha + IL- 
j 1 beta 


9.7 

i 


9.7 


j 

jPrimary Th2 act 

; _._ 


b.o 

___ 


r 

! 0.0 

! 


(Microvascular 
i Dermal EC none 


n n 
U.U 


J 

i 

A A i 

0.0 - 

f 


Primary Trl act 


0.7 


0.0 


;Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


1.4 


I 

5.9 


rnmary I hi rest 


0.7 


0.0 


Bronchial 
epithelium 
TNFalpha + IL1 beta 


0.6 


0.0 


Primary Th2 rest 


0.6 


0.0 


Small airway 
epithelium none 


0.0 


0.0 


Primary Trl rest i 


3.0 


0.0 


Small airway 
epithelium 
TNFalpha + IL- 

1 UCla 


D.O 1 


3.0 


CD45RA CD4 
lymphocyte act 


5.7 : 


1.9 


Coronery artety ( 
SMC rest 


?.7 


10.4 


CD45RO CD4 
lymphocyte act 


).0 ( 


i 

).0 ! 

1 


^oronery artery 
5MC TNFalpha + : 
L-lbeta 


13.7 • 


12.8 


CD8 lymphocyte 
act 


-4 C 


1.0 j 


\strocytes rest ( 


>.0 ( 


1.0 


Secondary CD8 
ymphocyte rest 


.0 0 


/ 

.0 1 

1 


\.strocytes 

rNFalpha + IL- 1 
beta 


1.7 0 


.0 
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oeconaary tuo 
lymphocyte act 


0.0 


1.9 


KU-812 (Basophil) 
rest 


0.0 


0.0 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


0.0 


2ry 

i n 1/ 1 nz/ 1 n anti- 
CD95CHI1 


U.U 

*# 


2.0 


CCD1106 

(Keratinocytes) 

none 


0.0 


0.0 


LAK. cells rest 


0.9 


0.0 


CCD1 106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


LAK cells IL-2 


2.7 


0.0 


Liver cirrhosis 


20.0 


6.4 


LAK cells 1L- 
2+1L-12 


0.0 


0.0 


Lupus kidney 


0.0 


0.0 

i 


LAK cells IL- 
2+IFN gamma 


0.4 


0.0 


NC1-H292 none 


0.0 


0.0 


LAK cells IL-2+ 
IL-18 


0.0 


0.0 


NCI-H292 IL-4 


0.0 


0.0 


LAK cells 
PMA/ionomycin 


1.8 


0.0 


NC1-H292 IL-9 


0.0 


2.6 : 


NK Cells IL-2 rest 


1.7 • 


0.0 |NC1-H292IL-13 


0.0 


1.7 i 


Two Way MLR 3 
day 


1.6 jo.O 


NCI-H292 1FN 
gamma 


i 

0.0 


0.0 i 

i 

! 

0.0 


Two Way MLR 5 
day 


0.0 


0.0 


HPAEC none 


0.0 


i wo way mlk / 
day 


0.0 


0.0 


HPAEC TNF alpha 
+ IL-1 beta 


100.0 


69.3 


PBMC rest 


0.8 


0.0 


Lung fibroblast 
none 


0.0 


0.0 


PBMC PWM 


1.5 


1.2 


Lung fibroblast 
TNF alpha +IL-1 
beta 


16.4 


3.5 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast IL-4 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast IL- 
13 


0.7 


0.0 


R lvmnhnpvtf»c 

PWM 


0.0 


2.5 


Lung fibroblast I FN 
gamma 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


1-7 


Dermal fibroblast 
CCD 1070 rest 


2.3 


4.1 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


28.7 


42.6 
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EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
CCD1070IL-1 beta 


49.7 


100.0 


Dendritic cells 
none 


4.6 


0.0 


Dermal fibroblast 
I FN gamma 


u.u 


0.9 


jLycnuniic ceils 
LPS 


4.2 


2.9 


Dermal fibroblast 
IL-4 


0.3 


0.0 


Dendritic cells 
anti-CD40 


15.8 


H-5 llBD Colitis 2 


1.9 


0.0 


Monocytes rest 


0.0 


0.0 jlBD Crohn's 


2.7 


5.1 j 


Monocytes LPS 


2.3 jO.O 1 


Colon 


15.6 


3.2 j 


Macrophages rest 


5.7 


3.4 


Lung 


23.0 


27.7 


Macrophages LPS 


6.8 


3.4 


Thymus 


0.0 


1.7 


HUVEC none 


0.0 


0.0 ~^ 


Kidney 


26.8 


27.5 


HUVEC starved 


O.O ~T 


D.O 







Tissue Name 


|ReI. Exp.(%) Agl860, 
|Run 171634856 


jTissue Name 


Rel. Exp.(%)Agl860, 
Run 171634856 


BA4 Control jll.7 


, : BA17PSP 


7.5 


BA4Control2 1 15.2 


|BA 1 7 PSP2 


7.9 


BA4 ; 
Alzheimer's2 j 4 ' 8 


!Sub Nigra Control 


0.0 


BA4 Parkinson's 


123.3 


Sub Nigra Control2 


1.4 


BA4 

Parkinson's2 


18.6 


Sub Nigra 
Alzheimer^ 


0.0 


BA4 

Huntington's 


4.9 


Sub Nigra Parkinson's2 


2.3 


BA4 

Huntington*s2 


2.5 


Sub Nigra Huntington's 


10.5 


BA4 PSP 


5.0 


Sub Nigra 
Huntington's2 


23.3 


BA4PSP2 


2.5 


Sub Nigra PSP2 


1.2 


(BA4 Depression 


5.9 


Sub Nigra Depression 


0.0 


|BA4 

JDepression2 


3.3 


Sub Nigra Depression2 


5.1 


|BA7 Control 


19.5 


Glob Palladus Control 


12.1 


|BA7 Contro!2 


23.5 


GlobPalladus ControI2 


16.6 


BA7 

Alzheimer 


10.5 1 

i 


Glob Palladus 
Alzheimer's : 


5.8 


BA7 Parkinson's 


11.0 { 

i 


3lob Palladus 
\lzheimer's2 


11.0 


BA7 


15.5 ( 


3lob Palladus 1 


00.0 
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Parkinson's2 




Parkinson's 




BA7 

Huntington's 


11.8 


Glob Palladus 
Parkinson's2 


30.6 


BA7 

HuntinptonV? 


9.5 


Glob Palladus PSP 


1.7 


BA7 PSP 


2.3 


Glob Palladus PSP2 


21.9 


BA7 PSP2 


5 5 jGlob Palladus 

! Depress ion 


1.5 


BA7 Depression 


6 0 iTemp Pole Control 


1 1.1 


BA9 Control 


•4.8 Ilemp Pole Control 


47.3 


BA9 Control 


33.4 


Temp Pole Alzheimer's 


2.1 1 


BA9 Alzheimer's 


6.7 


Temp Pole 
;Alzheimer's2 


4.6 


BA9 

Alzheimer's2 


11.1 


Temp-Pole Parkinson's 


19.1 


BA9 Parkinson's 


1 1 g 'Temp Pole 

jParkinson's2 


20.9 


BA9 

Parkinson's2 

i * ' — 


. ! 24.1 

i 


Temp Pole 
jHuntington's 


25.5 


|BA9 ! 
(Huntington's ; 


Temp Pole PSP 


0.0 


BA9 

Huntington's! 


8.0 


Temp Pole PSP2 


7.7 


BA9 PSP 


0.0 


Temp Pole 
Depression2 


1.9 


BA9 PSP2 


3.8 


Cing Gyr Control 


4.7 


BA9 Depression ; 


3.5 


Cing Gyr Control2 


20.4 


BA9 

Depression2 


5.6 


Cing Gyr Alzheimer's 


6.0 


BA 17 Control 


21.3 


Cing Gyr AIzheimer's2 


4.5 


BA17Control2 


46.0 


Cing Gyr Parkinson's 


1.7. 


BA17 

Alzheimer's2 


4.0 


Cing Gyr Parkinson's2 


10.4 


BA17 
Parkinson's 


12.5 


Cing Gyr Huntington's : 


20.2 


BA17 

Parkinson's2 


35.6 

• 


Cing Gyr 
Huntington's2 


1.9 


BA17 

Huntington's 


18.6 ( 


Cing Gyr PSP 1 


3.0 


BA17 

Huntington's2 


10.2 ( 


Cing Gyr PSP2 ; 


13 


BA17 
Depression 


1.2 < 


Cing Gyr Depression ( 


).0 



486 



WO 03/010327 



PC17US02/14199 



BA17 

Depression2 


18.4 


Cing Gyr Depression2 


1.4 



0 



CG57109-01 transcript in this panel is seen in synovium from an OA patient (CT=33.7). ■ 
Overall, this gene is expressed in OA tissue but not in normal joint tissue and is expressed 
in pulmonary tissue from patients with atopic asthma but not in normal lung tissue. Please 
see panel 4D for discussion of utility of this gene in inflammation. 

CNS_neurodegeneration_vl.O Summary: Agl860/Ag31 12/Ag4281 Three 
experiments with two different probe and primer sets produce results that are in very good 
agreement. This panel does not show differential expression of the CG57109-01 gene in 
Alzheimer's disease. However, this expression profile confirms the presence of this gene 
in the brain. Please see Panel 1 .4 for discussion of utility of this gene in the central " 
nervous system. One experiment with Ag6051, which is specific to CG57109-06 only, is 
in general agreement with the results above. Highest expression in this panel was in the 
temporal cortex of an Alzheimer's patient (CT=34.6). 

GencraI_scrcening_paneLvl.4 Summary: Ag4281 Highest expression of the 
CG57109-01 gene appears to be in the fetal brain (CT=29.5). Overall, expression of this 
gene appears to be highly brain-specific in this panel, with moderate levels of expression 
in the amygdala, hippocampus, thalamus and spinal cord and low but significant levels in 
the cerebral cortex and the substantia nigra. This gene encodes a novel doublecortin/CAM 
kinase like protein. Other members of this family have been implicated in the calcium- 
signaling pathway that controls neuronal migration in the developing brain. In addition, 
CAM kinase has been shown to play a crucial role in hippocampal LTP from studies in 
transgenic and knock-out mice, and may also play a role in memory formation in the 
mature nervous system as well as the developing brain. CAM kinases have also been 
shown to phosporylate tau, an integral component of the neurofibrillary tangles seen in 
Alzheimer's, in a manner which shifts tau electrophorytic motility to that seen in the AD 
brain. Furthermore, tau from AD brains shows aberrent phosphorylation. Thus, based on 
the expression of this DCAM kinase homolog in the brain, therapeutic modulation of the 
expression or function of this gene product may be useful in the treatment of learning and 
memory deficits that are a result of aging or neurodegenerative disease and also in the 
treatment of neurologic disorders themselves, including Alzheimer's disease. 
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Moderate to low levels of expression are also seen in a variety of samples from 
normal tissues, including testis, fetal and adult heart and skeletal muscle and fetal lung. 

In addition, this gene is expressed at much higher levels in fetal lung (CT=32.3) 
when compared to expression in the adult counterpart (CT=40). Thus, expression of this 
gene may be used to differentiate between the fetal and adult source of this tissue. 

Panel 1.3D Summary: Agl860/Ag31 12 Two experiments with the same probe 
and primer set produce results that are in excellent agreement, with highest expression of 
the CG57109-01 gene in the spinal cord (CTs=31.7). Expression of this gene is restricted 
to the nervous system and the testis. Thus, expression of this gene could be used to 
differentiate between neural and non-neural tissue. Please see Panel 1.4 for further 
discussion of utility of this gene in the CNS . 

Panel 2.2 Summary: Agl860 Expression of the CG57109-01 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 3D Summary: Ag31 12 Expression of the CG57 109-01 gene is restricted to 
a sample derived from a lung cancer cell line (CT=32.6). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of lung cancer. Furthermore, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of lung cancer. 

Panel 4.1D Summary: Agl860 This CG57109-01 transcript is highly expressed 
in activated dermal fibroblasts, endothelial cells, and astrocytes after treatment with IL-1 
or TNFalpha. Highest expression is seen in treated HPAECs (CT=3 1 .3). Please see panel 
4D for discussion of utility of this gene in inflammation. 

Panel 4D Summary: Agl860 This transcript is highly expressed in activated 
dermal fibroblasts, endothelial cells, and astrocytes after treatment with IL-1 or TNFalpha, 
with highest expression in TNF alpha and IL-1 beta treated HPAECs (CT=30.9). This 
protein has homology to protein kinase and may be involved in leukocyte extravasation 
from the peripheral blood into tissues. (Borbiev T, Am J Physiol Lung Cell Mol Physiol 
2001 May;280(5):L983-90) Therefore, antagonistic therapeutics designed against the 
protein encoded by this transcript may reduce or inhibit inflammation due to asthma, 
allergy, emphysema, osteoarthritis, colitis, psoriasis, or delayed type hypersensitivity. 
Agonistic therapies may also direct leukocyte traffic into tumors or sites of infection. 
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Ag3 112 Highest expression of the transcript is seen in IL-1 beta treated dermal 
fibroblasts (CT=30.4). Expression is in agreement with the profile seen with Agl860, 
except no expression is seen in astrocytes. 

Panel CNS_1 Summary: Agl 860 This panel confirms expression of the 
CG57109-01 gene in the brain. Please see Panel 1.4 for discussion of utility of this gene in 
the central nervous system. 

General oncology screening panel_v_2.4 Summary: Agl 860 Expression of the 
CG57109-01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not 
shown.) Ag6051 Expression of the CG57 109-06 gene is low/undetectable in all samples 
on this panel (CTs>35). (Data not shown.) 



S.CG57366-01: Kiaal223 

Expression of gene CG57366-01 was assessed using the primer-probe set Ag3219, 
described in Table SA. Results of the RTQ-PCR runs are shown in Tables SB, SC. SD and 
15 SE. 

Table SA . Probe Name Ag3219 



! 
! 

Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 • -agggatacccttcctcagaga-3 » 


21 


3380 


272 


Probe 


TET-5» -agccttaaacaagctctgaagcttca-3 ' -TAMRA 


26 


3409 


273 


Reverse 


5 1 -gctagggtcagaaccttcaatc-3 • 


22 


3435 


274 



Table SB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3219, Run 
209862297 


Rel. Exp.(%) 
Ag3219, Run 
224079667 


Tissue Name 


Rel. Exp.(%) 
Ag3219, Run 
209862297 


Rel. Exp.(%) 
Ag3219, Run 
224079667 


AD 1 Hippo 


18.9 


14.7 


Control (Path) 
3 Temporal Ctx 


9.5 


9.2 


AD 2 Hippo 


41.2 


52.5 


Control (Path) 
4 Temporal Ctx 


47.3 


41.2 


AD 3 Hippo 


1.9 


14.0 


AD 1 Occipital 
Ctx 


20.4 


28.5 


AD 4 Hippo 


6.3 


9.9 


AD 2 Occipital 
Ctx (Missing) 


0.0 


0.0 


AD 5 Hippo 


85.3 


90.1 


AD 3 Occipital 
Ctx 


5.0 


10.7 


AD 6 Hippo 


54.3 


64.2 


AD 4 Occipital 
Ctx 


28.9 


19.5 
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Control 2 
Hippo 


30.1 


39.8 


t\u j uccipita] 
Ctx 


32.8 


21.5 


Control 4 
Hippo 


15.7 


18.2 


AD 6 Occipital 

Ptv 


24.1 


40.9 


Control 
(Path) 3 
Hippo 


11.8 


12.6 


Control 1 
Occipital Ctx 


5.8 


6.7 


AD 1 

Temporal 

Ctx 


0.4 

j 
i 


36.3 

i 


Control 2 
Occipital Ctx 


36.9 


! 

|44.4 

i 
I 


AD 2 

Temporal 

Ctx 


31.6 


42.3 


Control 3 
Occipital Ctx 


i 

22.8 


i 
i 

21.3 


AD 3 

Temporal 

Ctx 


2.3 


8.6 


Control 4 
Occipital Ctx 


8.2 


11.8 


AD 4 

Temporal 

Ctx 


19.9 


30.1 


jControl (Path) 
il Occipital Ctx 

i 


i88.9 

i 
1 


79.6 


AD 5 Inf 
Temporal 
Ctx 

AD 5 Sup 
Temporal > 
Ctx 


99.3 
55.9 


I 

100 0 (Control (Path) 
12 Occipital Ctx 

.1 i. 

71 7 iControl (Path) 
j3 Occipital Ctx 


* 

•19.9 

3.6 

i 


j 

2..0 | 

i 
t 

7.0 


AD 6 Inf 
Temporal 
Ctx 


52.5 


66.0 


Control (Path) 
4 Occipital Ctx 


i 

21.6 


26.6 


AD 6 Sup 
Temporal 
Ctx 


58.2 

1 

. . . i 


65.5 


Control 1 
Parietal Ctx 


13.2 


13.5 


< 


Control 1 
Temporal < 

Ctx 


9.5 


10.7 


Control 2 
Parietal Ctx 


0.0 i 


55.1 


< 
( 


Control 2 
remporal [ 

ZtK 


52.8 


37.1 j 


Control 3 
Parietal Ctx 


21.9 


18.0 


( 
( 


control 3 
remporal : 

:tx 


13.7 : 


>4.8 ( 


Control (Path) 
Parietal Ctx 


100.0 I 


58.3 


( 
1 
c 


Control 3 
"emporal S 

:tx 


M 1 


1.2 J 


Control (Path) m 
! Parietal Ctx 4 


112 • 2 


!4.3 


c 


Control 

Path) 1 8 
^emporal 


3-5 6 


3 


:ontrol (Path) < 
Parietal Ctx ~ 


i.4 7 


.5 
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Ctx 












Control 
(Path) 2 
Temporal 
Ctx 


50.3 


52.5 


Control (Path) 
4 Parietal Ctx 


53.6 


48.0 



Table SC . Panel L3D 



r 

1 



1 

Tissue Name 


Rel. Exp.(%) Ag32l9. 
Run I6S013879 


Tissue Name 


Rel. Exp.(%) 
Ag32 1 9, Run 
1680 I 3879 


Liver adenocarcinoma 


|l4.9 

— - — 


Kidney (fetal) 


.31.0 


Pancreas 


jl.8 


Renal ca. 786-0 


j 1 9.3 


rancreatic ca. CAPAN 
2 


17.4 


Renal ca. A498 


10.4 


Adrenal gland 


1.7 


Renal ca. RXF 393 


8.5 


Thyroid 

< 1 


5.3 


Renal ca. ACHN 


12.3 


jSalivary gland 


J2.9 


jRenal ca. UO-3 1 


14.4 


jPituitary gland 


f 2 .4 ' ' 

1 


Renal ra Tk' 1 0 


1 1 C O 

j I 0.8 


.Brain (fetal) 


.58.2 


;Liver 


13.0 


iBrain fwhole^ 


no 


jLiver (fetal) 


3.7 


jBrain (amygdala) 

! 


jlO.6 

i 


jLiver ca. (hepatoblast) 
jHepG2 


W .A. 


jBrain (cerebellum) 


j9.7 


jLung 


4.5 


Brain (hippocampus) 


8.4 


Lung (fetal) 


27.7 


Brain (substantia nigra) 


8.5 


Lungca. (small cell) 
LX-1 


5.3 


Brain (thalamus) 


5.3 


Lungca. (small cell) 
NCI-H69 


17.1 


Cerebral Cortex 


7.1 


Lung ca. (s.cell var.) 
SHP-77 


42.9 


Spinal cord 


5.5 


Lung ca. (large 
cell)NCI-H460 f 


1.0 


glio/astro U87-MG < 


?.9 


Lung ca. (non-sm. cell) , 
A549 


>2.7 


glio/astroU-l I8-MG ] 


8.2 1 
] 


Lung ca. (non-s.cell) 
MCI-H23 1 


7.0 


astrocytoma SW1 783 1 


7.0 1 
1 


-ung ca. (non-s.cell) 
-lOP-62 1 


7.2 


neuro*; met SK-N-AS 2 


81 ! 


-ung ca. (non-s.cl) 
4CI-H522 1 


4.4 


astrocytoma SF-539 1 


8.4 I 

9 


.ung ca. (squam.) SW 
00 6 


5.1 


astrocytoma SNB-75 4 


4.1 L 


-ung ca. (squam.) NCI- 5 


7.4 
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H596 




glioma SNB-19 


14.5 


Mammary gland 


7.5 


glioma U251 


55 9 


Breast ca.* (pl.ef) 
MCF-7 


46.3 


glioma SF-295 


21.2 


Breast ca.* (pl.ef) 
IVl UA-M a- 15 | 


16.3 


Heart ( fptaH 


Z.Z 


pieast ca. (pl.er) 14/D 


31.9 


ricai i 




jBreast ca. B lo49 


|9.5 




1 .0 


i Breast ca. MDA-N 


10.4 




OrVCIClal IllUbLJC 


1 1 


jOvary 


3.8 


Bone marrow 


1.5 


(Ovarian ca. OVCAR-3 


17.6 


Thymus 


6.4 


jOvarian ca. OVCAR-4 


5.4 


Spleen 


1.1 


Ovarian ca. OVCAR-5 


80.1 


Lymph node 


2.8 


Ovarian ca. OVCAR-8 


9.3 


Colorectal 


4.0 


Ovarian ca. IGROV-1 


4.9 


Stomach 


5.0 


Ovarian ca.* (ascites) 
SK-OV-3 


100.0 




Small intestine 


3.7 


Uterus 




i 


Colon ca. SW480 


2.8 


Placenta 


1.4 


Colon ca.* 
SW620(SW480 met) 


9.5 


Prostate 


4.8 


Colon ca. HT29 




Prostate ca.* (bone 
met)PC-3 


29.3 


Colon ca. HCT-116 


8.3 


Testis 


8.2 


Colon ca. CaCo-2 


5.6 


Melanoma Hs688(A).T 


8.9 


Colon ca. 
tissue(OD03866) 


5.4 


Melanoma* (met) 
Hs688(B).T 


13.4 


Colon ca. HCC-2998 


8.7 


Melanoma UACC-62 


3.0 


Gastric ca.* (liver met) 
NCI-N87 


16.5 


Melanoma Ml 4 


2.8 


Bladder 


21.3 


Melanoma LOX IMVI 


11.0 


Trachea 


2.3 


Melanoma* (met) SK- 
MEL-5 


7.5 


Kidney 


6.3 


Adipose 


17.3 


Table SD. Panel 2.2 


Tissue Name 


Rel.Exp.(%)Ag3219, 
Run 174416266 


Tissue Name 


Rel. Exp.(%) 
Ag3219,Run 
174416266 


Normal Colon 


22.8 


Kidney Margin 
(OD04348) 


51.4 


Colon cancer 


25.2 


Kidney malignant cancer 


36.6 
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(UJJUoUo4J 


i 

\ 






Colon Margin 


12.0 


[Cidney normal adjacent 
tissue (OD06204E) 


5.3 


Colon cancer 
(UUUo 1 jy) 




Sidney Cancer 
(OD04450-01) 


54.3 


Colon Margin 
(UUvoijy) 


14.5 


Kidney Margin 
(OD04450-03) 


9.7 

1 


Colon cancer 
(OD06297-04) j 


j 

4.1 iKidney Cancer 8120613 


i 

0.0 

j 


Colon Margin \ 
(OD06297-05) 


23.0 


Kidney Margin 8120614 


i 

0.9 


CC Gr.2 ascend colon 
(OD03921) 


3.6 


Kidney Cancer 9010320 


3.0 


CC Margin (OD03921)j6.9 


Kidney .Margin 9010321 


3 0 


Colon cancer metastasis 1 ^ - 
(OD06104) j 


Kidney Cancer 8 1 20607 


2.6 

— 


Lung Margin ! n , 
(OD06104) j 


Kidney Margin 8120608 


0.1 


gSSSEoo' 1 "* : 14 - 3 i Nomia,Uteri,s 


44.1 j 

.. _ J 


Lung Margin • luterine Cancer 06401 1 
(OD04451-02) ; s 


i 

13.2 1 


Normal Prostate -5.8 jNormal Thyroid 


57 — 1 


Prostate Cancer 
(OD04410) 


11.3 


r 

Thyroid Cancer 064010 


10.2 


Prostate Margin 
(OD04410) 


12.2 


Thyroid Cancer 
A302152 


14.0 


Normal Ovary 


2.4 


Thyroid Margin 
A302153 


5.1 


Ovarian cancer 
(OD06283-03) 


2.7 


Normal Breast 


25.2 


Ovarian Margin 
(OD06283-07) 


14.0 


Breast Cancer 
(OD04566) 


19.6 


Ovarian Cancer 064008 


9.9 


Breast Cancer 1 024 


8.5 


Ovarian cancer 
(OD06145) 


7.1 


Breast Cancer 
(OD04590-01) 


17.1 


Ovarian Margin 
(OD06145) 


23.0 


Breast Cancer Mets 
(OD04590-03) 


31.2 


Ovarian cancer 
(OD06455-03) 


17.3 


Breast Cancer 
Metastasis (OD04655- 
05) 


100.0 


Ovarian Margin 
(OD06455-07) 


12.2 


Breast Cancer 064006 


27.9 
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Normal Lung 


21.2 


Breast Cancer 9 1 00266 


9.8 


Invasive poor diff. lung 
adeno (ODO4945-01 


20.7 


Breast Margin 9100265 


13.2 


Lung Margin 
(ODO4945-03) 


24.1 


Breast Cancer A209073 


14.1 


Lung Malignant Cancer 
(OD03126) 


8.7 


Breast Margin 
A2090734 


22.4 


Lung Margin 
(OD03126) 


16.2 


Breast cancer 
;(OD06083) 


100.0 


Lung Cancer 
(OD05014A) 


17.0 


\ Breast cancer node 
jmetastasis (OD06083) 


60.7 


Lung Margin 
(OD05014B) 


42.3 


(Normal Liver 

i 


12.8 


Lung cancer 
(OD06081) 


10.4 


i 

; Liver Cancer 1026 


0.4 


Lung Margin 
(OD06081) 


13.2 

i 


j — 

Liver Cancer 1025 


3.8 


Lung Cancer 
(OD04237-01) 


10.4 

; 


'Liver Cancer 6004-T 


iu 1 

1 


Lung Margin 
(OD04237-02) 


„ r 

jO.o 


— 

Liver Tissue 6004-N 


r i 

0.5 j 
1 


Ocular Melanoma 
Metastasis 


9.6 


Liver Cancer 6005-T 


1 

2,6 | 


Ocular Melanoma 
Margin (Liver) 


7;3 


Liver Tissue 6005-N 


4.2 


Melanoma Metastasis 


32.5 


Liver Cancer 064003 


7.1 


Melanoma Margin 
(Lung) 


21 .6 


Normal Bladder 


18.6 


Normal Kidney 


4.7 


Bladder Cancer 1023 


3.5 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


28.3 


Bladder Cancer 
A302173 


10.8 


Kidney Margin 
(OD04338) 


11.8 


Normal Stomach 


27.4 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


34.2 


Gastric Cancer 9060397 


2.4 


Kidney Margin 
(OD04339) 


8.2 


Stomach Margin 
9060396 


3.1 


Kidney Ca, Clear cell 
type (OD04340) 


11.0 


Gastric Cancer 9060395 


5.2 


Kidney Margin 
(OD04340) 


13.6 


Stomach Margin 
9060394 


4.3 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


*.5 


Gastric Cancer 064005 


5.8 
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Tissue Name 


Rel. Exp.(%) Ag3219, 
Run 164531987 


Tissue Name 


Rel. Exp.(%) 
Ag3219, Run 
164531987 


Secondary Thl act 


0.1 


HUVECIL-lbeta 


27.5 


Secondary Th2 act 


0.1 


HUVEC 1FN gamma 


35.8 


oeconaarv in act 


0.3 


HUVECTNF alpha + 
IFN gamma 


39.5 


Secondary Th 1 rest 


0 n IHUVECTNF alpha + 
* J |1L4 


54.7 


Secondary Th2 rest 


0.4 


HUVEC IL-11 


30.4 


Secondary Tr I rest 


0.3 


Lung Microvascular EC 
none 


63.7 


Primary Thl act 


0.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


43.2 


Primary Th2 act 


0.6 


Microvascular Dermal 
EC none 


49.3 


Primary Trl act • 


0.6 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


19.5 


j Primary Thl rest 


1 g Bronchial epithelium 
jTNFalpha + ILibeta 


. — 

24.3 


■Primary Th2 rest 


j 0 jSmall airway epithelium 
:iione 


6.7 


{Primary Trl rest 


0.5 


Small airway epithelium 
TNFalpha + IL-lbeta 




50.0 


CD45RA CD4 
lymphocyte act 


6.6 


Coronery artery SMC 
rest 


39.0 


CD45ROCD4 
lymphocyte act 


0.2 


Coronery artery SMC 
TNFalpha + IL-lbeta 


18.6 


CD8 lymphocyte act 


0.1 


Astrocytes rest 


10.2 


Secondary CDS 
lymphocyte rest 


0.4 


Astrocytes TNFalpha + 
IL-Ibeta 


7.9 


Secondary CDS 
lymphocyte act 


0.1 


KU-812 (Basophil) rest 


13.3 


CD4 lymphocyte none 


0.1 


KU-8 12 (Basophil) 
PMA/ionomycin 


42.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.2 


CCD 1106 

(Keratinocytes) none 


16.2 


LAK cells rest 


• 

2.3 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


3.8 


LAK cells IL-2 


0.4 


Liver cirrhosis 


2.1 


LAK cells IL-2+IL- 12 1 


3.2 


-,upus kidney 


1.3 


LAK cells IL-2+IFN \ 


3.9 NCI-H292 none 


16.6 
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gamma 








LAK cells IL-2+ IL-1 8 


0.5 


NCI-H292 IL-4 


22.8 


LAK cells 

PM A/ionomyci n 


1.2 


NCI-H292 IL-9 


19 8 


NK Cells IL-2 rest 


0.1 


NCI-H292IL-13 


14.5 


Two Way MLR 3 day 


1.0 


NCI-H292 I FN gamma 


14.2 


Two Way MLR 5 day 


0.7 


HPAEC none 


43.8 

f 


Two Way MLR 7 day 


0.3 


HPAEC TNF alpha +IL- 
1 beta 


1 

24 0 

,\J I 

I 


PBMC rest 


1.8 


Lung fibroblast none 


8.9 


PBMC PWM 


2.2 


Lung fibroblast TNF 
alpha + IL-1 beta 


9.5 


PBMC PHA-L 


0.8 


Lung fibroblast IL-4 


16.5 


Ramos (B cell) none 


2.9 


Lun<* fibroblast 11 -Q 


12.6 


Ramos (B cell) 
ionomycin 


15.3 


Lung fibroblast IL-1 3 


11.9 

t 


B lymphocytes PWM 

B lymphocytes CD40L 
and IL-4 


4.4 
0.7 


Lung fibroblast I FN 

gamma 

Dermal fibroblast 
CCD 1070 rest 


— \ 

10.4 ! 

• 

28.1 | 

22.2 i 


j 

jcL/L-I ODCAIVlr 

i 


3.7 


Dermal fibroblast 
CCD 1070 TNF alpha 


EOL-l dbcAMP 
PMA/ionomycin 


2.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


16.4 


Dendritic cells none 


4.6 


Dermal fibroblast IFN 
gamma 


R 7 
o. / 


Dendritic cells LPS 


3.2 


Dermal fibroblast IL-4 


29.5 


Dendritic cells anti- 
CD40 


4.4 


IBD Colitis 2 


1.3 


Monocytes rest 


12.8 


IBD Crohn f s 


1.7 


Monocytes LPS 


2.4 


Colon 


9.7 


Macrophages rest 


9.2 


Lung 


27.2 


Macrophages LPS 


1.1 


Thymus 


14.2 


HUVEC none 


57.4 


Kidney 


22.8 


HUVEC starved 100.0 j 







same probe and primer set produce results that are in excellent agreement. This gene is 
downregulated in the temporal cortex of Alzheimer's disease patients when compared with 
non-demented controls (p = 0.001 when analyzed by Ancova, estimate of total cDNA 
loaded per well used asa covariate). Therefore, up-regulation of this gene or its protein 
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product, or treatment with specific agonists for this receptor may be of use in reversing the 
dementia/memory loss and neuronal death associated with this disease. 

Panel 1.3D Summary: Ag3219 Highest expression of the CG57366-01 gene is 
seen in an ovarian cancer cell line (CT=29.9). In addition, expression appears to be 
significant in all the cell lines on this panel. In addition, this gene appears to be expressed 
in most of the samples on this panel, suggesting a role for this gene product in cell survival 
and proliferation. Thus, expression of this gene could be used as a marker of ovarian 
cancer. Furthermore, therapeutic modulation of the expression or function of this gene 
product may be useful in the treatment of cancer. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, thyroid, skeletal muscle, heart, and adult and fetal liver. This 
widespread expression among these tissues suggests that this gene product may play a role 
in normal neuroendocrine and metabolic function and that disregulated expression of this 
gene may contribute to neuroendocrine disorders or metabolic diseases, such as obesity 
and diabetes. f 

This gene is also expressed at moderate to low levels in the CNS, including the 
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 
Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Panel 2.2 Summary: Ag32 19 Highest expression of the CG57366-01 gene is seen 
in breast cancer (CT=29.3). In addition, significant expression is seen in a cluster of breast 
cancer samples. Thus, expression of this gene could be used to differentiate between this 
sample and other samples on this panel and as a marker to detect the presence of breast 
cancer. Furthermore, therapeutic modulation of the expression or function of this gene 
may be effective in the treatment of breast cancer. 

Panel 4D Summary: Ag3219 Highest expression of the CG57366-01 gene is seen 
in untreated HUVEC (CT=27.7). Expression on this panel is seen mainly in endothelial 
cells and fibroblasts from lung and skin, basophils and astrocytes, and normal lung, 
thymus and kidney. Thus, this gene product may be involved in pathological and 
inflammatory lung and skin conditions, including asthma, emphysema, allergy and 
psoriasis. 
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T. CG57368-01: Adenylate Cyclase Type IV 

Expression of gene CG57368-01 was assessed using the primer-probe set Ag3221, 
described in Table TA. Results of the RTQ-PCR runs are shown in Tables TB, TC, TD 
and TE. 

Table TA . Probe Name Ag3221 



! Primers 


Sequences 


Leimth 


Start 
Position 


SEQID 
No 


Forward 


5 1 -aactgatgggtgctatctcctt-3 ' 


22 


2390 


275 


Probe 


TET-5 1 -cttcttcttcaccctccttgtcctgg-3 » -TAMRA 


26 


2418 


276 


Reverse 


5 • -aggcggcagtagtactcattct-3 * 


22 


2450 


277 



Table TB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%)Ag3221, Run 
209862298 


Tissue Name 


Rel.Exp.(%) Ag3221, 
Run 20986^298 


AD 1 Hippo 


4.2 


Control (Path) 3 
Temporal Ctx 


17.2 


AD 2 Hippo 


41.5 


Control (Path) 4 
Temporal Ctx 


42.9 


AD 3 Hippo 


8.0 


AD 1 Occipital Ctx 


1.7 j 


AD 4 Hippo 


13.4 


AD 2 Occipital Ctx 
i(fVhssmg) 


l 

0.0 


AD 5 hippo 


56.6 


AD "3 Orrinitfll Ptv 


H 


AD 6 Hippo 


49.0 


AD 4 Occipital Ctx 


18.9 


Control 2 Hippo 


13.5 


AD 5 Occipital Ctx 


33.0 


Control 4 Hippo 


21.9 


AD 6 Occipital Ctx 


32.1 


Control (Path) 3 
Hippo 


14.7 


Control 1 Occipital Ctx 


57.8 


AD 1 Temporal 

Ctx j 


12.6 


Control 2 Occipital Ctx 


40.3 


AD 2 Temporal 
Ctx 


20.3 


Control 3 Occipital Ctx 


23.2 


AD 3 Temporal 

Ctx | 


5.1 


Control 4 Occipital Ctx 


10.2 


AD 4 Temporal 
Ctx 


32.8 


Control (Path) 1 
Occipital Ctx 


87.7 


AD 5 Inf 
Temporal Ctx 


62.9 


Control (Path) 2 
Occipital Ctx 


40.1 


AD 5 

SupTemporal 
Ctx 


47.3 


Control (Path) 3 
Occipital Ctx 


25.5 


AD 6 Inf 


563 


Control (Path) 4 


71.7 
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Temporal Ctx 




Occipital Ctx 




AD 6 Sup 
Temporal Ctx 


65.5 


Control 1 Parietal Ctx 


24.0 


Control 1 
Temporal Ctx 


33.9 


Control 2 Parietal Ctx 


67.8 


Control 2 
Temporal Ctx 


27.2 


Control 3 Parietal Ctx 


18.0 


Control 3 
Temporal Ctx 


11.9 


Control (Path) I Parietal 
Ctx 


43.8 


Control 4 
Temporal Ctx 


17.8 


Control (Path) 2 Parietal \ , Q 
Ctx 


Control (Path) 1 
Temporal Ctx 


43.5 


Control (Path) 3 Parietal 
Ctx 


i 

25.0 


Control (Path) 2 
Temporal Ctx 


62.9 


Control (Path) 4 Parietal 
Ctx 


100.0 


Tab 


e TC. Panel 1.3D 


IRel. Exp.(%) 
Tissue Name [Ag322 1 . Run 
;168014001 


Tissue Name l^^!^ 1 ' 

Run 168014001 ' 

L : i 


Liver adenocarcinoma 0.4 iKidney (fetal) '73.2 j 


Pancreas ;9.3 


Renal ca. 786-0 ^0.0 


Pancreatic ca. j 
CAPAN 2 j 0 4 


Renal ca. A49S ]0J 


Adrenal gland 


14.9 


Renal ca. RXF 393 


0.0 


Thyroid 


12.2 


Renal ca. ACHN 


0.0 


Salivary gland 


6.5 


Renal ca. UO-31 


0.0 


Pituitary gland 


10.8 


Renal ca.TK- 10 


0.0 


Brain (fetal) 


5.4 


Liver 


15.0 


Brain (whole) 


11.2 


Liver (fetal) 


3.0 


Brain (amygdala) 


10.2 


Liver ca. (hepatoblast) 
HepG2 


0.0 


Brain (cerebellum) 


12.1 


Lung 


54.7 


Brain (hippocampus) 


7.3 


Lung (fetal) 


94.0 


Brain (substantia 
nigra) 


11.7 


Lung ca. (small cell) 
LX-1 


2.1 


Brain (thalamus) 


11.3 


Lung ca. (small cell) 
NCI-H69 


1.5 


Cerebral Cortex 


10.3 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


17.1 


Lung ca. (large 
celI)NCI-H460 


0.0 


glio/astroU87-MG 


0.0 


Lung ca. (non-sm. cell) 


0.0 
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A549 




glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


••3 


astrocytoma SW1 783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


inpurn** mt*t QK" XJ AC- 
IICIIKJ , illCl oIV~lN-/\o 


u.u 


Lung ca. (non-s.cl) NCI- 
H522 


0.0 


JiQtrnr'vtnmn ^F-^^O 
as u \j\*y i\/iiia or'jj/ 




Lung ca. (squam.) SW 
900 


i.5 




u. / 


Lung ca. (squam.) NCl- 
H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


100.0 


glioma U251 


0.0 


Breast ca.* (pl.ef) MCF- 
7 


0.0 


glioma SF-295 


0.0 


Breast ca * (pl.ef) MDA- 
MB-231 


0.0 


Heart (fetal) 


95.3 {Breast ca.* (pl.ef) T47D 




Heart 


50.3 |Breast ca. BT-549 
57.8 [Breast ca. MDA-N 


n o 

v.yj 


Skeletal muscle (fetal) 


Skeletal muscle 


28.5 jOvary 


1 8 ~> 
\ o.z 


Bone marrow J 1 1 .6 -Ovarian ca. OVCAR-3 


0.0 


Thymus 19.3 


Ovarian ca. OVCAR-4 


0.0 


Spleen 


27.2 


Ovarian ca. OVCAR-5 


43.5 


Lymph node 


41.8 


Ovarian ca. OVCAR-8 


0.6 


Colorectal 


5.8 


Ovarian ca. IGROV-1 


0.0 


Stomach 


10.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


21.2 


Uterus 


62.9 


Colon ca. SW480 


0.0 


Placenta 


22.2 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


10.4 


Colon ca. HT29 


0.4 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


5.0 


Colon ca. CaCo-2 


0.0 


Melanoma Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


6.7 


Melanoma* (met) 
Hs688(B).T 


5.5 


Colon ca. HCC-2998 


7.2 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


2.8 


Melanoma M14 


0.0 


Bladder 


8.5 


Melanoma LOX IMVI 


0.0 
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Trachea 


6.1 


Melanoma* (met) SK- 
MEL-5 


0.0 


Kidney 


27.5 


Adipose 


42.3 


Table TD. Panel 2.2 


i 

| 

Tissue Name 


Rel. Exp.(%) 
Ag3221,Run 
173817111 


Tissue Name 


Rel. Exp.(%) Ag3221, 
Run 1 738171 11 


iNomiai iwOion 


6.0 


Kidney Margin 
(OD04348) 


100.0 


Colon cancer 
(OD06064) 


17.4 


Kidney malignant 
cancer (OD06204B) 


0.0 


Colon Margin 
(OD06064) 


10.2 


Kidney normal adjacent 
tissue (OD06204E) 


8.5 


Colon cancer 
(OD06159) 


2.5 


Kidney Cancer 
(OD04450-01) 


0.0 


IColon Margin 
|(OD06I59) W 


4.2 


Kidney Margin | 
(OD04450-03) j 18 ' 4 


{Colon cancer 
;(OD06297-04) 


4.2 


Kidney Cancer 8 1 206 1 3 =5.2 
i 


'Colon Margin | 
i(OD06297-05) ; 18 '^ 

;CC Gr.2 ascend colon \ r 
;(OD03921) j 6 ' 1 


Kidney Margin 
8120614 

Kidney Cancer 9010320 


28.9 
5.3 


CC Margin 
(OD03921) 


1.6 


Kidney Margin 
9010321 


4.4 


Colon cancer 
metastasis (OD06104) 


4.3 


Kidney Cancer 8 120607 


8.0 


Lung Margin 
(OD06104) 


1.8 


Kidney Margin 
8120608 


17.1 


Colon mets to lung 
(OD04451-01) 


8.6 


Normal Uterus 


47.3 


Lung Margin 
(OD04451-02) 


16.7 


Uterine Cancer 064011 


23.8 


Normal Prostate 


12.3 


Normal Thyroid 


6.0 


Prostate Cancer 
(OD04410) 


3.1 


Thyroid Cancer 064010 


2.7 


Prostate Margin 
(OD04410) 


7.0 

4 


Thyroid Cancer 
A3 02 152 


15.9 


Normal Ovary ; 


28.3 

i 


rhyroid Margin 
A302153 ~ f 


».5 


Ovarian cancer 
(OD06283-03) ( 


>.o 


formal Breast \ 


16.6 


Ovarian Margin 44.1 1 


3reast Cancer '. 


5.8 
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(OD06283-07) 




(OD04566) 




Ovarian Cancer 
064008 


4.3 


Breast Cancer 1024 


20.3 


Ovarian cancer 
(OD06145) 


2.3 


Breast Cancer 

(OD04590-01) 


4.7 


Ovarian Margin 
(OD06145) 


16.4 


orcabi ^/aiicer iviets 
(OD04590-03) 


14.5 


Ovarian cancer 
(OD06455-03) 


1 

0.9 


jBreast Cancer 
(Metastasis (OD04655- 
05) 


11.0 


Ovarian Margin 
(OD06455-07) 


13.0 


oieast Cancer 064006 


18.8 


Normal Lung 


26.1 


Breast Cancer 9100266 


5.3 


Invasive poor diff. lung 
adeno(ODO4945-01 


3.2 


Breast Margin 9100265 


4.4 


Lung Margin 
(ODO4945-03) 


36.6 


Breast Cancer A209073 


6.3 


Lung Malignant 
Cancer (OD03 126) 


8.2 


Breast Mar«in 1 
A2090734" j 18 " 2 


Lung Margin 
(OD03126) 

Lung Cancer 
(OD05014A) 


23.0 ! Breast cancer j 

J(OD06083) j 122 


5 5 jBreast cancer node j 
metastasis (OD06083) ! 


Lung Margin 
(OD050I4B) 


29.1 


Normal Liver 


6-1 


Lung cancer 
(OD06081) 


4.0 


Liver Cancer 1 026 


4.7 


Lung Margin 
(OD06081) 


31.0 


Liver Cancer 1025 


13.0 


Lung Cancer 
(OD04237-01) 


1.5 


Liver Cancer 6004-T 


5.0 


Lung Margin 
(OD04237-02) 


35.1 


Liver Tissue 6004-N 


1.6 


Ocular Melanoma 
Metastasis 


).0 ] 


Liver Cancer 6005-T 


13.7 


Ocular Melanoma 
Margin (Liver) 


1.2 jh 


Jver Tissue 6005-N ' 


J.S 


Melanoma Metastasis 7 


5.7 I 


Jver Cancer 064003 2 


i.5 


Melanoma Margin 
(Lung) 2 


.6.8 ]> 


formal Bladder 1 


2.9 ■ 


Normal Kidney 1 


1.5 t 


Madder Cancer 1023 6 


.5 


Kidney Ca, Nuclear 
grade 2 (OD04338) 5 


5.1 * 


Madder Cancer 
^302173 2 


.0 
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Kidney Margin 
(OD04338) 


7.6 


Normal Stomach 


32.1 


Kidney Ca Nuclear 
grade l/2(OD04339) 


5.6 


Gastric Cancer 9060397 


1.0 


Kidney Margin 
(OD04339) 


14.8 


Stomach Margin 
9060396 


8.5 


Kidney Ca, Clear cell 
type (OD04340) 


18.4 


" ! 

i 

Gastric Cancer 9060395 


10.9 


Kidney Margin 
(OD04340) 


9.3 


; Stomach Margin 
•9060394 


16.0 


jKidney Ca, Nuclear 
|grade 3 (OD04348) 


1.5 


jGastric Cancer 064005 
t 


11.3 


Table TE. Panel 4D 


Tissue Name 

! 


Rel. Exp.(%) 
Ag3221, Run 
164531988 


Tissue Name 


Rel. Exp.(%) Ag3221, 
Run 164531988 


jSecondary Th 1 act 


0.1 


; HUVECIL-lbeta 


12.6 


j — 

ISecondary Th2 act 


0.2 :HUVEC IFN gamma 


[25.5 


i 

jSecondary Trl act 


0.4 


HUVEC TNF alpha* 
IFN gamma 


i 
1 

!19 1 

\ 


i 

(Secondary Th 1 rest 


1.0 


HUVEC TNF alphaT ~ 
IL4 


31 9 


ISecondary Th2 rest 


0.7 


,HUVECIL-U 


21.3 


Secondary Trl rest 


1.3 


Lung Microvascular EC 
none 


48.0 


Primary Thl act 


0.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


100.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


47.3 


Primary Trl act 


0.6 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


44.1 


Primary Thl rest 


1.6 


Bronchial epithelium 
TNFalpha + ILIbeta 


6.8 


Primary Th2 rest 


2.6 


Small airway epithelium 
none 


1.7 


Primary Trl rest 


1.7 


Small airway epithelium 
TNFalpha + IL-lbeta 


6.9 


CD45RA CD4 
lymphocyte act 


3.3 


Coronery artery SMC 
rest 


1.7 


CD45RO CD4 
lymphocyte act 


0.2 


Coronery artery SMC 
TNFalpha + IL-lbeta 


2:1 


CD8 lymphocyte act 


0.4 


Astrocytes rest 


0.0 


Secondary CD8 


0.5 


Astrocytes TNFalpha + 


0.0 
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lymphocyte rest 




IL-lbeta 




Secondary CD8 
lymphocyte act 


0.2 


KU-812 (Basophil) rest 


0.0 


v^L/t lyiiipiiocyie none 


i.i 


KU-812 CBasonhih 
PMA/ionomycin 


0.0 


2ry Thl/Th2ATrI_anti- 


3.5 


CCD 1 106 

(Keratinocytes) none 


0.0 


u/\ts. ceils rest 


0.7 


ICCDI106 
(Keratinocytes) 
TNFalpha + IL-I beta 


0.7 


LAK cells IL-2 


2.4 


Liver cirrhosis 


0.6 


lan cells IL-2+IL-I2 


00 (Lupus kidney 


6.5 


LAK cells IL-2+IFN 
gamma 


06 !nCI-H292 none 


0.5 


LAK cells IL-2+IL-18 


0.5 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


, - _ 

02 jNCI-H292 IL-9 . 

. ■ i . 


0.5 


NK Cells IL-2 rest 
Two Way MLR 3 day 


°;8 INCI-H292 IL-13 jo.O 

I 0 - 7 iNci-H292"lFN' gamma ' jO.O 


1 wo Way MLR 5 day 


00 IflPAECnone i45.l 


Two Way MLR 7 day 


0 2 WAECTNF alpha + L „ ] 
IL-I beta j 41 - 2 j 


PBMC rest 


1 • ] jLung fibroblast none j 1 .5 


PBMCPWM 


0.5 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.2 


PBMC PHA-L 


0.5 


Lung fibroblast IL-4 


0.2 


Ramos (B cell) none 0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.3 


B lymphocytes PWM I 


3.4 


Lung fibroblast I FN 
gamma 


0.2 


B lymphocytes CD40L I 
and IL-4 ( 


).9 

< 


Dermal fibroblast 
CCD 1070 rest 


i.o 


EOL-1 dbcAMP ] 


.3 1 
< 


Dermal fibroblast 
XD1070 TNF alpha X 


5.0 


EOL-1 dbcAMP 
PMA/ionomycin 


1.6 1 
( 


Dermal fibroblast 
XD1070IL-1 beta f 


i.3 


Dendritic cells none 1 


.3 I 


Dermal fibroblast IFN v 
^arnma j' 


M 


Dendritic cells LPS 1 


.0 [ 


Dermal fibroblast IL-4 9 


.7 


Dendritic cells anti- 
CD^ 3 


•7 I 


BD Colitis 2 0 


.7 
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Monocytes rest 


5.7 


IBD Crohn's 


3.0 


Monocytes LPS 


4.5 


Colon 


9.4 


Macrophages rest 


12.4 


Lung 


17.0 


Macrophages LPS 


1.4 


Thymus 


14.0 


HUVECnone 


17.9 


Kidney 


7.4 


HUVEC starved 


16.6 


i 



expression of this gene at low levels in the brains of an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
see Panel 1.4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

Panel 1.3D Summary: Ag3221 Highest expression of the CG57368-01 gene is 
detected in mammary gland (CT=30). High expression of this gene is seen mainly in the 
normal tissue samples suggesting an important role in cellular function. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 2.2 Summary: Ag3221 Highest expression of the CG57368-01 gene is 
detected in control kidney (OD04348) sample (CT=32.7). Expression of this gene is 
generally higher in the normal control margin samples as compared to the cancer tissues. 
Interestingly, expression of this gene is higher in kidney cancer nuclear grade 2 
(OD04338) (Ct=33.5) as compared to corresponding control sample (Ct=36.4). Therefore, 
expression of this gene can be used as a diagnostic marker for nuclear grade 2 kidney 
cancer and therapeutic modulation of this gene could be beneficial in the treatment of 
kidney cancer. 
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Panel 4D Summary: Ag3221 Highest expression of the CG57368-01 gene is 
detected in TNFalpha + IL-lbeta treated lung microvascular EC (CT=28). In addition, 
moderate to low expression of the gene is also seen in resting primary and secondary 
Thl,Th2,Trl, B lymphocytes, LAK cells, dendritic cells, monocytes, macrophages, 
5 endothelial cells, eosinophils, small airway epithelium, dermal fibroblasts, lupus kidney 
and normal tissues represented by colon, lung, thymus and kidney. Therefore, therapeutic 
modulation of this gene through the use of small molecul drug may be useful in the 
treatment of autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
10 osteoarthritis 

Panel CNS_1 Summary: Ag3221 Results from one experiment with the 
CG57368-01 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. 



15 U. CG59955-01: GPCR 

Expression of gene CG59955-01 was assessed using the primer-probe set Ag3637, 
described in Table UA. 

Table UA . Probe Name Ag3637 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -acactaggaagccccatgtact-3 ' 


22 


189 


278 


Probe 


TET-5 • -tgtcctttgcagattcttgcttttca-3 ' -TAMRA 


26 


226 


279 


Reverse 


5 ' -aattaatctaggggctgtggaa-3 ' 


22 


254 


280 



CNS_neurodegeneration_vLO Summary: Ag3637 Expression of the CG59955- 



20 01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

General_screeningjpanel_vl.4 Summary: Ag3637 Expression of the CG59955- 
01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4.1D Summary: Ag3637 Expression of the CG59955-01 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 
25 General oncology screening panel_v_2.4 Summary: Ag3637 Expression of the 

CG59955-01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not 
shown.) 
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V. CG89211-01: GPCR 

Expression of gene CG8921 1-01 was assessed using the primer-probe set Ag3691, 
described in Table VA. Results of the RTQ-PCR runs are shown in Table VB. 

Table VA . Probe Name Ag3691 



1 — 

jPrimers 

i " — 


Sequences 


Length 


Start 
Position 


SEQID 
No 


[Forward 


5 ' -tgcattttaattcgtccagttc-3 ' 


22 


464 


281 


|Probe 


TET-5 ' -cactcccgataatctatctcatctaccg-3 ' -TAMRA 


28 


492 


282 


jReverse 


5 1 -atgagcctgacaaaatggtaaa-3 1 


22 


520 


283 



5 Table VB , General oncology screening paneljv_2.4 



Tissue Name 


Rel Evn (%\ Ao>fiQl 
Run 267739062 


Tissue Name 


Run 267739062 


Colon cancer 1 


0.0 


DldUUCI CaflUcr In/A. 1 Z. 


n n 


Colon NAT 1 


0.0 


Bladder cancer NAT 3 


0.0 


jColon cancer 2 


0.0 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 

r 


0.0 


Adenocarcinoma of the 
prostate 1 


100.0 j 


'Colon cancer 3 


0.0 


Adenocarcinoma of the 
prostate 2 


0.0 

* 

! 


SColon cancer NAT 

13 


0.0 


Adenocarcinoma of the 
nro<;tflt£* ^ 


-1 

o.o ! 


Colon malignant 
cancer 4 


0.0 


Adenocarcinoma of the 
prostate 4 


0.0 


Colon normal 
adjacent tissue 4 


0.0 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


0.0 


Adenocarcinoma of the 
prostate 6 


0.0 


Lung NAT 1 


0.0 


Adenocarcinoma of the 
prostate 7 


67.4 


Lung cancer 2 


0.0 


Adenocarcinoma of the 
prostate 8 


0.0 


Lung NAT 2 


0.0 


Adenocarcinoma of the 
prostate 9 


0.0 


Squamous cell 
carcinoma 3 


0.0 


Prostate cancer NAT 10 


0.0 


Lung NAT 3 


0.0 


Kidney cancer 1 


0.0 


metastatic 
melanoma 1 


0.0 


KidneyNAT 1 


0.0 


Melanoma 2 


0.0 


Kidney cancer 2 


0.0 


Melanoma 3 


0.0 


Kidney NAT 2 


0.0 
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metastatic 
melanoma 4 



metastatic 
melanoma 5 



0.0 



88.3 



Kidney cancer 3 



Kidney NAT 3 



0.0 



0.0 



Bladder cancer 1 



0.0 



Kidney cancer 4 



0.0 



Bladder cancer 
NAT 1 



0.0 



Kidney NAT 4 



0.0 



Bladder cancer 2 



0.0 



CNS_neurodegeneration_vl.O Summary: Ag36^^pressTon of the CG892l"l- 
01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

General_screening_panel_vl.4 Summary: Ag3691 Expression of the CG8921 1 - 
01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4.1D Summary: Ag369l Expression of the CG8921 1-01 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

General oncology screening panel_v_2.4 Summary: Ag3691 Expression of the 
CG8921 1-01 gene is low/undctectable in all samples on this panel (CTs>35). (Data not 
shown.) 



W. CG90530-02: Ubiquitin-Conugating Enzyme 

Expression of full length physical clone CG90530-02 was assessed using the 
primer-probe sets Ag3410 and Ag4036, described in Tables WA and WB. Results of the 
RTQ-PCR runs are shown in Tables WC, WD, WE and WF. 



Table WA . Probe Name Ag3410 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -tgagaggtacccatttgaacct-3 ' 


22 


372 


284 


Probe 


TET-5 ' -cctcagatccgatttctcactccaat-3 ' -TAMRA 


26 : 


346 


285 


Reverse 


5 ' -aaatccttccagcagaatcaat-3 * 


22 


311 


286 



Table WB . Probe Name Ag4036 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 1 -tgagaggtacccatttgaacct-3 • 


22 


372 


287 


Probe 


TET-5 • -cctcagatccgatttctcactccaat-3 ' -TAMRA 


26 


346 


288 


Reverse 


5 1 -aaatccttccagcagaatcaat-3 ■ 


22 


311 


289 



Table WC. CNS_neurodegeneration_vl.O 
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Tissue Name 


Rel.Exp.(%)Ag3410, 

Run 7101^1 A<; 


Tissue Name 


Rel. Exp.(%) 
Ag3410, Run 
210350165 


AD 1 Hinoo 


4 7 


Control (Path) 3 Temporal 
Ctx 


1.0 


AD 2 Hippo 


17.4 


Control (Path) 4 Temporal 
Ctx 


28.7 


AD 3 Hippo 


2.1 


AD 1 Occipital Ctx 


5.4 


AD 4 Hippo 


2.2 


AD 2 Occipital Ctx 
(Missing) 


;o.o 


AD 5 Hippo J80.7 


AD 3 Occipital Ctx ||.6 


AD 6 Hippo [32.3 


AD 4 Occipital Ctx 


14.1 


Control 2 Hippo |30.4 


AD 5 Occipital Ctx 


48.3 


Control 4 Hippo 


0.8 


AD 6 Occipital Ctx 


10.2 


Control (Path) 3 1 
Hippo 


0.9 


Control 1 Occipital Ctx 


0.5 


AD 1 Temporal 

Ctx l 4 '^ 

•AD 2 Temporal ;_ n ., 

|Ctx ! 29j 


Control 2 Occipital Ctx j76.8 
Control 3 Occipital Ctx |7.0 


jAD 3 Temporal ! 

;ctx i'- 7 


— - 

Control 4 Occipital Ctx 


jl.9 


AD 4 Temporal j 

Ctx r 9 

t 


Control (Path) 1 Occipital 
Ctx 


84.1 


AD 5 Inf 
Temporal Ctx 


100.0 ' 


Control (Path) 2 Occipital 
Ctx 


4.5 


AD 5 Sup 
Temporal Ctx 


20.7 


Control (Path) 3 Occipital 
Ctx 


0.0 


AD 6 Inf 
Temporal Ctx 


29.3 


Control (Path) 4 Occipital 
Ctx 


7.7 


AD 6 Sup 
Temporal Ctx 


30.8 


Control 1 Parietal Ctx 


2.0 


Control 1 ~T 
Temporal Ctx 


D.5 


Control 2 Parietal Ctx 


23.2 


Control 2 
Temporal Ctx 


31.9 i 


Control 3 Parietal Ctx 


6.8 


Control 3 
Temporal Ctx 


3.1 

( 


Control (Path) 1 Parietal 


73.2 


Control 3 
Temporal Ctx 


.4 < 

( 


Control (Path) 2 Parietal 
Ctx 


14.5 


Control (Path) 1 4 
Temporal Ctx 


>6.3 J 


Control (Path) 3 Parietal 

:tx • 


1.7 


Control (Path) 2 J23.7 ( 


:ontrol (Path) 4 Parietal 3 


>6.6 
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Temporal Ctx 



jctx 



Table WD . General_screening_panel_vl.4 



Tissue Name 



Rel. Exp 
Ag3410, Run 
216839094 



Rel. 

Exp.(%) 
Ag4036, 
Run 

220283343 



Tissue Name 



Rel. 

Exp.(%) 
Ag3410, 
Run ' 
216839094 



Rel. 

Exp.(%) 
Ag4036, 
Run 

2202833 



jAdipose 


;o.i 


0.3 


jRenalca. TK-10 


Il4 0 


01 0 


jMelanoma* Hs688(A).T 


'12.0 


j 2 .5 "~" 


bladder 




1 7 
1 . / 


ivieidnorna riSuoo^tS ). 1 


2.2 


1.9 


jGastric ca. (liver 
lmet.)NCI-N87 


13.1 


13.2 


Melanoma* MI4 


45.1 


q Gastric ca. KATO 

j4 - 9 ;in 




n a 1 

vO.l 


Melanoma* LOXIMVI 


11.0 


17.4 


{Colon ca. SW-948 


13.6 


17.2 


Melanoma* SK-MEL-5 


55.5 


44.8 


Colon ca. SW480 


j 100.0 


100.0 


Squamous cell 
carcinoma SCC-4 


14.2 


22.1 


jColon ca.* (SW480 
'met) SW620 


(57.0 

! 


49.7 


Testis Pool 


;2.3 


8.0 


Colon ca. HT29 


;21.9 


: - ~ .. . 

122.8 ] 


Prostate ca.* (bone met) 
PC-3 

! 


I3.6 

i 


5.7 


Colon ca. HCT-1 16 


82.9 


j 

'86.5 j 


j Prostate Pool , 


i'0.3 


0.0 Colon ca. CaCo-2 


|1 /.o 


121.0 j 


] Placenta 


0.7 


0.8 


jColon cancer tissue 


14.2 


jM 


uterus rool 


0.2 


0.0 


{Colon ca. SW1116 


4.4 


4.3 


Ovarian ca. OVCAR-3 


30.8 


21.5 


Colon ca. Colo-205 


12.9 


12.5 


Ovarian ca. SK-OV-3 


33.7 


31.4 


Colon ca. SW-48 


14.9 


26.2 


Ovarian ca. OVCAR-4 


27.2 


30.6 


Colon Pool 


0.9 


0.8 


Ovarian ca. OVCAR-5 


28.3 


24.7 


Small Intestine Pool 


0.3 


0.2 


Ovarian ca. IGROV-1 


9.4 


9.5 


Stomach Pool 


0.4 


0.3 


Ovarian ca. OVCAR-8 


8.2 


6.2 


Bone Marrow Pool 


0.1 


0.1 


Ovary 


0.6 


0.6 


Fetal Heart 


5.3 


5.0 


Breast ca. MCF-7 


32.1 


25.2 


Heart Pool 


0.5 


0.5 


Breast ca. MDA-MB- 
231 


47.0 


51.8 


Lymph Node Pool 


2.1 


1.5 


Breast ca. BT 549 


55.1 


50.7 


Fetal Skeletal 
Muscle 


1.1 


1.2 


Breast ca. T47D 


51.8 


33.9 


Skeletal Muscle 
Pool 


2.2 


1.5 


Breast ca. MDA-N 


24.5 


15.9 


Spleen Pool < 


3.7 l 


3.6 


Breast Pool 


0.8 |0.6 


Thymus Pool : 


2.9 


3.8 


Trachea | 


3.5 jo.5 


CNS cancer : 


>7.9 


17.8 
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(guo/astro) U87-MG 






Lung 


ft 1 


A fi 


CNS cancer 
(glio/astro) U-118- 
MG 


49.0 


48.6 


Fetal Lung 


3.1 


2.2 


CNS cancer 
(neuro;met) SK-N- 
AS 


41.8 


47.0 


i 

;Lungca.NCI-N417 

I 


27.9 


25.0 


CNS cancer (astro) oq Q 


6.8 


1 

Lung ca. LX-1 


40.9 


38.7 


CNS cancer (astro) " 
SNB-75 ^ 4 


29.7 


Lung ca. NC I -H 1 46 


— 

4.4 


'■■ 

6.5 


CNS cancer (glio) ! Q 
SNB-19 ~ r A 


7.6 


Lung ca. SHP-77 


59.5 


11.4 


CNS cancer (glio) • , 
SF-295 ^ : 4 * 6 


5.4 


■Lung ca. A549 

i 


17.6 


26.1 


Brain (Amygdala) I. c 
Pool W L6 


1.2 


jLung ca. NCI-H526 


1 1.6 


12.0 


Brain (cerebellum) j 1 .5 


0.9 


! Lung ca. NCI-H23 

i 


14.6 


22.1 'Brain (fetal) 3.6 


3.6 


iLung ca. NCI-H460 

; 


2.5 p.4 

■ 


Brain n 
(Hippocampus) Pool ; 


1.2 


jLung ca. HOP-62 




4.1 


Cerebral Cortex ^ 
Pool " 4 




1.4 


Lung ca. NCI-H522 


17.4 


23.5 


Brain (Substantia 
nigra) Pool 


1.8 


1.0 


Liver 


0.1 


0.1 


Brain (Thalamus) 
Pool 


2.4 


2.0 


Fetal Liver 


16.0 


12.2 


Brain (whole) 


2.8 


2 4 


Liver ca. HepG2 


4.0 


6.1 


Spinal Cord Pool 


0.5 


0.4 


Kidney Pool 


0.5 


0.3 


Adrenal Gland 


0.2 


0.4 


retal Kidney 


5.6 


4.0 


Pituitary gland Pool 


0.1 


0.3 


Renal ca. 786-0 


16.5 


21.0 


Salivary Gland 


0.1 


0.1 


Renal ca. A498 


3.1 


2.8 


Thyroid (female) 


0.4 


0.2 


Renal ca. ACHN 


10.2 


10.8 


Pancreatic ca. 
CAPAN2 


52.5 


39.2 


Renal ca. UO-31 


4.7 


7.6 


Pancreas Pool 


1.1 


1.0 



Table WE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3410, Run 
165296375 


Tissue Name 


Rel. Exp.(%) 
Ag3410, Run 
165296375 


Secondary Thl act 


29.5 


HUVEClL-lbeta 


3.4 


Secondary Th2 act 


19.5 


HUVEC I FN gamma 


6.2 
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Secondary Trl act 


29.9 


HUVEC TNF alpha + I FN 
gamma 


5.1 


Secondarv Th 1 rp^t 




HUVEC TNF alpha + IL4 


4.6 


Secondary Th2 rest 


1.3 


HUVEC IL- 11 


4.0 


Secondary Trl rest 


1.1 


Lung Microvascular EC none 


3.8 


i 

jPrimary Th I act 


10.6 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


->.8 


Primary Th2 act 


1 1.7 


Microvascular Dermal EC none 


6.0 


'Primary Trl act 


25.7 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


i3.7 


iPrimary Thl rest • 


23.7 


Bronchial epithelium TNFalpha 
-HLlbeta 


1 .6 


Primary Th2 rest 


4.5 


Small airway epithelium none 


0.3 


L . 

•Primary Trl rest 

i 


20 3 


Small airway epithelium 
TNFalpha + IL-1 beta 


5.1 


: CD45RA CD4 
lymphocyte act 


13.1 


Coronery artery SMC rest 


2.5 


CD45RO CD4 
lymphocyte act 


16.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


i 3 - 2 


CDS lymphocyte act 


13.9 


Astrocytes rest \\3 


^Secondary CD8 
lymphocyte rest 


-> 0 2 jAstrocytes TNFalpha + IL- L £ 
jlbeta j 06 


jSecondary CD8 
lymphocyte act 


13.7 


KU-812 (Basophil) rest 


10.9 


CD4 lymphocyte 
none 


0.1 


KU-8 12 (Basophil) 
PMA/ionomycin 


21.8 


2ry 

Thimi2^Trl anti- 
CD95 CH11 


3.0 


CCD1 106 (Keratinocytes) none 


12.4 


LAK cells rest 


2.0 


CCD 11 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


2.6 


LAK cells IL-2 


9.5 


Liver cirrhosis 


0.0 


LAK cells IL-2+IL- 
12 


19.9 


Lupus kidney 


0.1 


LAK cells IL-2+IFN , 
gamma 


23.0 1 


NCI-H292 none 


12.2 


LAK cells IL-2+ IL- 

1 c 

1 0 


17.3 ] 


MCJ-H292 IL-4 


30.8 


LAK cells 
PMA/ionomycin 


).8 ] 


^JCI-H292 IL-9 


25.7 


NIC Cells IL-2 rest 


10.6 l 


SICI-H292IL-13 < 


?.9 


Two Way MLR 3 . 
day 


5.0 •] 


MCI-H292 IFN gamma 


12.4 
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l wo way MLK D 
day 


9.0 1 


HPAEC none 


4.5 


Two Way MLR 7 
day 


7 1 
/.I 


HPAEC TNF alpha + IL-1 beta 


2.5 


PBMC rest 


0.2 


Lung fibroblast none 


1.7 


PBMC PWM 


47.0 


Lung fibroblast INr alpha + ; 0 ^ 
IL-l beta 


PBMC PHA-L 


18.2 


Lung fibroblast IL-4 2.3 


Kamos (ts ecu) none 




Lung fibroblast IL-9 0.4 


Ramos (B cell) 
ionomycin 


39.5 


Lung fibroblast IL-1 3 1.9 

1 


B lymphocytes 
PWM 


100.0 


] 

Lung fibroblast I FN gamma j 1 .2 


B lymphocytes 
CD40L and IL-4 


9.3 


Dermal fibroblast CCD 1 070 j 2 { Q 
rest : 


EOL-1 dbcAMP 


4.8 


Dermal fibroblast CCD 1070 J.j 4 
TNF alpha ; 


EOL-1 dbcAMP ^ ? 
jPMA/ionomycin j 


Dermal fibroblast CCD 1070 IL- 
1 beta 


.9.8 

i 


{Dendritic cells none !0.8 


Dermal fibroblast I FN gamma 


H. i 


^Dendritic cells LPS 


[0.3 " 


Dermal fibroblast IL-4 
IBD Colitis 2 


5.8 


Dendritic cells anti- 
CD40 


0.4 

L 


Monocytes rest 


0.1 


1BD Crohn's jO.l 


Monocytes LPS 


0.0 


Colon 


0.9 


Macrophages rest 


2.7 


Lung 


0.6 


Macrophages LPS 


0.1 


Thymus 


0.4 


HUVEC none 


8.0 


Kidney 


8.1 


HUVEC starved 


28.1 







Table WF. General oncology screening panel_v_2.4 



Tissue 
Name 


Rel. Exp.(%) 
Ag3410, Run 
267171484 


Rel. Exp.(%) 
Ag4036, Run 
268362917 


Tissue Name. 


Rel. Exp.(%) 
Ag3410, Run 
267171484 


Rel. Exp.(%) 
Ag4036, Run 
268362917 


Colon 
cancer 1 


34.4 


11.3 


Bladder cancer NAT 2 


0.1 


0.0 


Colon 

cancer NAT 
1 


12 J 


10.8 


Bladder cancer NAT 3 


0.1 


0.0 


Colon 
cancer 2 


42.9 


7.1 


Bladder cancer NAT 4 


0.6 


0.0 


Colon 

cancer NAT 


2.9 


7.0 


Adenocarcinoma of 
the prostate 1 


3.2 


0.0 
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2 












Colon 
cancer 3 


70.2 


47.3 


Adenocarcinoma of 

UIC JJlUolalC Z. 


0.7 


0.0 


Colon 

cancer NAT 
3 


14.6 


18.0 


Adenocarcinoma of 
the prostate 3 


0.9 


0.0 


Colon 
malignant 
cancer 4 


90.8 


68.8 


Adenocarcinoma of 
the prostate 4 


11.7 


6.6 


Colon 

nnnYtA 1 

HUi Ilia! 

adjacent 
tissue 4 


6.2 


1.7 


Prostate cancer NAT 5 




1.3 


0.6 


Lung cancer 
1 


10.2 


0.8 


Adenocarcinoma of 
the prostate 6 


0.5 


0.0 


Lung NAT 
1 


0.1 


0.0 


Adenocarcinoma of 
the prostate 7 


0.8 


0.0 


Lun a cancer 
2 


100.0 


100.0 


Adenocarcinoma of 
the prostate 8 


0.1 


0.0 


Lun<y NAT 
2 


0.1 


0.0 


Adenocarcinoma of 
the prostate 9 


5.3 


1.7 


Souamous 
cell 

carcinoma 3 


26.2 


17.6 


Prostate cancer NAT 

1 A 


0.1 


0.0 


Lung NAT 
3 


0.2 


0.0 


Kidney cancer 1 


4.6 


3.3 


metastatic 
melanoma 1 


0.9 


0.0 


KidneyNAT 1 


0.6 


0.5 


Melanoma 2 


0.2 


0.0 


Kidney cancer 2 


21.0 


15.0 


Melanoma 3 ! 


0.9 


0.0 


Kidney NAT 2 


2.9 


2.0 


metastatic 
melanoma 4 


7.6 


3.4 


Kidney cancer 3 


2.8 


1.5 


metastatic 
melanoma 5 


7.7 


3.1 


Kidney NAT 3 


0.7 


0.7 


Bladder 
cancer 1 


0.3 


0.0 


Kidney cancer 4 


2.1 


2.0 


Bladder 
cancer NAT 
1 


0.0 


0.0 


Kidney NAT 4 


0.6 


1.0 


Bladder 
cancer 2 


0.4 


0.2 









CNS_neurodegeneration_vl.O Summary: Ag3410 This panel does not show 
differential expression of the CG90530-02-01 gene in Alzheimer's disease. However, this 
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expression profile confirms the presence of this gene in the brain. Please see Panel 1.4 for 
discussion of utility of this gene in the central nervous system. 

General_screeningj)aneI_vL4 Summary: Ag3410/Ag4036 Two experiments 
with the same probe and primer set produce results that are in excellent agreement. 
5 Highest expression of the CG90530-02 gene is seen in a colon cancer cell line (CTs=25- 
26). In addition, expression is significantly higher in all the cancer cell lines on this panel 
when compared to expression in the normal tissues. Thus, expression of this gene could be 
used as a marker of cancer. In addition, this gene is expressed at much higher levels in 
fetal lung, liver, kidney and heart tissue (CTs=28-30) when compared to expression in the 

1 0 adult counterpart (CTs=33-36). Thus, expression of this gene may be used to differentiate 
between the fetal and adult source of these tissues. This gene encodes a putative member 
of the ubiquitin conjugating enzyme family. Ubiquitin-dependent protein degradation 
plays a role in many cellular processes and has been shown to be upregulated in some 
cancers (Eliseeva E. Cell Growth Differ 2001 Aug;12(8):427-33) Furthermore, higher 

15 levels of expression of this gene in cancer cell lines and fetal tissues suggests that 

therapeutic modulation of the expression or function of this gene may be useful in the 
treatment of cancer. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 

20 muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic function and that 
disregulated expression of this gene may contribute to neuroendocrine disorders or 
metabolic diseases, such as obesity and diabetes. 

This gene is also expressed at moderate levels in the CNS, including the 

25 hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 

Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Panel 4D Summary: Ag3410 Highest expression of the CG90530-02 is seen in 

30 PWM stimulated B lymphocytes (CT=23.7). High levels of expression are also seen in 
activated T cells, LAK cells, dermal fibroblasts and the pulmonary mucoepidermoid cell 
line NCI-H292. Thus, therapeutic regulation of the transcript or the protein encoded by the 
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transcript could be important in immune modulation and in the treatment of T and B cell- 
mediated diseases such as asthma, arthritis, psoriasis, IBD, and lupus. 

General oncology screening panel_v_2.4 Summary: Ag3410/Ag4036 Two 
experiments with the same probe and primer set produce results that are in excellent 
5 agreement. Highest expression of the CG90530-02 gene is seen in lung cancer (CTs=28- 
31). In addition, this gene appears to be overexpressed in lung, kidney and colon cancers 
when compared to expression in normal adjacent tissue. Thus, expression of this gene 
could be used as a marker of lung and colon cancer. Ubiquitinylation is a cyclical process 
operating in all cells to target specific proteins (eg, p53) for degradation. Abnormal 
10 accumulations of ubiquitinylated proteins have been identified in colorectal carcinoma. 
This gene encodes a putative ubiquitin conjugating enzyme. Therefore, therapeutic 
modulation of the expression or function of this gene could be effective in the treatment of 
lung, kidney and colon cancer. 

15 X.CG93076-01:GPCR 

Expression of full length physical clone CG93076-01 was assessed using the 
primer-probe set Ag2129, described in Table XA. Results of the RTQ-PCR runs are 
shown in Tables XB and XC. 

Table XA . Probe Name Ag2129 



Primers 


Sequences 


Length! Start 
™ 1 Position 


SEQ ID 
No 


Forward 


5 1 -gcaacagcatggtgatctg-3 1 


19 ] 133 


290 


Probe 


TET-5 ' -ctttcgaatgcacaggaaccccttct-3 1 -TAMRA 


26 j 161 


291 


Reverse 


5 1 -cgccaggttgaggatatagat-3 ' 


21 ] 189 


292 



20 Table XB. Panel 1.3D 



Tissue Name 


Rel.Exp.(%) Ag2129, 
Run 148719315 


Tissue Name 


Rel.Exp.(%)Ag2129, 
Run 148719315 


Liver 

adenocarcinoma 


11.7 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


Renal ca. A498 


22.8 


Adrenal gland 


6.8 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca.UO-31 


0.0 



516 



WO 03/010327 



PCI7US02/14199 



Pituitary gland 


118 jRenal ca. TK- 1 0 


11.4 


Brain (fetal) 


51.8 (Liver 


0.0 


Brain (whole) 


14.9 jLiver (fetal) 


24.0 




i a <■) jLiver ca. (hepatoblast) 

|HepG2 


9.5 


IBrain (cerebellum) 

i — — 


55.1 [Lung 


0.0 


jBrain 

[(hippocampus) 


U.I | Lung (fetal) 


24.3 


Brain (substantia 
nigra) 


n a jLung ca. (small cell) 

|LX-1 


0.0 


Brain (thalamus) 


12 2 ' JLung ca. (small cell) 

|NCI-H69 


0.0 


.Cerebral Cortex 

i 


0.0 


jLung ca. (s.cell var.) 
SHP-77 


0.0 


r ' 

Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


jglio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 

Lung ca. (non-s.cell) 
NCI-H23 


24.8 


glio/astroU-l 18- 
MG 


0.0 


100.0 


astrocytoma _ 
;SW1783 


Lung ca. (non-s.cell) 
HOP-62 

Lung ca. (non-s.cl) 
NCI-H522 


0.0 

- ■ 1 


neuro*; met SK-N- j_ 
;AS j^ 5 


9.2 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) SW 
900 


0.0 


astrocytoma SNB- 
75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


4.3 


glioma U251 


21.6 


Breast ca.* (pl.ef) 
MCF-7 


0.0. 


glioma SF-295 


8.7 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) 
T47D 


0.0 


Heart i 


0.0 


Breast ca. BT-549 


0.0 


Skeletal muscle 
(fetal) 


78.5 


Breast ca. MDA-N ' 


?.8 


Skeletal muscle ( 


).0 


Ovary 1 


3.0 


Bone marrow 


1 2.5 jOvarian ca. OVCAR-3 


10.4 


Thymus 


1 4.9 jOvarian ca. OVCAR-4 1( 


).0 


Spleen ( 


) 0 jOvarian ca. OVCAR-5 


12.2 
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T vmnh nnrl/a, 
jL/ympn jiuuc 


Oft *7 
Zo. / 


Ovarian ca. OVCAR-8 


16.5 


Colorectal 


76.3 


Ovarian ca. IGROV-1 


22.1 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0' 


Uterus 


81.8 


Colon ca. SW480 


19.2 


Placenta 


0.0 


Colon ca.* 

SW620(SW480 

met) 

| _ _ 


0.0 


Prostate 


0.0 


Colon ca HT2Q 


o n 


Prostate ca.* (bone 
met)PC-3 


. 

0.0 


Colon ca. HCT-116 


0.G 


Testis 


55.1 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC- 
2998 


0.0 jivlelanoma UACC-62 


0.0 


Gastric ca.* (liver j 
met)NCI-N87 \ 

* -j 


14.5 j 


Melanoma M 14 


0.0 


Bladder 


o.o " " : 


Melanoma LOX IMVI ] 


0.0 j 


Trachea 


o.o 1 


Melanoma* (met) SK- ! 
MEL-5 j 


16.5 


Kidney jo.O 


Adipose J 


20.3 



Table XC. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2129, Run 
165823129 


Tissue Name 


Rel. Exp.(%) 
Ag2129, Run 
165823129 


Secondary Thl act 


3.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


12.6 


HUVEC I FN gamma 


3.8 


Secondary Trl act 


7.5 


HUVEC TNF alpha + 1FN 
gamma 


2.6 


Secondary Thl rest 


12.3 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


8.7 


HUVEC IL- 11 


0.0 


Secondary Trl rest 


15.7 


Lung Microvascular EC none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


2.3 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


5.2 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


84.7 


Bronchial epithelium TNFalpha 
+ lLlbeta 


4.8 
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Primary Th2 rest 


34.6 


Small airway epithelium none 


0.0 


Primary Trl rest 


17.1 


Small airway epithelium 
TNFalpha + IL-lbeta 


5.7 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


4.3 


|CD45RO CD4 
ilymphocyte act 


6.5 


Coronery artery SMC TNFalpha 
+ IL-1 beta 


0.0 


CD8 lymphocyte act 


2.4 


Astrocytes rest 


0.0 


iSecondary CD8 
ilymphocyte rest 


9.0 


Astrocytes TNFalpha + IL-1 beta 


0.0 


Secondary CD8 
Ilymphocyte act 


1.3 


KU-8 12 (Basophil) rest 


0.0 


^ f** A I » r\\t r\r* \ rk-c* 

lympnocyte 

jnone 


39.8 


KU-812 (Basophil) 
PMA/ionomycin 


5.0 


l2ry 

!Thl/Th2/Trl anti- 
•CD95CHI 1 


45.7 




CCD1 106 (Keratinocytes) none 


0.0 

i 


I — 

i 

'LAK cells rest 


9.8 

i 


CCD1 106 (Keratinocytes) 
TNFalpha -HL-1 beta' 


0.0 ' 


;LAK cells IL-2 


31.9 


Liver cirrhosis 


loo.o i 


LAK cells IL-2+IL- 
M 




17.6 


Lupus kidney 


0.0 ! 


LAK cells lL-2+iFNf 
Igamma 


20.0 


NCI-H292 none 


_ „ i 

7.3 

i 


LAK cells IL-2+IL- 
18 


13.3 


NCI-H292 IL-4 


10.6 


LAK cells 
PMA/ionomycin 


10.1 


NCI-H292 IL-9 


4.1 


NK Cells IL-2 rest 


8.5 


NCI-H292 IL-1 3 


6.3 


Two Way MLR 3 
day 


9.2 


NC1-H2921 FN gamma 


1.2 


Two Way MLR 5 
day 


2.5 


HPAEC none 


0.0 


Two Way MLR 7 
day 


0.0 


HPAEC TNF alnha + II -1 beta 


0 0 


PBMC rest 


8.1 


Lung fibroblast none 


2.5 


PBMC PWM 


3.8 


Lung fibroblast TNF alpha + IL- 
1 beta 


9.0 


PBMC PHA-L 


1.7 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes 


5.8 


Lung fibroblast I FN gamma 


0.0 
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PWM 








B lymphocytes 
CD40L and IL-4 


14 9 


Dermal fibroblast CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1 070 
TNF alpha 


5.7 


EOL-1 dbcAMP 

P\4 A /mnAmur t n 

* lviy-y/ lUIIUI 1 lyVslJI 


8.7 


Dermal fibroblast CCD 1070 IL-1 
beta 


0.0 


l/ciiui mi. L>ciib none 


0.0 


Dermal fibroblast IFN gamma 


0.0 


Of»nrlritir» r#»Ilc I PC 


fi A 
U.U 


Dermal fibroblast IL-4 


0.0 | 


Dendritir p^IIq nnti- 

CD40 


5.1 


IBD Colitis! 


5.3 


Monocytes rest 


3.4 


IBD Crohn's 


4 7 


Monocytes LPS 


2.5 1 


Colon 


97.9 


Macrophages rest 


0.0 


Lung 


28.7 


Macrophages LPS 


0.0 


Thymus 


2.5 


HUVEC none 


0.0 


Kidney 


21.2 


HUVEC starved 


0.0 






Panel 1.3D Summary: Ag2129 Expression of the CG93076-01 gene is restricted 



to a sample derived from a lung cancer cell line (CT=34.3). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as < 
marker to detect the presence of lung cancer. Furthermore, therapeutic modulation of the 
5 expression or function of this gene may be effective in the treatment of lung cancer. 

Panel 4D Summary: Ag2129 Expression of the CG93076-01 gene is highest in 
liver cirrhosis (CT=3 1:8). Low but significant levels of expression are seen in primary 
resting T cells, colon, kidney, lung, LAK cells, and untreated CD4s. Expression of this 
gene is not detected in normal liver in Panel 1.3D, suggesting that its expression is unique 
1 0 to liver cirrhosis. This gene encodes a putative GPCR; therefore, antibodies or small 
molecule therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. In 
addition, antibodies to this putative GPCR could also be used for the diagnosis of liver 
cirrhosis. 

15 Y. CG94235-01: Thymidilate Kinase 

Expression of gene CG94235-01 was assessed using the primer-probe sets Agl980 
and Ag3909, described in Tables YA and YB. Results of the RTQ-PCR runs are shown in 
Tables YC, YD, YE, YF, YG, YH, YI and YJ. 

Table YA . Probe Name Agl980 
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Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 • -gggtacgatggctgaagtaaa-3 ■ 


21 


1437 


293 


Probe 


TET-5 1 -ccagttttctgccacacaacatgctt-3 ' -TAMRA 


26 


1394 


294 


Reverse 


5 1 -ttatgcagtgttcccaaatttc-3 • 


22 


1364 


295 


Table YB. Probe Name Ag3909 


Primers 


i 

Sequences Length 


Start 1 SEQ ID 
Position No 


Forward 


5 1 -caggtgccacgtctaactagat-3 1 j 22 


1307 ! 296 


Probe 


TET-5 ' -tgttgtttgaaacatctacatccacca-3 ' -TAMRP 


i 27 


1333 { 297 


Reverse 


5 1 -gaaatttgggaacactgcataa-3 • 


22 


1364 | 298 



Table YC . AI_comprehensive panel vl.O 



Tissue Name 


Rel. 

Exp.(%) 
Agl980. 
Run 

211061884 


Rel. Exp.(%) 
Agl980. Run 

1212317511 

j 


Tissue Name 


Rel. Exp.(%) 
!Agl980, Run 
|211061884 
i 


Rel. Exp.(%) 
Agl980. Run 
lilj l n 1 1 


1 10967 COPD- 


10.1 jll.6 

i 


112427 Match : 
Control Psoriasis-F 


100.0 


1 I0980COPD- 
F 


12.5 jI6.S 

! ! 


! 

112418 Psoriasis-M '19.1 

1 


18.7 


1 10968 COPD- 
M 


13.9 


1 

13.5 


112723 Match ; 
Control Psoriasis-M j 


7.2 


110977 COPD- 
M 


50.3 


32.8 


112419 Psoriasis-M 


35.6 


17.0 


110989 

Emphysema-F . 


20.7 


24.5 


112424 Match 
Control Psoriasis-M 


7.6 • 


10.1 


110992 

Emphysema-F 


12.2 


6.0 


112420 Psoriasis-M 


54.0 


53.2 


1 10993 

Emphysema-F 


12.9 


8.7 


112425 Match 
Control Psoriasis-M 


62.4 


72.7 


110994 

Emphysema-F 


7.0 


5.8 


104689 (MF)OA 
Bone-Backus 


66.0 


56.3 


110995 

Emphysema-F : 


17.0 


17.6 


104690 (MF) Adj 
"Normal" Bone- 
Backus 


34.6 


24.0 


110996 

Emphysema-F 


3.3 


7.0 


104691 (MF) OA 
Synovi urn-Backus 


36.6 


40.9 


110997 
Asthma-M 


8.0 


3.1 


1 04692 (BA) OA 
Cartilage-Backus 


12.1 


6.8 


111001 
Asthma-F 


21.8 


15.8 


104694 (B A) OA 
Bone-Backus 


51.8 


36.3 


111002 


24.0 


13.1 


104695 (BA) Adj 


26.4 


19.5 
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Asthma-F 






"Normal" Bone- 
Backus 






111003 Atopic 
Asthma-F 


203 


22.5 


1 u4oyo (p A ) UA 
Synovium-Backus 


63.7 


40.9 


111004 Atopic 
Asthma-F 


26.1 


28.5 


1 04700 /'QQ'k HA 
I U*t /v/l/ ) \J/\ 

Bone-Backus 


20.9 


62.0 


111005 Atopic 
Asthma-F 


24.5 


10.3 


1 C\A1(\\ AA\ 
1 \JH IK) 1 ^oo ) AQJ 

"Normal" Bone- 
Backus 


28.7 


25.7 


11 1006 Atopic 
Asthma-F 


11.0 


8.1 


104702 (SS) OA 
Synovium-Backus 


80.1 


59.5 


111417 
Allergy-M 


8.7 


7.7 


117093 OA Cartilage 
Rep7 


17.8 


20.9 


112347 
Allergy-M 


0.0 


0.0 


112672 OA Bone5 


36.6 


33.9 


11 2349 Normal 
Lung-F 


0.0 


0.0 


112673 OA 
Synoviums 


21.3 

I 


22.4 


112357 Normal 
Lung-F 


67.4 


64.6 


112674 OA Synovial 
Fluid cellsS 


20.3 


20.0 


112354 Normal 
Lung-M 


9.8 


15.6 


1 1 / 1 uu ua cartilage 
Repl4 


9.6 


6 ± | 


jl 12374 
Crohns-F 


.... 


15.0 


1 12756 OA Bone9 


95.3 


66.0 


112389 Match 

Control 

Crohns-F 


10.4 


lO.T 


112757 OA 
Synovium9 


17.4 


19.2 


112375 
Crohns-F 


14.5 


9.9 


i Jz/Do ua synovial 
Fluid Cells9 


14.1 


17.8 


112732 Match j 

Control 

Crohns-F 


29.3 




117125 RA Cartilage 
Rep2 


inn 


22.5 


112725 
Crohns-M 


5.1 


3.6 


113492 Bone2 RA 


76.3 


66.0 


112387 Match 

Control 

Crohns-M 


14.4 


15 7 


113493 Synovium2 
RA 




19.3 


112378 
Crohns-M 


O.O ;< 


O.O 


1 1 1AQA Q\m Plni/4 

i kj^yH oyn riuia 
Cells RA 


48.0 


43.2 


112390 Match 

Control 

Crohns-M 


33.7 


*8.6 


H3499Cartilage4 
RA 


40.3 


49.7 


112726 
Crohns-M 


>5.0 


19.9 


113500 Bone4RA i 


53.3 


50.3 


112731 Match . 
Control 


19.1 


16.8 } 


1 13501 Synovium4 
RA 


44.8 


J6.6 
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Crohns-M 












112380 Ulcer 
Col-F 


14.8 


15.9 


l \jj\jL byn rluid 
Cells4 RA 


33.7 


21.5 


112734 Match 
Control Ulcer 
Col-F 


74 2 


SO Q 

J7.7 


1 13495 Cartilage3 
RA 


48.3 


29.3 


1 12384 Ulcer 
Col-F 


126.2 

! 


28.9 


1 13496 Bone3 RA 


51.8 


45.1 


112737 Match 
Control Ulcer 
Col-F 


8.8 


4 I 


1 13497 Synoviiim3 
RA 


1 1 .9 


2^.o 


1 12386 Ulcer 
Col-F 


11.8 


10.9 


i ii/ioo c«*t-i ci.w/i 
1 j oyn rluid 

Cells3 RA 


44.1 


47.6 . 


1 12738 Match 
Control Ulcer 
Col-F 


27.0 


15.0 


117106 Normal 
Cartilage Rep20 


2.0 


5.3 


1 12381 Ulcer 
Col-M 


0.0 


0.0 


113663 Bone3 
Normal 


0.0 


0.2 


1 12735 Match 

Control 1 Mppr 

Col-M 

j 


j 

63 !6.5 

t 

! 


1 13664 Synoviums 
Normal 


0.0 


0.0 


1 1238^ Ulcer 
Col-M 


13.5 jl0.8 


1 13665 Syn Fluid 1 
Cells3 Normal j 0 ' 0 


0.0 


112394 Match 
Control Ulcer 
Col-M 


1.6 


7.7 


117107 Nonnal 
Cartilage Rep22 


10.6 


1 1.8 


1 123.83 Ulcer 
Col-M 


18.7 


26.6 


1 13667 Bone4 
Normal 


10.1 


8.4 


112736 Match 
Control Ulcer 
Col-M 


12.9 


6.7 


1 13668 Synovium4 
Normal 


10.6 


8.1 


1 12423 
Psoriasis-F 


20.7 


18.6 

i 


1 13669 Syn Fluid 
Cells4 Normal 


15.8 


21.3 



Table YD . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3909, Run 
212248424 


Tissue Name 


Rel. Exp.(%) Ag3909, 
Run 212248424 


AD 1 Hippo 


11.7 


Control (Path) 3 Temporal Ctx 


5.6 


AD 2 Hippo 


41.5 


Control (Path) 4 Temporal Ctx 


33.7 


AD 3 Hippo 


7.6 


AD 1 Occipital Ctx 


12.5 


AD 4 Hippo 


7.2 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


75.3 


AD 3 Occipital Ctx 


1.1 


AD 6 Hippo 


36.1 


AD 4 Occipital Ctx 


23.7 
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Control 2 Hippo 


19.1 


AD 5 Occipital Ctx 


29.9 


Control 4 Hippo 


12.9 


AD 6 Occipital Ctx 


35.4 


Control (Path) 3 
Hippo 


12.2 


Control 1 Occipital Ctx 


9.1 


AD 1 Temporal 
Ctx 


9.5 


Control 2 Occipital Ctx 


69.7 


AD 2 Temporal 
Ctx 


36.3 


Control 3 Occipital Ctx j 1 8.9 


AD 3 Temporal 
Ctx 


3.7 


Control 4 Occipital Ctx 


|3.9 
i 


AD 4 Temporal 
Ctx 


31.9 


Control (Path) 1 Occipital Ctx 


71 2 


AD 5 Inf 
Temporal Ctx 


100.0 


Control (Path) 2 Occipital Ctx 


31 9 


AD 5 

SupTemporal Ctx 


56.6 


Control (Path) 3 Occipital Ctx 




AD 6 Inf 
Temporal Ctx 


22.4 


Control (Path) 4 Occipital Ctx 


23.0 


AD 6 Sup 
Temporal Ctx j 


32.3 


Control 1 Parietal Ctx 

i 


8.5 


Control 1 | 
Temporal Ctx 


5.3 


Control 2 Parietal Ctx 

i 


38.2 


Control 2 
Temporal Ctx 


42.9 


Control 3 Parietal Ctx 


29.1 


Control 3 
Temporal Ctx 


13.0 i 


Control (Path) 1 Parietal Ctx 


73.2 


Control 4 

Temporal Ctx j' 


\A ( 


Control (Path) 2 Parietal Ctx - 


♦4.4 


Control (Path) 1 ; t 
Temporal Ctx * 


)8.2 < 


:ontrol (Path) 3 Parietal Ctx ' 


7.3 


Control (Path) 2 
Temporal Ctx 


H.8 ( 


:ontrol (Path) 4 Parietal Ctx t 


19.0 



Table YE. General_screening_panel_vl.4 



Tissue Name 


Rel. 

Exp.(%) 
Ag3909, 
Run 

2172358 
26 


Rel. 

Exp.(%) 
Ag3909, 
Run 

219173644 


Tissue Name 


Rel. 

Exp.(%) 
Ag3909, 
Run 

217235826 


Rel. 

Exp.(%) 
Ag3909, 
Run 

2191736 
44 


Adipose {o.6 


0.6 


Renal ca.TK-10 


0.0 


0.0 


Melanoma* Hs688(A).T jo.O 


0.0 


Bladder 


9.7 


11.7 


Melanoma* Hs688(B).T jo.O 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


100.0 


100.0 
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ivjidaiivjllla 1V1 1 *+ 


U.J 


U.4 


Gastric ca. KATO III 


0.5 


0.6 


Melanoma* LOXIMVI 


0.1 


0.2 


Colon ca. SW-948 


0.5 


0.5 


Melanoma* SK-MEL-5 


0.2 


0.2 


Colon ca. SW480 


0.1 


0.1 


Squamous cell carcinoma 
SCC-4 


0.5 


0.5 


Colon ca.* (SW480 
met) SW620 


0.0 


0.0 


Testis Pool 


u.o 


U.j 


Colon ca. HT29 


0.1 


0.1 


Prostate ca * (heme* mRt\ 
PC-3 


0.1 


0.1 


Colon ca. HCT-I 16 


0.3 


0.3 

J 1 


Prostate Pool 


0.3 


0.3 


Colon ca. CaCo-2 


0.1 


0.0 i 


Placenta 


0.4 


0.5 


Colon cancer tissue 


0.6 


0.5 


Uterus Pool 


0.2 


0.2. 


Colon ca. SW1I16 


0.2 


0 1 


Ovarian ca. OVCAR-3 


0.2 


0.2 


Colon ca. Colo-205 


0.0 


0 0 


Ovarian ca. SK-OV-3 


0;5 


0.4 


Colon ca. SW-48 


0.0 


0 0 


Ovarian ca. OVCAR-4 


0.1 


0.1 


Colon Pool 


0.4 


U.-> 


Ovarian ca. OVCAR-5 


0.6 


0.5 


jSmall Intestine Pool 


0.4 


0 4 


Ovarian ca. IGROV-! 


0.0 


0.0 . 


Stomach Pool 


0.1 


0 3 

V.J 


jOvarian ca. OVCAR-8 


0.0 


0.0 


Bone Marrow Pool «0.2 

i 


0 0 


{Ovary 


0.8 0.6 jFetal Heart jo.2 


1 

0 ? 1 


iBreast ca. MCF-7 


0.0 |0.0 'Heart Pool 


0.5 


f 0 5 ! 


Breast ca. MDA-MB-23 1 


0.2 j0.2 [Lymph Node*Poo! 


0.4 


0.4 ] 


Breast ea RT S40 

ui toot \»a. 1_> 1 J*t7 


2.3 


3.1 


Fetal Skeletal Muscle 


0.2 


0.2 


Breast ca. T47D 


1.1 


1.0 


Skeletal Muscle Pool 


1.7 


1.9 


Breast ca. MDA-N 


0.2 


0.2 


Spleen Pool 


2.3 


2.6 


Breast Pool 


0.3 


0.4 


Thymus Pool 


0.5 


0.5 


Trachea 


0.9 


1.0 


CNS cancer (glio/astro) 
U87-MG 


0.0 


0.0 


Lung 


0.1 


0.1 


CNS cancer (glio/astro) 
U-118-MG 


1.0 


1.3 


Fetal Lung 


0.9 


0.9 


CNS cancer 
(neuro;met) SK-N-AS 


1.2 


1.5 


Lungca.NCI-N417 < 


O.O 


0.0 


CNS cancer (astro) SF- 
539 j 


0.3 


0.4 


Lungca. LX-1 i 


3.0 l 


).o ; 


CNS cancer (astro) 
SNB-75 1 


3.1 


0.1 


Lung ca.NCI-H 146 ( 


).9 


! 


CNS cancer (glio) 
5NB-19 ' 


).0 1 


3.0 


Lung ca. SHP-77 ( 


).5 ( 


). 5 : 


CNS cancer (glio)SF- 
>95 ( 


).2 ( 


).2 


Lung ca. A549 ( 


).0 ( 


).0 1 


Brain (Amygdala) Pool ( 


).6 ( 


).5 


Lungca.NCI-H526 jO.O 0.0 Brain (cerebellum) ( 


).l ( 


).2 
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Lungca.NCI-H23 


0.0 * 


0.0 


Brain (fetal) 


0.8 


0.7 


Lung ca. NCI-H460 


0.0 


0.0 


Brain (Hippocampus) 
Pool 


0.5 


0.5 


Lung ca. HOP-62 


0.0 j 


0.0 


Cerebral Cortex Pool 


0.7 


0.7 


Lung ca. NCI-H522 


j 

0.0 


0.0 


Brain (Substantia 
ni^ra^ Pool 

ill ^ 1 u j i yj\j i 


0.8 


0.7 


Liver 


0, | 


0.1 


Rrain /TTiJilamiiO Pool 




n o 

U. 7 


Fetal Liver 


4.7 | 


4.1 


Rrziin Cwholf*^ 

U 1 CI 1 1 1 ^ VVUVJI^^ 


1 3 


1 ~i 


Liver ca. HepG2 


0.0 | 


0.0 


Spinal Cord Pool 


0.4 


0.5 


Kidney Pool 


0.7 | 


0.8 


Adrenal Gland 


0.7 


0.8 


rcid.1 iviuncY 


0.2 j 


0.2 


Pituitary gland Pool 


0.3 


0.3 


Renal ca. 786-0 


o.o jo.o 


Salivary Gland 


0.5 


0.4 


Renal ca. A498 


0.2 j0.2 


Thyroid (female) 


0.4 


0.2 


Renal ca. ACHN 


1 

0.0 j 


0.0 


Pancreatic ca. 
CAPAN2 




0.3 


0.3 


Renal ca. UO-3 1 


0.1 | 


0.0 


Pancreas Pool 


0.4 


0.3 



Table YF. Panel 1 3D 



Tissue Name 


Rel. Exp.(%) 
Agl980, Run 
165534458 


Tissue Name 


jRel. Exp.(%) 
;Agl980, Run 
j 165534458 


Liver 

adenocarcinoma 


1.4 


Kidney (fetal) 


i 

'0.4 

! 


Pancreas 


0.4 


Renal ca. 786-0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


Renal ca. A498 


2.0 


Adrenal gland 


0.6 


Renal ca. RXF393 


0.7 


Thyroid 


0.7 


Renal ca. ACHN 


0.0 


Salivary gland 


1.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.5 


Renal ca. TK-10 


0.1 


Brain (fetal) 


0.8 


Liver 


0.5 


Brain (whole) 


3.8 


Liver (fetal) 


5.3 


Brain (amygdala) 


2.9 


Liver ca. (hepatoblast) 
HepG2 


0.0 


Brain (cerebellum) 


0.6 


Lung 


1.5 


Brain (hippocampus) 


3.2 


Lung (fetal) 


2.5 


Brain (substantia 
nigra) 


1.6 


Lung ca. (small cell) LX-1 


0.0 


Brain (thalamus) 


5.1 


Lung ca. (small cell) NCI- 
H69 


0.4 


Cerebral Cortex 


1.9 


Lung ca. (s.cell var.) SHP- 
77 


0.5 
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Spinal cord 


1.6 


Lung ca. (large cell)NCI- 
H460 


0.0 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. cell) 
A549 


0.0 


glio/astro U-l 18-MG 


1.8 


Lung ca. (non-s.cell) NCI- 
H23 


0.0 


astrocytoma SW1 783 


0.0 


Lung ca. (non-s.cell) HOP- 
62 


0.0 


neuro*; met SK-N- 
AS 


1.1 


Lung ca. (non-s.cl) NCI- 
H522 


0.0 


astrocytoma SF-539 


7.9 


Lung ca. (squam.) SW 900 


1.5 


astrocytoma SNB-75 


1.7 


Lung ca. (squam.) NCI- 


0.1 


glioma SNB-19 


0.0 


Mammary gland 


0.7 


glioma U251 


0.7 


Breast ca.* (pl.ef) MCF-7 


0.1 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) MDA- 
MB-231 


0.3 


Heart (fetal) 


0.1 


Breast ca.* (pl.ef) T47D 


0.0 

i 


Heart jl.2 


Breast ca. BT-549 jl.l 


Skeletal muscle L rt 
(fetal) ! 00 


Breast ca. MDA-N ko j 


jSkeletal muscle 


1.0 


Ovary jo.3 


Bone marrow |4.5 


Ovarian ca. OVCAR-3 jO. 1 


Thymus 


1.2 


Ovarian ca. OVCAR-4 


0.1 


Spleen 


1.4 


Ovarian ca. OVCAR-5 


0.3 


Lymph node 


2.9 


Ovarian ca. OVCAR-8 


0.0 . 


Colorectal 


0.3 


Ovarian ca. IGROV-1 


0.0 


Stomach 


1.4 


Ovarian ca.* (ascites) SK- 
OV-3 


0.4 


Small intestine 


1.2 


Uterus 


1.8 


Colon ca. SW480 


O.O 


Placenta 


1.3 


Colon ca.* 
SW620(SW480 met) 


o.o 


Prostate 


0.3 


Colon ca. HT29 1 


3.0 


Prostate ca.* (bone met)PC- 
3 1 


3.1 


Colon ca.HCT-1 16 ( 


3.1 


restis 


1.2 


Colon ca. CaCo-2 ( 


).0 ] 


Melanoma Hs688(A).T ( 


3.1 


Colon ca. 
tissue(OD03866) 


: 


vlelanoma* (met) 
rls688(B).T ( 


3.1 


Colon ca. HCC-2998 1 


2.0 J 


Vlelanoma UACC-62 


.0 


Gastric ca.* (liver ] 


00.0 [ 


Melanoma Ml 4 ( 


).3 
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met) NCI-N87 








Bladder 


8.2 


Melanoma LOX IMVI 


0.1 


Trachea 


2.0 


Melanoma* (met) SK-MEL- 
5 


0.1 


Kidney 


0.4 


Adipose 


0.6 



Table YG. Panel 2D 



1 

i 

Tissue Name 


Rel. Exp.(%) 
Agl980, Run 
169484147 


iRel. Exp.(%) 
Tissue Name iAgl980, Run ; 

! 169484 147 ! 


Normal Colon 


14.7 


Kidney Margin 8120608 


jl.l 


CC Well to Mod Diff 
(0DO3866) 


3.2 


Kidney Cancer 8120613 


1.4 


CC Margin 
(0DO3866) 


6.3 


Kidney Margin 8120614 


1.5 


CC Gr.2 rectosigmoid 
(0DO3868) 


3.5 


Kidney Cancer 9010320 


6.2 


CC Margin 
(OD03868) 


3.3 


Kidney Margin 90 1 032 1 |2.6 


iCC Mod Diff 
;(ODO3920) 

jCC Margin 
KODO3920) 


1.3 
1.5 


Normal Uterus jl.2 j 

L .„ 1 j 

i 1 

lUterus Cancer 064011 3.4 j 


CC Gr.2 ascend colon 
(0DO3921) 


3.7 


i _ 

Normal Thyroid 


'4.7 


CC Margin 
(0DO3921) 


1.8 


Thyroid Cancer 064010 


12.1 


CC from Partial 
Hepatectomy 
(ODO4309) Mets 


5.0 


Thyroid Cancer A302 1 52 


1.6 


Liver Margin 
(ODO4309) 


6 7 


i 

Thyroid Margin A302153 J4.4 

I 


Colon mets to lung 
(OD04451-01) 


5.0 


Normal Breast 


4.5 


Lung Margin 
(OD04451-02) 


8.4 


Breast Cancer (OD04566) 


8.1 


Normal Prostate 
6546-1 


5.2 


Breast Cancer (OD04590- 
01) 


11.8 


Prostate Cancer 
(OD04410) 


9.1 


Breast Cancer Mets 
(OD04590-03) 


7.9 


Prostate Margin 
(OD04410) 


3.1 


Breast Cancer Metastasis 
(OD04655-05) 


5.3 


Prostate Cancer 
(OD04720-01) 


3.2 


Breast Cancer 064006 


18.0 
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Prostate Margin 
(OD04720-02) 


6.4 


Breast Cancer 1 CY)A 


Z. / 


Normal Lung 061010 


14.8 


Breast Cancer 9 1 00266 


6.2 


Lung Met to Muscle 
(OD04286) 


3.4 


Breast Margin 9100265 


1.9 


Muscle Margin 
;(OD04286) W 


2.8 


Breast Cancer A209073 


6.5 

1 ■ 
; j 


;Lung Malignant 
jCancer (OD03126) 

r — ' — ■ 


5.2 


Breast Margin A209073 


i.5 ; 

! 


Lung Margin 
(OD03126) 


21.5 


Normal Liver 


{ j 

2.3 


Lung Cancer 
(OD04404) 


53.6 


Liver Cancer 064003 


6.2 


Lung Margin 
(OD04404) 


6.8 


Liver Cancer 1025 


6.5 


Lung Cancer 
i(OD04565) 

i ■ ■ 


6.8 


Liver Cancer 1 026 


2.1 


ILung Margin 
;(OD04565) j 


3.6 


Liver Cancer 6004-T 


6.0 


Lung Cancer | 
! (OD04237-01) ! 1UI 


Liver Tissue 6004-N 


2.9 I 

i 



| Lung Margin 
, (OD04237~-02) 



24.5 



jOcuIar Mel Met to 
Liver (OD043 10) 



Liver Margin 
(ODO4310) 



3.5 



Melanoma Mets to. 
Lung(OD04321) 



Lung Margin 
(OD04321) 



0.2 



1.8 



Liver Cancer 6005-T 



Liver Tissue 6005-N 



Normal Bladder 



3.3 



1.7 



100.0 



Bladder Cancer 1023 



10.3 



Normal Kidney 



11.4 



Bladder Cancer A3021 73 



Bladder Cancer (OD0471 8- 
01) 



0.6 



6.3 



74.2 



— 1 
i 

! 



Kidney Ca, Nuclear 
grade 2 (OD04338) 



5.3 



Bladder Normal Adjacent 
(OD047 18-03) 



5.1 



Kidney Margin 
(OD04338) 



4.2 



Normal Ovary 



3.5 



Kidney Ca Nuclear 
grade 1/2 (OD04339) 



1.5 



Ovarian Cancer 064008 



Kidney Margin 
(OD04339) 



Kidney Ca, Clear cell 
type (OD04340) 



4.6 

ii.i 



Ovarian Cancer (OD04768- 
07) 



Ovary Margin (OD04768- 
08) 



11.7 



99.3 



0.9 



Kidney Margin 



3.5 



Normal Stomach 



10.7 
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(OD04340) 








Kidney Ca, Nuclear 
grade 3 (OD04348) 


59.9 


Gastric Cancer 9060358 


3.0 


Kidney Margin 
(OD04348) 


81.2 


Stomach Margin 9060359 


9.9 


Kidney Cancer 
;(OD04622-01) 


6.1 

! 
i 


Gastric Cancer 9060395 


6.9 


Kidney Margin i 
(OD04622-03) ; 1 


Stomach Margin 9060394 


7.7 


Kidney Cancer ! 
;(OD04450-01) I 1 " 5 


Gastric Cancer 9060397 


7.2 


Kidney Margin j 
(OD04450-03) ! L7 


Stomach Margin 9060396 


10.4 

i 


.Kidney Cancer 
18120607 


0.9 


Gastric Cancer 064005 


19.9 j 


Tabic YH. Panel 4. ID 


j ;ReI. Exp.(%) 
Tissue Name ! Ag3909. Run 
.170127176 


|Rel. Exp.(%) ! 
Tissue Name |Ag3909. Run j 

1170127176 : 


^Secondary Thl act !0.9 


HUVEC IL- 1 beta jo.l | 


Secondary Th2 act 32.3 


HUVEC I FN gamma 


0.9 ! 


^Secondary Trl act 3.6 


HUVEC TNF alpha + IFN 
gamma 


10.5 j 


Secondary Thl rest j5.5 


HUVECTNF alpha + IL4 


. 1 

0.4 


Secondary J h2 rest 


2.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


4.8 


Lung Microvascular EC 
none 


0.2 


Primary Thl act 


1.3 


Lung Microvascular EC 
TNFa!pha + IL-1 beta 


0.6 


Primary Th2 act 


2.0 


Microvascular Dermal EC 
none 


0.1 


Primary Trl act 


1.1 


Microsvasular Dermal EC 
TNFalpha* IL-1 beta 


0.4 


Primary Thl rest 


3.5 


Bronchial epithelium 
TNFalpha + lLlbeta ; 


0.0 


Primary Th2 rest 


0.3 

i 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


2.5 : 


Small airway epithelium 
rNFalpha + lL-lbeta 


3.0 


CD45RA CD4 ~[ 
lymphocyte act 


9.3 ( 


Zoronery artery SMC rest ( 


).0 


CD45RO CD4 
lymphocyte act 


12.4 J 


Doronery artery SMC 
rNFalpha+ IL-1 beta 


).0 
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CD8 lymphocyte act 


1.2 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


4.7 


Astrocytes TNFalpha + IL- 
lbeta 


0.5 


Secondary CD8 
lymphocyte act 


0.7 


KU-812 (Basophil) rest 


1.4 


CD4 lymphocyte 
none 


0.5 


KU-8 12 (Basophil) 
PMA/ionomycin 


4.7 


2ry 

Thl/fh2/Trl anti- 
CD95CHI1 


1.2 


CCD 1 1 06 (Keratinocytes) j 4 
none 

i 






CCD 1 1 06 (Keratinocytes) j 
TNFalpha + IL-1 beta \ L/ 


LAK cells IL-2 


7.5 


Liver cirrhosis |o. 1 


LAK cells IL-2+IL- 
12 


7.1 


NC1-H292 none 


.... _ 

0.1 


LAK cells IL-2+IFN 
gamma 


9.5 


NCI-H292 1L-4 jo.3 


LAK cells IL-2+ 1L- 
18 


11.7 


NCI-H292 1L-9 10.4 


jLAK cells ; 6 
j PMA/ionomycin } 


NCI-H292IL-13 10.5 


INK Cells IL-2 rest ;11.3 


NCI-H292IFN gamma |2.0 


jTwo Way MLR 3 day 


14.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


5.3 


HPAECTNF alpha + IL-1 
beta 


2.9 


Two Way MLR 7 day 


1.7 


Lung fibroblast none 


0.0 


PBMC rest 


0.4 


Lung fibroblast TNF alpha + 
IL-1 beta 


9.5 


PBMC PWM 


2.7 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


1.1 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-1 3 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN gamma 


3.6 


B lymphocytes PWM 


0.9 


Dermal fibroblast CCD 1070 
rest 


0.0 


B lymphocytes 
CD40L and 1L-4 


0.5 


Dermal fibroblast CCD 1070 
TNF alpha 


1.1 


EOL-1 dbcAMP 


0.6 


Dermal fibroblast CCD 1070 
IL-1 beta 


0.8 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


6.3 


Dendritic cells none j 


0.6 


Dermal fibroblast IL-4 


0.3 
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{Dendritic cells LPS 


20.0 


Dermal FihrnHlactc ™*ct 


n 1 


jDendritic cells anti- 
[CD40 


0.3 


Neutrophils TNFa+LPS 


0.1 


[Monocytes rest 

p= — — — - — . 


2.7 


Neutrophils rest 


0.3 


(Monocytes LPS 


100.0 


Colon 


0.5 


[Macrophages rest 


0.9 


Lung 


0.9 


(Macrophages LPS 


36.9 


Thymus 


1.0 


jHUVEC none 


0-0 |Kidney 




jHUVEC starved 


0.. | 




Table YI. Panel 4D 




Tissue Name 


Re!. Exp.(%) 
Agl980, Run 
161683379 




Rel. Exp.(%) 
Agl980, Run 

1 C t £L 0 "» *> nr\ 


Secondary Th 1 act 


1.0 


n u v tz,\s i l>- i oeta 


i\ i 
U. i 


Secondary Th2 act 


52.5 


HUVEC I FN gamma 


2.4 


Secondary Trl act 


9.4 


HUVEC TNF alpha + IFN 
gamma 


29.5 


Secondary Thl rest 18.9 


HUVEC TNF alpha + IL4 


1.0 : 


Secondary Th2 rest j 1 .3 


HUVEC IL- 11 


0.0 ,' 


Secondary Trl rest :!0.5 

* 


Lung Microvascular EC 
none 


0.1 i 


Primary Thl act 


4.5 


Lung Microvascular EC 
TNFalpha + lL-lbeta 


1.2 


Primary Th2 act 


3.4 


Microvascular Dermal EC 
none 


0.3 


Primary Trl act 


4.5 


Microsvasular Dermal EC 
TNFalpha + lL-lbeta 


1.2 


Primary Thl rest 


34.2 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


3.5 


Small airway epithelium 
none 


0.0 


Primary Trl rest : 


5.8 


Small airway epithelium 
rNFalpha + IL-lbeta 




CD45RA CD4 
lymphocyte act 


11.1 < 


"oronery artery SMC rest 


1 ft 


CD45RO CD4 
lymphocyte act 1 


1.7 I 


^oronery artery SMC 
rNFalpha + IL-lbeta 


\ n 


CD8 lymphocyte act ] 


•2 / 


Astrocytes rest ( 


).0 


Secondary CD8 
lymphocyte rest S 


>.4 ' 
1 


Astrocytes TNFalpha + IL- 
beta 1 


.7 


Secondary CD8 j 
lymphocyte act 2 


.1 i 


CU-812 (Basophil) rest C 


i.o 
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CD4 lymphocyte 
none 


0.8 


KU-8 12 (Basophil) 
PMA/ionomycin 


15.1 


2ry 

Thl/Th2/Trl_anti- 
CD95 CHI 1 


1.8 


CCD1106 (Keratinocytes) 
none 


0.8 


LAK cells rest 


6.0 


CCD 1 106 (Keratinocytes) 
TNFaIpha + IL-lbeta 


i j 


LAK cells IL-2 


19.5 


Liver cirrhosis 


]0.3 


LAK cells 1L-2+IL- 
12 


12.4 


Lupus kidney 


i 

|0.2 

5 


LAK cells IL-2+IFN 
gamma 


53.2 




0 7 


LAK cells IL-2+ IL- 
18 


61.1 


NC1-H292 IL-4 


1.6 


LAK cells 

PM A/ifinnmvr in 


12.4 


NCI-H292 JL-9 


2.2 


NK Celk IT -? rt><zt 

l>rv V-Cllb 1 L/~Z, I Col 


\A 1 
I H. / 




0.4 , 


Two Way MLR 3 day 


29.1 


NC1-H292 I FN gamma 


6.2 


Two Way MLR 5 day 


7.9 


HPAEC none 


0.0 ! 


Two Way MLR 7 day 


3.4 


HPAEC TNF alpha + 1L-1 
beta 


0.0 ! 


PBMC rest 


1.3 


Lung fibroblast none 


0.0 | 


PBMC PWM 


17.8 


Lung fibroblast TNF alpha + 

Fl 1 koto 

il-j oeta 


17.9 


PBMC PHA-L 


3.3 ■ 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


1 imo flhrnhlfiQt TT -1 "\ 


n n 


B lymphocytes PWM 


12.6 


Lung fibroblast IFN gamma 


11.3 


B lymphocytes 
CD40L and IL-4 


2.7 


Dermal fibroblast CCD1070 
rest 


0.1 


EOL-1 dbcAMP 


0.1 


Dermal fibroblast CCD 1070 
TNF alpha 




EOL-1 dbcAMP 
PMA/ionomycin 


0.1 


Dermal fibroblast CCD 1070 
1L-1 beta 


n 

U.J 


Dendritic cells none 


1.1 


Dermal fibroblast IFN 

LUl till 1C4 


9.1 


Dendritic cells LPS 


48.6 


Dermal fibroblast IL-4 


0.4 


Dendritic cells anti- 
CD40 


0.4 


IBD Colitis 2 


0.1 


Monocytes rest 


5.7 


IBD Crohn's 


0.3 


Monocytes LPS 


100.0 


Colon 


3.6 
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Macrophages rest 


1.3 


Lung 


3.6 


Macrophages LPS 


79.6 


Thymus 


0.8 


HUVEC none 


0.0 


Kidney 


3.0 


HUVEC starved 


0.2 







Table YJ. Panel 5 Islet 



I 

Tissue Name i 


lei. Exp.(%) 
\g3909, Run 

.Hz.^r i j i yy 


1, 
1 

["issue Name \' 


vel. bxp.(vo; 
\g3909, Run 
142413199 


97457_Patient- ( 
02go_adipose 


( 

>0.1 

1 


)4709_Donor 2 AM - ( 
\_adipose 


).0 


97476J>atient- i, 
U /sk sKeietai muscie 


( 

ill ; 


M710_Donor2 AM - 
B_adipose 


1.8 


97477JPatient- 
07ut_uterus 


18.2 ( 


?4711J3onor2AM- 
C_adipose 


0.0 


97478_Patient- 
07pl_placenta 


53.6 


94712_Donor 2 AD - 
A adipose 


1.6 


99167_Bayer Patient 
1 


100.0 


94713_Donor 2 AD - 
B_adipose 


1.2 


97482_Patient- 
08ut_uterus. 


44.8 


947 1 4_Donor 2 AD - 1 
C_adipose 


0.0 


97483_Patient- 
Uopl_placenta 


74.2 


94742_Donor3U- | 3 , j 
A Mesenchymal Stem Cells f' 


97486_Patient- 
uysK_sKeieiai muscie 


9.3 


94743_Donor 3 U - 
B_MesenchymaI Stem Cells 


3.7 


97487_Patient- 
09ut_uterus 


24.1 


94730JDonor 3 AM - 
Aadipose 


1.7 


97488_Patient- 
jypi_jD lacenia 


44.1 


9473 1 JDonor 3 AM - 
B_adipose 


3.6 


97492 J>atient- 
10ut_uterus 


47.0 


94732 JDonor 3 AM - 
C_adipose 


0.0 


97493 Patient- 
10pl__placenta 


68.3 


94733_Donor3 AD- 
A_adipose 


0.0 


97495_Patient- 
llgo_adipose 


17.4 


94734_Donor3 AD- 
B_adipose 


0.0 


97496_Patient- 

1 lsk^_skeletal muscle 


22.8 


94735J)onor3 AD - 
C_adipose 


jo.o 


97497_Patient- 
1 1 ut_uterus 


24.3 


77138 Liver HepG2untreat 
ed 


1.8 


97498_Patient- 
1 lpl_placenta 


15.4 


73556_Heart_Cardiac 
stromal cells (primary) 


0.0 


97500JPatient- 
12go_adipose 


62.4 


8t735_Small Intestine 


97.3 


97501 Patient- 


79.6 


72409JCidney_Proximal 


5.6 
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12sk_skeletal muscle 




Convoluted Tubule 




97502 Patient- 
12ut_uterus 


30.6 


OZQOJ Ollldl 1 

intestineDuodenum 


54.3 


97503 JPatient- 
12pl_placenta 


29.7 


90650_AdrenaI_Adrenocorti 

pal oH pnnm o 


10.0 


94721_Donor2U~ 
A_Mesenchymal 
Stem Cells 


0.0 


72410_Kidney_HRCE 


8.3 


94722_Donor2U- 
B_Mesenchymal 
Stem Cells 


0.0 


724!l_Kidney_HRE 


28.3 


94723 _Donor 2 U - 
CMesenchymal 
Stem Cells 


0.0 


73139_UterusJJterine 
smooth muscle cells 


0.0 



same probe and primer produce results that are in excellent agreement, with highest 
expression in normal tissue adjacent to psoriasis (CTs=30.5-3 1 .2). This target is induced in 
bone tissue, synovial fluid, synovial fluid cells and synovium from arthritis patients 
(rheumatoid-RA and osteoarthritis-OA); In addition, the expression of this transcript in 
these samples from normal patients is much lower. Other tissues including skin and lung 
also express this transcript. However, a consistent expression in diseased tissue, as 
compared to adjacent tissue or normal lung, is not apparent. This may be due to 
contamination with activated monocytes which highly express this transcript (see panel 
4. ID) 

CNS_neurodegeneration_vLO Summary: Ag3909 This panel does not show 
differential expression of the CG94235-01 gene in Alzheimer's disease. However, this 
expression profile confirms the presence of this gene in the brain. Please see Panel 1.4 for 
discussion of utility of this gene in the central nervous system. 

General_screening_panel_vL4 Summary: Ag3909 Two experiments with the 
same probe and primer produce results that are in excellent agreement, with highest 
expression of the CG94235-01 gene in a gastric cancer cell line (CTs=23.6-24.4). Thus, 
expression of this gene could be used as to differentiate this sample from other samples on 
this panel and as a marker of gastric cancer. This gene encodes a putative thymidylate 
kinase, a DNA synthesis enzyme necessary for cell growth. Thus, therapeutic modulation 
of the expression or function of this gene product may be useful in the treatment of gastric 
cancer. 
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Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic disorders. 
5 Dysregulated expression of this gene may contribute to neuroendocrine disorders or 
metabolic diseases, such as obesity and diabetes. 

In addition, this gene is expressed at much higher levels in fetal lung, liver and 
skeletal muscle tissue (CTs=28-30) when compared to expression in the adult counterpart 
(CTs=32.5-35). Thus, expression of this gene may be used to differentiate between the 
1 0 fetal and adult source of these tissues. 

This gene is also expressed at moderate to low levels in the CNS, including the 
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 
Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
1 5 schizophrenia, multiple sclerosis, stroke and epilepsy. 

Panel 1.3D Summary: Agl980 Highest expression of the CG94235-01 gene in 
this panel is seen in a gastric cancer cell line (CT=26). Overall, expression is in reasonable 
agreement with the results in Panel 1.4. Moderate to low levels of expression are seen in 
metabolic tissues including adipose, adult and fetal liver, skeletal muscle, heart, pituitary, 
20 thyroid, adrenal and pituitary. Moderate to low levels of expression are seen in all CNS 
regions examined. 

In addition, higher levels of expression are seen in fetal liver (CT=30.2) when 
compared to expression in adult liver (CT=33.7). Thus, expression of this gene could be 
used to differentiate between the adult and fetal sources of this tissue. 

25 Panel 2D Summary: Agl 980 Highest expression of the CG94235-01 gene is seen 

in normal bladder (CT=273). In addition, higher levels of expression are seen in ovarian, 
bladder and lung cancers when compared to expression in normal adjacent tissue. Thus, 
expression of this gene could be used as a marker of these cancers. Furthermore, 
therapeutic modulation of the expression or function of this gene product may be useful in 

30 the treatment of ovarian, bladder and lung cancers. 

Panel 4.1D Summary: Ag3909 Highest expression of the CG94235-01 gene is 
seen in LPS treated monocytes. (CT=25.4). Prominent levels of expression are also seen in 
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LPS activated macrophages and dendritic cells. This transcript may encode a protein that 
is important in the normal regulation of cytokines. Inappropriate regulation of the protein 
encoded by this gene may result in the enhanced and uncontrolled expression of 
inflammatory cytokines. Therefore, designing therapies that could regulate the expression 
(such as antisense therapies) or function of the protein encoded by this gene may be 
important in the treatment of osteoarthritis and rheumatoid arthritis as well as other 
diseases. 

Panel 4D Summary: Agl980 The expression profile of the CG94235-01 gene in 
this panel is similar to the expression pattern seen in Panel 4.1 D using the Ag3909 probe 
and primer set, except it is also expressed in LAK cells. This may reflect differences 
between panel 4 and 4.1 or slightly different amplicons. Please see panel 4. ID for 
discussion of utility of this gene in inflammation. 

Panel 5 Islet Summary: Ag3909 Highest expression of the CG94235-01 gene is 
seen in islet cells (CT=33.4). Low but significant levels of expression are seen in other 
metabolic tissues, including adipose, placenta and skeletal muscle. Please see Panel 1.4 for 
discussion of utility of this gene in metabolic disease. 

General oncology screening panel_v_2.4 Summary: Ag3909 Results from one 
experiment with the CG94235-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Z. CG94235-02: Splice Variant of CG94235-01 

Expression of gene CG94235-02 was assessed using the primer-probe sets Ag3909 
and Ag6052, described in Tables ZA and ZB. Results of the RTQ-PCR runs are shown in 
Tables ZC, ZD, ZE, ZF, ZG and ZH. 



Table ZA . Probe Name Ag3909 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 1 -caggtgccacgtctaactagat-3 ' 


22 


1529 


299 


Probe 


TET-5 ■ -tgttgtttgaaacatctacatccacca-3 ' -TAMRA 


27 


1498 


300 


Reverse 


5 ' -gaaatttgggaacactgcataa-3 ' 


22 


1472 


301 




Table ZB. Probe Name Ag6( 


)52 




Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 ' -cctaggcgccgttttg-3 ' 


16 


2242 


302 
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Probe 


TET-5 ' -agtgtacctcctttattcctgaagcccg-3 • -TAMRA 


28 


2210 


303 


Reverse 


5 ' -gtcgaccaggtcaagcact-3 ' 


19 


2188 


304 



Table ZC . CNS_neurodegeneration_vl.O 



Tissue Name 

i 


Rel. Exp.(%) 
Ag3909, Run 
212248424 


Tissue Name 


Rel. Exp.(%) 
Ag3909, Run 
212248424 


(AD 1 Hippo 


11.7 


Control (Path) 3 Temporal Ctx 


5.6 


JAD 2 Hippo 


41.5 


Control (Path) 4 Temporal Ctx 


33.7 


|AD 3 Hippo 


7.6 


AD 1 Occipital Ctx 


12.5 


AD 4 Hippo 


7.2 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


75.3 


AD 3 Occipital Ctx 


1.1 


AD 6 Hippo 


36.1 


AD 4 Occipital Ctx 


23.7 


Control 2 Hippo 


19.1 


AD 5 Occipital Ctx 


29.9 


Control 4 Hippo 


12.9 


AD 6 Occipital Ctx 


35.4 


Crintrr>\ {PntU\ 1 

^oniroi ^rairy _? 
Hippo 


12.2 


Control 1 Occipital Ctx 


9.1 


r\u i i emporai \^tx 


O N 


Control 2 Occipital Ctx 


69.7 


|AD 2 Temporal Ctx 


36.3 


Control 3 Occipital Ctx 


18.9 


AD 3 Temporal Ctx 


3.7 


Control 4 Occipital Ctx 


3.9 


AD 4 Temporal Ctx 


31.9 


Control (Path) 1 Occipital Ctx 


71.2 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 Occipital Ctx 


31.9 


AD 5 SupTemporal 
Ctx 


56.6 


Control (Path) 3 Occipital Ctx 


4.3 


AD 6 Inf Temporal 
Ctx 


22.4 


Control (Path) 4 Occipital Ctx 


23.0 


AD 6 Sup Temporal 
Ctx 


32.3 


Control 1 Parietal Ctx 


8.5 


Control 1 Temporal 
Ctx 


5.3 


Control 2 Parietal Ctx 


38.2 


Control 2 Temporal 
Ctx 


42.9 


Control 3 Parietal Ctx 


29.1 


Control 3 Temporal 
Ctx 


13.0 


Control (Path) 1 Parietal Ctx 


73.2 


Control 4 Temporal 
Ctx 


4.4 


Control (Path) 2 Parietal Ctx 


44.4 


Control (Path) 1 
Temporal Ctx 


58.2 


Control (Path) 3 Parietal Ctx 


7.3 


Control (Path) 2 
Temporal Ctx 


41.8 


Control (Path) 4 Parietal Ctx 


49.0 



Table ZD . General screening panel vl.4 
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Tissue Name 


R<»! Pvn 
IVCl. DAp.^ /0 ) 

Ag3909, Run 
217235826 


Dpi Pvr» fO/A 

Ag3909, Run 
219173644 


Tissue Name 


Rel. 

rixp.(%) 
Ag3909, 
Run 

217235826 


Rel. bxp.(%) 
Ag3909, Run 
219173644 


Adipose 


0.6 


0.6 


Renal ca. TK-10 


0.0 


0.0 


Melanoma* 
Hs688(A)T 


0.0 


0.0 


Bladder 


9.7 


jll.7 


Melanoma* 
Hs688(B).T 


0.0 


0.0 


Gastric ca. (liver 
met.)NCI-N87 


100.0 


100.0 


Melanoma* 
M14 


0.3 


0.4 


Gastric ca. KATO III 


0.5 


■ j 

10.6 

! 


Melanoma* 
T OYIMVI 


0.1 


0.2 


Colon ca. SW-948 


0.5 


|0.5 

1 . 


lvieionoma 
SK-MEL-5 


0.2 


0.2 


Colon ca. SW480 


0.1 


0.1 


Squamous 

tell 

carcinoma 
SCC-4 


0.5 

l 


0.5 


Colon ca.* (bW4o0 
met) SW620 


0.0 


i j 
[0.0 

1 I 

: » 


Testis Pool 


0.6 


0.5 


Colon ca. HT29 


0.1 




Prostate ca.* 
(bone met) 
PC-3 


0.1 


r ".. "■ 

0.1 


Colon ca. HCT-1 16 


0.3 


0.3 


Prostate Pool 


0.3 


0.3 


Colon ca. CaCo-2 


0.1 


0.0 


Placenta 


0.4 


0.5 


Colon cancer tissue 


0.6 


0.5 


Uterus Pool 


0.2 


0.2 


Colon ca. SW1 116 


0.2 


0.1 


Ovarian ca. 
OVCAR-3 


0.2 


0.2 


Colon ca. Colo-205 


0.0 


0.0 


Ovarian ca. 
SK-OV-3 


0.5 


0.4 


Colon ca. SW-48 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.1 


0.1 


Colon Pool 


0.4 


0.5 


Ovarian ca. 
OVCAR-5 


U.o 


0.5 


Small Intestine Pool 


0.4 


0.4 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach Pool 


0.1 


0.3 


Ovarian ca. 
OVCAR-8 


U.K) 


0.0 


Bone Marrow Pool 


0.2 


0.2 


Ovary 


0.8 


0.6 


Fetal Heart 


0.2 


0.2 


Breast ca. 
MCF-7 


0.0 


0.0 


Heart Pool 


0.5 


0.5 


Breast ca. 
MDA-MB- 


0.2 . 


0.2 


Lymph Node Pool 


0.4 


0.4 
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231 












Breast ca. BT 
549 


2 3 


J. I 


retai oKeletal Muscle 


0.2 


0.2 


Breast ca. 
T47D 


I . 1 




oKeietai Muscle Pool 


1.7 


1.9 


Breast ca. 
MDA-N 


0.2 


0.2 


Spleen Pool 


2.3 


2.6 

i 


Breast Pool 


j0.3 


0.4 


Thymus Pool 


0.5 


■0.5 


Trachea 


0.9 


1.0 


CNS cancer 


0.0 


0.0 


Lung 


0.1 


0.1 

i 


CNS cancer 
^giio/dbiro^ u- 1 lo- 

MG 


1 A 
l.U 


I.J 


Fetal Lung 


1 

0.9 j0.9 


(neuro;met) SK-N- 
AS 


1.2 


1.5 


Lung ca. NCI- 
N417 


o.o jo.o 

._ .... i 


CNS cancer (astro) 
SF-539 


0.3 


0.4 j 


Lungca.LX-| 0() jo.O 


CNS cancer (astro) 
SNB-75 


0.1 


! 

0.1 

• 


Lung ca. NCI-L Q 
H146 , 


i.i 


CNS cancer (glio) 
SNB-19 


0.0 


o.o i 


Lungca. Lc 
SHP-77 


0.5 


CNS cancer (glio) 
SF-295 


0.2 


0.2 


Lung ca. 
A549 . 


0.0 


0.0 


Brain (Amygdala) 
Pool 


0.6 


0.5 


Lung ca. NCI- 
H526 


0.0 


0.0 


Brain (cerebellum) 


0.1 


0.2 


Lung ca. NCI- 
H23 


0.0 


0.0 


Brain (fetal) 


0.8 


0.7 


Lung ca. NCI- 
H460 


0.0 


0.0 


Brain (Hippocampus) 
Pool 


0.5 


0.5 


Lung ca. 
HOP-62 


0.0 


0.0 


cereorai cortex rool 


0.7 


0.7 


Lung ca. NCI- 
H522 


0.0 


0.0 


Brain (Substantia 
nigra) Pool 


0.8 


0.7 


Liver 


0.1 


0.1 


Brain (Thalamus) 

Pnnl 


0.8 


0.9 


Fetal Liver 


4.7 


4.1 


Brain (whole) 


1.3 


1.3 


Liver ca. 
HepG2 


0.0 


0.0 


Spinal Cord Pool 


0.4 


0.5 


Kidney Pool 


0.7 


0.8 


Adrenal Gland 


0.7 


0.8 


Fetal Kidney 


0.2 


0.2 


Pituitary gland Pool 


0.3 


0.3 



540 



WO 03/010327 



PCT/US02/14199 



Renal ca. 786- 
0 


0.0 


0.0 


Salivary Gland 


0.5 


0.4 


Renal ca. 
A498 


0.2 


0.2 


Thyroid (female) 


0.4 


0.2 


Renal ca. 
ACHN 


0.0 


0.0 


Pancreatic ca. 
CAPAN2 


0.3 


0.3 


Renal ca. UO- 
31 


0.1 


0.0 


Pancreas Pool 


0.4 


0.3 



Table ZE . Generalscreeningpanel _vl.5 



|Rel. Exp.(%) 
Tissue Name jAg6052, Run 
|228746663 


Tissue Name 


jReh Exp.(%) Ag6052, 

(Run 228746663 

1 


Adipose ll.2 


Renal ca. TK- 10 jO.O 


Melanoma* 
Hs688(Aj.T 


0.0 


Bladder 


12.9 


Melanoma* ' 
Hs688(B).T p° 


Gastric ca. (liver met.) NCI- 
N87 


100.0 

i 


Melanoma* M 14 »0.0 

Melanoma* ; 
LOXIMVI ;0 *° 


Gastric ca. KATO III b.O 

. — . ,, f , r . — ! 

Colon ca. SW-948 iO.O j 


Melanoma* SK- 
MEL-5 ! 0 * 0 


Colon ca. SW480 0.4 

j 
i 


Squamous cell 
carcinoma SCC-4 


2.7 


Colon ca.* (SW480 met) j 
SW620 i 00 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.4 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.4 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.9 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV- 
3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.6 


Ovarian ca. 
OVCAR-5 


0.6 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 

1 , 


0.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 ] 


Bone Marrow Pool 


0.4 
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Ovary 


0.7 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.3 


Breast ca. MDA- 
MB-231 


0.5 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


L5 


Fetal Skeletal Muscle 


1.5 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


2.1 


Breast ca. MDA-N 


0.0 


Spleen Pool 


3.7 


Breast Pool 


0.9 


Thymus Pool 


1.2 


TP 1 

Trachea 


1.0 


CNS cancer (glio/astro) U87- 
MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U-l 18- 
MG 


0.8 


Fetal Lung 


0.4 


CNS cancer (neuro;met) SK- 
N-AS 


2.3 


Lung ca. NC1-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) SNB-75 jo.O 


Lung ca.NCI-H 146 


0.4 


CNS cancer (glio) SNB-1 9 0.0 


Lungca. SHP-77 ,0.0 


CNS cancer (glio) SF-295 


1.9 ! 


Lung ca. A549 !o.O 


Brain (Amygdala) Pool 


1.3 ! 


Lungca.NCI-H526 0.0 


Brain (cerebellum) jO.O 


Lungca.NCI-H23 0.0 


Brain (fetal) jo.O 


Lungca. NCI-H460 


0.0 


Brain (Hippocampus) Pool jl.2 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


1.8 


Lungca. NCI-H522 


0.0 


Brain (Substantia nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.4 


Fetal Liver 


2.8 


Brain (whole) 


0.0 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


1.4 


Kidney Pool 


0.0 


Adrenal Gland 


0.4 


Fetal Kidney 


1.6 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.6 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.3 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.6 



Table ZF. Panel 4.1D 



Tissue Name 


Rel. 

Exp.(%) 
Ag3909, 
Run 

170127176 


Rel. Exp.(%) 
Ag6052, Run 
226202125 


Tissue Name 


Rel. Exp.(%) 
Ag3909, Run 
170127176 


Rel. 

Exp.(%) 
Ag6052, 
Run 

226202125 


Secondary Thl act 


0.9 


0.0 


HUVECIL-lbeta 


0.1 


0.0 
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Secondary Th2 act 


32.3 


27.7 


HUVEC IFN gamma 


0.9 


0.5 


Secondary Trl act 


3.6 


0.8 


HUVEC TNF alpha + 
IFN gamma 


10.5 


18.3 


Secondary Thl 
rest 


5.5 


5.9 


HUVEC TNF alpha + 
IL4 


0.4 


0.6 


Secondary Th^ 
rest 


2.0 


1.0 


HUVEC IL-11 


0.0 


2.7 


Secondary Trl rest 


4.8 


| ! 4.3 


Lung Microvascular 
EC none 


0.2 


0.0 


i - 

Priniarv Th 1 art 

i i illicit y i ii i a\*\. 




1 

1 

! 

! 

1 1 
I . 1 


Lung Microvascular 
be J iNraipna + IL- 
Ibeta 


0.6 


0.0 


Primary Th2 act 


2.0 


j 

2.2 


Microvascular Dermal 
EC none 


0.1 


0.0 


Primary Trl act 


1.1 


0.4 


Microsvasular Dermal 
EC TNFalpha + IL- 
lbeta 


0.4 


0.0 


Primary Thl rest b.5 


4.4 


Bronchial epithelium 
TNFalpha + IL lbeta 


0.0 


0.0 


! j 

Primary Th2 rest jo.3 

... 3 


2.1 


Small airway 
epithelium none 


o.o io.o 

i 


Primary Trl rest 


2.5 |0.3 

i 


Small airway 
epithelium TNFalpha 
+ I L- lbeta 


o.o jo.o 

V 

f 

i 


CD45RA CD4 
lymphocyte act 


9.3 


4.2 


Coronery artery SMC 
rest 


0.0 


. > 

0.0 


CD45RO CD4 
lymphocyte act 


12.4 


11.7 


Coronery artery SMC \ 
TNFalpha + IL- lbeta 


0.0 


0.0 


CD8 lymphocyte 
act 


1.2 


0.6 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest 


4.7 


12.9 


Astrocytes TNFalpha ; 
+ IL- lbeta 


0.5 


0.3 


Secondary CD8 
lymphocyte act 


0.7 


0.4 


ivu-oiz ^jBasopnnj 
rest 


1.4 


1.1 


CD4 lymphocyte 
none 


0.5 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


4.7 


6.3 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


1.2 


1.9 


CCD1106 

k jveraunocyiesj none 


0.4 


0.7 


LAK cells rest 


2.8 : 


2.8 


CCDl 106 
(Keratinocytes) 
rNFalpha + IL- lbeta 


7.7 


5.5 ' 


LAK cells IL-2 


7.5 i 


S.4 ] 


Jver cirrhosis < 


D.l 1 


3.0 


LAK cells IL- 


7.1 : 


J.2 ] 


NCI-H292 ndne 1 


5.1 0.0 
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2+IL-12 












LAK cells IL- 
2+1 FN gamma 


9.5 


2.7 


NCI-H292 IL-4 


0.3 


0.0 


LAK cells IL-2+ 
IL-18 


U.7 


7.1 


NCI-H292 IL-9 


0.4 


0.0 


LAK cells 
jPMA/ionomycin 


6.9 


5.5 


NCNH292 IL-13 


0.5 


0.0 


i 

INK Cells 1L-2 rest 

i 


11.3 


13.4 


NCI-H292 I FN 
gamma 


2.0 


0.4 


(Two Way MLR 3 
day 


14.0 


12.7 


HPAEC none 


0.0 


0.6 


Two Way MLR 5 
day 




1 A. 
J.O 


HP A EC TNF alpha + 
IL-1 beta 


2.9 


0.4 


Two Way MLR 7 
day 


1.7 


5.2 


Lung fibroblast none 


0.0 


0.6 


IPBMC rest 

i 

| ' 1 ' ' 


0.4 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


9.5 


18.6 


IPBMC PWM 


2.7 


1.5 


Lung fibroblast IL-4 


0.0 


5.2 


IPBMC PHA-L 


1.1 


4.2 


Lung fibroblast IL-9 


0.0 


0.0 


jRamos (B cell) 
;none 


0.0 


0.0 


Lung fibroblast IL-13 


0.0 


0.0 


jRamos (B cell) 

jionomycin 

j ^ 


0.0 


0.0 


Lung fibroblast IFN 
gamma 


i 

3.6 


5.7 


B lymphocytes 
PWM 


0.9 


0.7 


Dermal fibroblast 
CCD 1070 rest 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 \ 


0.5 


1.5 


Dermal fibroblast 
CCD 1070 TNF alpha 


1.1 


1.6 


EOL-1 dbcAMP 


0.6 


0.3 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.8 


1.6 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast IFN 
gamma 


6.3 


2.9 


Dendritic cells 
none 




U.U 


Dermal fibroblast IL- 
4 


0.3 


0.0 


Dendritic cells 

I PC 


20.0 


15.1 


Dermal Fibroblasts 
rest 


0.1 


0.0 


uenaruic ceus 
anti-CD40 


0.3 


0.4 


Neutrophils 
TNFa+LPS 


0.1 


1.6 


Monocytes rest 


2.7 


2.8 


Neutrophils rest 


0.3 


0.0 


Monocytes LPS 


100.0 


100.0 


Colon 


0.5 


3.0 


Macrophages rest 


0.9 


0.8 


Lung 


0.9 


1.4 


Macrophages LPS 


36.9 


24.3 


Thymus 


1.0 


1.3 


HUVECnone :< 


O.O l 


D.O 


Kidney 


0.4 


D.O 


HUVEC starved \ 


0.1 1 


[).0 




i 
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Table ZG. Panel 5 Islet 



] 

Tissue Name 


Rel. Exp.(%) 
Ag3909,Run ' 
242413199 


1 
1 

rissue Name i 

1 


*el. 

\g3909, 
Hun 

242413 1 99 


|97457_Patient-02go_adipose 


90.1 


94709_Donor 2 AM - A_adipose I 


3.0 


97476_Patient-07sk_skeletal 
muscle 


22.2 


947 1 0_Donor 2 AM - B_adipose 


1.8 


197477 Patient-07ut uterus 1 8.2 [947 1 lJDonor 2 AM - C_adipose 


0.0 


97478J > atient-07pl_placenta 53.6 


947 1 2__Donor 2 AD - A_adipose 


1.6 


99167 Bayer Patient I 


100.0 


9471 3_Donor 2 AD - B_adipose 


1.2 


97482_Patient-08ut_uterus 


44.8 


94714_Donor 2 AD - C_adipose 


0.0 


97483_Patient-08pl_placenta 


74.2 


94742_Donor 3 U - 
AMesenchymal Stem Cells 


3.1 


97486_Patient-09sk_skeletal 
muscle 


9.3 


94743_Donor 3 U - 
B_Mesenchymal Stem Cells 


3.7 


97487_Patient-09ut_uterus 


24.1 


94730_Donor 3 AM - A_adipose 


1.7 


|974S8_Patient-09pl_placenta 


44.1 


9473 1 _Donor 3 AM - B_adipose |3.6 


<97492_Patient- 1 Out_uterus 


47.0 


94732_Donor 3 AM - C_adipose 


0.0 


|97493_Patient-10plj>lacenta 1 


68.3 j94733JDonor 3 AD - A_adipose 


0.0 


I97495 Patient- 11 go adipose 


1 7.4 |94734_Donor 3 AD - B_adipose 


0.0 


97496JPatient-l lsk_skeletal 
muscle 


22.8 


94735_Donor 3 AD - C_adipose 


0.0 


97497_Patient-l 1 ut_uterus 


24.3 


77 1 3 8 _Liver_HepG2untreated 


1.8 


97498_Patient-l lpl_placenta 


15.4 


73556_Heart_Cardiac stromal cells 
(primary) 


0.0 


97500J > atienM2go_adipose 


62.4 


81735_Small Intestine 


97.3 


9750 1 _PatienM 2sk_skeletal 
muscle 


79.6 


72409_Kidney_ProximaI 
Convoluted Tubule 


5.6 


97502_Patient- 1 2ut_uterus 


30.6 


82685_Small intestineJDuodenum 


54.3 


97503 _PatienM 2pl_placenta 


29.7 


90650_Adrenal_Adrenocortical 
adenoma 


10.0 


94721_Donor2U- 
A_Mesenchymal Stem Cells 


0.0 


72410JKidney_HRCE 


8.3 


94722_Donor2U- 
BJVlesenchymal Stem Cells 


0.0 


72411_Kidney_HRE 


28.3 


94723 JDonor 2 U - 
C_MesenchymaI Stem Cells 


0.0 


73139_UterusJJterine smooth 
muscle cells 


0.0 


Table ZH. General oncology screening panel_v_2. 


4 


Tissue Name 


Rel. Exp.(%) 
Ag3909, Run 


Tissue Name 


Rel. 

Exp.(%) 
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268143664 




Ag3909, 
Run 

268143664 


v^uion cancer 1 




Bladder cancer NAT 2 


0.0 


Pnlrm MAT 1 

^oion IN A, 1 1 


ft ft 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


0.0 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 2 


0.0 


Adenocarcinoma of the prostate 1 


0.0 


1 , • 

jColon cancer 3 


0.1 


Adenocarcinoma of the prostate 2 


100.0 


'Colon cancer NAT 3 


0.0 


Adenocarcinoma of the prostate 3 


0.0 


Colon malignant cancer 4 


0.0 


Adenocarcinoma of the prostate 4 


0.0 


Colon normal adjacent tissue 
4 

i — ■ — 


0.0 


Prostate cancer NAT 5 


0.0 


Xung cancer 1 


0.0 


Adenocarcinoma of the prostate 6 


0.0 


iLungNAT 1 


0.0 


Adenocarcinoma of the prostate 7 


0.0 


'Lung cancer 2 


0.0 


Adenocarcinoma of the prostate 8 


0.0 


jLimg NAT 2 jO.O 


Adenocarcinoma of the prostate 9 [o.O 


Squamous cell carcinoma 3 jo.O 


Prostate cancer NAT 1 0 jo.O 


:Lung NAT 3 jO.0 [Kidney cancer 1 jo.O 


metastatic melanoma 1 


0.0 


KidneyNAT 1 


0.0 


.Melanoma 2 jo.O 


Kidney cancer 2 


0.0 


Melanoma 3 


0.0 


Kidney NAT 2 


0.0 


^metastatic melanoma 4 


0.0 


Kidney cancer 3 


0.0 


metastatic melanoma 5 


0.0 


Kidney NAT 3 


0.0 


Bladder cancer 1 


0.0 


Kidney cancer 4 


0.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


0.0 


Bladder cancer 2 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3909 This panel does not show 
differential expression of the CG94235-01 gene in Alzheimer's disease. However, this 
expression profile confirms the presence of this gene in the brain. Please see Panel 1.4 for 
discussion of utility of this gene in the central nervous system. A second experiment with 
5 the probe and primer set Ag6052, which is specific for the CG94235-02 variant, shows 
low/undetectable levels of expression in all the samples on this panel (CTs>35). (Data not 
shown.) 

General_screening_panel_vl.4 Summary: Ag3909 Two experiments with the 
same probe and primer produce results that are in excellent agreement, with highest 
10 expression of the CG94235-01 gene in a gastric cancer cell line (CTs=23. 6-24.4). Thus, 
expression of this gene could be used as to differentiate this sample from other samples on 
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this panel and as a marker of gastric cancer. This gene encodes a putative thymidylate 
kinase, a DNA synthesis enzyme necessary for cell growth. Thus, therapeutic modulation 
of the expression or function of this gene product may be useful in the treatment of gastric 
cancer. 

5 Among tissues with metabolic function, this gene is expressed at moderate to low 

. levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic function and that 
disregulated expression of this gene may contribute to neuroendocrine disorders or 

10 metabolic diseases, such as obesity and diabetes. 

In addition, this gene is expressed at much higher levels in fetal lung, liver and 
skeletal muscle tissue (CTs=28-30) when compared to expression in the adult counterpart 
(CTs=32.5-35). Thus, expression of this gene may be used to differentiate between the 
fetal and adult source of these tissues. 

15 This gene is also expressed at moderate to low levels in the CNS. including the 

hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 
Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

20 General_screeningjpanel_vl.5 Summary: Ag6052 Expression of the CG94235- 

02 variant appears to be restricted to a gastric cancer cell line (CT=31.2) and normal 
bladder (CT=34.2) in this panel. Thus, expression of this gene could be used to 
differentiate between these samples and other samples on this panel and as a marker to 
detect the presence of gastric cancer. Furthermore, therapeutic modulation of the 

25 expression or function of this gene may be effective in the treatment of gastric cancer. 

Panel 4.1D Summary: Ag3909/Ag6502 Two experiments with two different 
probe and primer sets produce results that are in very good agreement. Highest expression 
of the CG94235-01 gene is seen in LPS treated monocytes. (Ag3909 CT=25.4; Ag6502 
CT=31.1). Prominent levels of expression are also seen in LPS activated macrophages and 

30 dendritic cells. This transcript may encode a protein that is important in the normal 

regulation of cytokines. Inappropriate regulation of the protein encoded by this gene may 
result in the enhanced and uncontrolled expression of inflammatory cytokines. Therefore, 
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designing therapies that could regulate the expression (such as antisense therapies) or 
function of the protein encoded by CG94325-01this gene may be important in the 
treatment of osteoarthritis and rheumatoid arthritis as well as other diseases. 

Panel 5 Islet Summary: Ag3909 Highest expression of the CG94235-01 gene is 
5 seen in islet cells (CT=33.4). Low but significant levels of expression are seen in other 
metabolic tissues, including adipose, placenta and skeletal muscle. Please see Panel 1 A for 
discussion of utility of this gene in metabolic disease. 

General oncology screening panel_v_2.4 Summary: Ag3909 Results from one 
experiment with the CG94235-01 gene are not included. The amp plot indicates that there 
10 were experimental difficulties with this run. A second experiment with the probe and 

primer set Ag6052, which is specific for the CG94235-02 variant, shows low/undetectable 
levels of expression in all the samples on this panel (CTs>35). (Data not shown.) 

A A. CG94692-01 and CG94692-02: Carnitine/Acylcarnitin Translocase 

15 Expression of gene CG94692-01 and full length physical clone CG94692-02 was 

assessed using the primer-probe set Ag3941, described in Table AAA. Results of the 
RTQ-PCR runs are shown in Tables AAB, AAC, AAD and AAE. Please note that 
CG94692-02 represents a full-length physical clone of the CG94692-01 gene, validating 
the prediction of the gene- sequence. 

20 Table AAA . Probe Name Ag3941 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 1 -ggcttcttcaagggaatgag-3 ■ 


20 


160 


305 


Probe 


TET-5 * -ttgccagcatagctgtggtcaactct-3 ' -TAMRA 


26 ; 


188 


306 


Reverse 


5 1 -tgttgctatagaccccaaacag-3 1 


22 


217 


307 



Table AAB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel.Exp.(%)Ag3941, 
Run 212344932 


Tissue Name 


Rel. Exp.(%) 
Ag3941,Run 
212344932 


AD 1 Hippo 


6.3 


Control (Path) 3 Temporal Ctx 


4.2 


AD 2 Hippo 


16.5 


Control (Path) 4 Temporal Ctx 


28.9 


AD 3 Hippo 


3.0 


AD 1 Occipital Ctx 


2.0 


AD 4 Hippo 


6.4 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


0.6 


AD 6 Hippo 


10.6 


AD 4 Occipital Ctx 


17.1 
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Control 2 Hippo 


17.3 


AD 5 Occipital Ctx \ 


Z8.7 


Control 4 Hippo 


8.0 


AD 6 Occipital Ctx ( 


5.4 


Control (Path) 3 
Hippo 


4.5 


Control 1 Occipital Ctx \ 


2.1 


AD 1 Temporal Ctx 


6.7 


Control 2 Occipital Ctx 


39.5 


AD 2 Temporal Ctx 


27.9 


Control 3 Occipital Ctx 


17.4 


AD 3 Temporal Ctx 


3.9 


Control 4 Occipital Ctx 


5.7 


{AD 4 TemDoral Ctx 


18.9 


Control (Path) 1 Occipital Ctx 


51.4 


AD 5 Inf Temporal 
Ctx 


39.8 


Control (Path) 2 Occipital Ctx 


11.5 


AD 5 Sup Temporal 
Ctx 


25.0 


Control (Path) 3 Occipital Ctx 


1.6 


AD 6 Inf Temporal 
Ctx 


13.8 


Control (Path) 4 Occipital Ctx 


16.7 


AD 6 Sup Temporal 
Ctx 


11.9 


Control 1 Parietal Ctx 

i 


4.1 


Control 1 Temporal 

Ctx 


5.2 


Control 2 Parietal Ctx J 13.7 


Control 2 Temporal 
Ctx 


33.9 


Control 3 Parietal Ctx 


13.6 


Control 3 Temporal 
Ctx 


6.0 

! 


Control (Path) 1 Parietal Ctx 


36.9 


Control 3 Temporal 
Ctx 


i 

10.7 


Control (Path) 2 Parietal Ctx 


21.9 


Control (Path) 1 
Temporal Ctx 


45.4 


Control (Path) 3 Parietal Ctx 


2.8 


Control (Path) 2 
Temporal Ctx 


23.3 


Control (Path) 4 Parietal Ctx 


42.6 


Table AAC. General screenin&_panel_vl.4 


Tissue Name 


Rel. Exp.(%) 
Ag3941,Run 
219822166 


Tissue Name 


Rel. Exp.(%) 
Ag3941, Run 
219822166 


Adipose 


6.3 


Renal ca. TK-10 


7.5 


Melanoma* Hs688(A).T 


25.3 


Bladder 


11.3, 


Melanoma* Hs688(B).T 


8.1 


Gastric ca. (liver met.) NCI-N87 


13.8 


Melanoma* M14 


10.6 


Gastric ca. KATO III 


3.7 


Melanoma* LOXIMVI 


10.5 


Colon ca. SW-948 


8.2 


Melanoma* SK-MEL-5 


17.8 


Colon ca. SW480 


21.3 


Squamous cell carcinoma 
SCC-4 


1.3 


Colon ca.* (SW480 met) SW620 


14.0 


Testis Pool 


7.6 


Colon ca. HT29 


5.0 
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Prostate ca.* (bone met) PCX 


$32.3 


Colon ca.HCT-1 16 


6.3 


Prostate Pool 


8.5 


Colon ca. CaCo-2 


12.0 


Placenta 


12.7 


Colon cancer tissue 


7.4 


Uterus Pool 


1.2 


Colon ca. SW1116 


19.5 


Ovarian ca. OVCAR-3 


16.4 


Colon ca. Colo-205 


1.1 


Ovarian ca. SK-OV-3 


7.2 


Colon ca. SW-48 


6.6 


Ovarian ca. OVCAR-4 


3.3 


Colon Pool 


12.9 


Ovarian ca. OVCAR-5 


55.1 


Small Intestine Pool 


8.2 


Ovarian ca. IGROV-I 


22.1 


Stomach Pool 


]^5 


Ovarian ca. OVCAR-8 


6.8 


Bone Marrow Pool 


4.6 


Ovary 


6.8 


Fetal Heart 


|7.2 


Breast ca. MCF-7 


11.2 


Heart Pool 


15.6 


Breast ca. MDA-MB-231 


11.5 


Lymph Node Pool 


11.0 


Breast ca. BT 549 


9.1 


Fetal Skeletal Muscle 


115.5 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool { 1 2.2 


Breast ca. MDA-N 


32.5 


Spleen Pool 


1 18.6 

|23.0 


Breast Pool |9. i 


Thymus Pool 


Trachea 


16.4 


CNS cancer (glio/astro) U87-MG 


19.8 
15.6 


Lung 


1.0 


CNS cancer (glio/astro) U-l 18- 
MG 


Fetal Lung 


29.7 


CNS cancer (neuro;met) SK-N-AS 


5.0 


Lungca.NCI-N417 


2.9 


CNS cancer (astro) SF-539 |3.7 


Lungca. LX-1 


18.3 


CNS cancer (astro) SNB-75 1 3.7 


Lung ca.NCI-H 146 / 


8.3 


CNS cancer (glio) SNB- 1 9 


18.0 


Lung ca. SHP-77 


12.7 


CNS cancer (glio) SF-295 


14.6 


Lung ca. A549 


18.8 


Brain (Amygdala) Pool 


7.2 


Lung ca. NCI-H526 


5.2 


Brain (cerebellum) . 


5.2 


Lungca.NCI-H23 


24.1 


Brain (fetal) 


1.5 


Lung ca. NCI-H460 


6.3 


Brain (Hippocampus) Pool 


6.2 


Lung ca. HOP-62 


7.0 


Cerebral Cortex Pool 


8.4 


Lungca.NCI-H522 


15.4 


Brain (Substantia nigra) Pool 


12.3 


Liver 


5.2 ] 


Brain (Thalamus) Pool < 


?.2 


Fetal Liver 


15.9 | 


Brain (whole) 


7-7 


Liver ca. HepG2 < 


>.5 < 


Jpinal Cord Pool \ 


J.O 


Kidney Pool 


18.6 j 


\drenal Gland < 


).5 


Fetal Kidney < 


).0 I 


3 ituitary gland Pool i 


1.3 


Renal ca. 786-0 j; 


U I 


ialivary Gland \ 


5.7 


Renal ca. A498 jj 


.5 n 


fhyroid (female) J 


!3.7 
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Renal ca. ACHN f 


>.5 I 


>ancreatic ca. CAPAN2 t 


1.2 


Renal ca.UO-31 - 


5.3 1 


J ancreas Pool ' 


7.0 


Table AAD. Panel 4.1D 


jl 

Tissue Name ' 


lei. Exp.(%) 
\g3941,Run ' 
170684829 


I 

rissue Name ' 


lei. Exp.(%) 
\g3941,Run 
70684829 


Secondary Thl act 


36.9 


HUVEC IL-lbeta ? 


).3 


Secondary Th2 act 


43.5 ~[ 


HUVEC I FN gamma i 


5.4 


Secondary Trl act 147.3 


HUVEC TNF alpha + IFN gamma ,. 


5.9 


Secondary Thl rest 


46.0 


HUVEC TNF alpha + IL4 \ 


3.7 


Secondary Th2 rest 


45.1 


HUVEC IL-1 1 |7.0 


Secondary Trl rest 


58.6 


Lung Microvascular EC none | 


11.7 


Primary Thl act 


31.6 


Lung Microvascular EC TNFalphal 
+ 1L-Ibeta | 


14.2 


Primarv Th2 act 


72.7 


Microvascular Dermal EC none ;6.0 


Primary Trl act 


47.0 


Microsvasular Dermal EC L g 
TNFalpha + lL-lbeta ' ' 


Primary Thl rest 


40.6 


Bronchial epithelium TNFalpha + 
ILlbeta 


2.4 


Primary Tli2 rest 


26.2 


Small airway epithelium none ; 1 .4 


Primary Trl rest 


100.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


4.1 


CD45RA CD4 lymphocyte 
act 


27.9 


Coronery artery SMC rest 


9.4 


CD45RO CD4 lymphocyte 
act 


45.1 


Coronery artery SMC TNFalpha + 
IL-lbeta 


14.2 


CD8 lymphocyte act 


44.1 


Astrocytes rest 


4.2 


Secondary CD8 lymphocyte 
rest 


37.1 


Astrocytes TNFalpha + IL-lbeta 


7.1 


Secondary CD8 lymphocyte 
act 


33.2 


KU-812 (Basophil) rest 


16.8 


CD4 lymphocyte none 


63.7 


KU-8 12 (Basophil) 
PMA/ionomycin 


11.7 


2ry Thl/Th2/Trl_anti-CD95 
CH11 


85.9 


CCD 1 106 (Keratinocytes) none 


2.6 


LAK cells rest 


73.7 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-1 beta 


0.8 


LAK cells IL-2 


68.3 


Liver cirrhosis 


8.1 


LAK cells IL-2+IL-12 


42.9 


NCI-H292 none 


19.2 


LAK cells IL-2+IFN gamma 


35.8 


NCI-H292 1L-4 


17.9 


LAK cells IL-2+IL-18 


]59.5 


|NCI-H292 IL-9 


38.2 
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LAK cells PMA/ionomycin 


14.7 


(NCI-H292IL-13 


11.9 


v^eus rest 


81.2 


NCI-H292IFN gamma 


26.1 


i wu way iviLrK j flay 


63.3 JHPAEC none 


4.0 


i wo way mlk 3 day 


3 1 .9 JHPAEC TNF alpha + IL- 1 beta 


13.3 


i wu way iviLrv / day 


28.9 |Lung fibroblast none 


6.8 


PBMC rest 


26 8 |Lung fibroblast TNF alpha + IL-I 
|beta 


T — ~— 

9.0 


PBMC PWM 


18.2 


Lung fibroblast IL-4 


7.6 


PRMP PI4A 1 


33.0 


Lung fibroblast JL-9 


19.6 


Ramos (B cell) none 


7.2 


Lung fibroblast IL-I 3 


7.9 


Ramos (B cell) ionomycin 


20.6 


Lung fibroblast I FN gamma 


7.6 


B lymphocytes PWM 


25.0 


Dermal fibroblast CCD1070 rest 


1 1 i 


B lymphocytes CD40L and 
IL-4 


74.2 


Dermal fibroblast CCD 1070 TNF 
alpha 


40.1 


EOL-1 dbcAMP 


2.5 


Dermal fibroblast CCD1070 IL-1 
beta 


1 A 


EOL-1 dbcAMP 
PMA/ionomycin | 


] — — — , 

? 

1 .6 jDermal fibroblast I FN gamma 


4.9 I 


Dendritic cells none 


23.8 jDermal fibroblast IL-4 


15.4 


Dendritic cells LPS |23.2 jDermal Fibroblasts rest 


8.7 


Dendritic cells anti-CD40 


36.3 


Neutrophils TNFa+LPS 


2.9 


Monocytes rest |3 ] .6 


Neutrophils rest 


1 I.J 


Monocytes LPS 


16.4 


Colon i 


40.9 


Macrophages rest ; 


JO. I 


Lung 


37.6 


Macrophages LPS 


13.8 / 


rhymus < 


59.7 


HUVEC none j, 


l.l 1 


ICidney j { 


35.9 


HUVEC starved ~T 


5.8 





Table AAE. General oncology screening panel_v_2.4 



Tissue Name 


Rel. Exp.(%) 
Ag3941, Run 
268035089 


Tissue Name 


Rel. Exp.(%) 
Ag3941,Run 
268035089 


Colon cancer 1 


7.9 


B ladder cancer NAT 2 


0.0 


Colon cancer NAT 1 


4.2 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


6.5 


Bladder cancer NAT 4 


2.0 


Colon cancer NAT 2 


7.9 


Adenocarcinoma of the prostate 1 


13.4 


Colon cancer 3 


8.0 


Adenocarcinoma of the prostate 2 


0.2 


Colon cancer NAT 3 


4.8 


Adenocarcinoma of the prostate 3 


7.4 


Colon malignant cancer 4 


9.3 


Adenocarcinoma of the prostate 4 


6.9 


Colon normal adjacent tissue 


2.7 


Prostate cancer NAT 5 


0.4 
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4 








Lung cancer 1 


7.5 


Adenocarcinoma of the prostate 6 


2.9 


Lung NAT 1 


1.2 


Adenocarcinoma of the prostate 7 


3.4 


Lung cancer 2 


21.5 


Adenocarcinoma of the prostate 8 


1.3 


Lung NAT 2 


1.1 


Adenocarcinoma of the prostate 9 


15.8 


Squamous cell carcinoma 3 


15.7 


Prostate cancer NAT 1 0 


1.1 


Lung NAT 3 


0.4 


Kidney cancer 1 


16.6 


metastatic melanoma 1 


7.9 


KidneyNAT 1 


4.2 


Melanoma 2 j0.2 


Kidney cancer 2 


100.0 


Melanoma 3 


0.8 


Kidney NAT 2 


3.2 


metastatic melanoma 4 


19.2 


Kidney cancer 3 


16.0 


metastatic melanoma 5 


23.5 


Kidney NAT 3 


4.0 


Bladder cancer 1 


0.5 


Kidney cancer 4 


14.6 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


18.3 


Bladder cancer 2 jl.9 







CNS_neurodegeneration_vLO Summary: Ag3941 The CG94692-01 gene 



appears to be slightly downregulated in the brains of Alzheimer's patients, when compared 
to expression in normal control brains. Therefore, up-regulation of this gene or its protein 
product, or treatment with specific agonists for this receptor may be of use in reversing the 
5 dementia/memory loss and neuronal death associated with this disease. 

General_screening_panel_vl.4 Summary: Ag3941 This gene is widely 
expressed in this panel, with highest expression in a breast cancer cell line (CT=28.6). 
Prominent levels of expression are also seen in samples derived from prostate cancer, 
ovarian cancer and melanoma. Thus, expression of this gene could be used to differentiate 
1 0 these samples from the rest of the samples on this panel and as a marker of these cancers. 
Furthermore, therapeutic modulation of the expression or function of this gene product 
may be usefiil in the treatment of breast, ovarian, prostate and melanoma cancers. Overall, 
the widespread expression of this gene suggests a role for this gene in cell survival and 
growth. 

15 Among tissues with metabolic function, this gene is expressed at moderate to low 

levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic function and that 
disregulated expression of this gene may contribute to neuroendocrine disorders or 

20 metabolic diseases, such as obesity and diabetes. 
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In addition, this gene is expressed at much higher levels in fetal lung (CT=30.5) 
when compared to expression in the adult counterpart (CT=35.2). Thus, expression of this 
gene may be used to differentiate between the fetal and adult source of this tissue. 

This gene is also expressed at moderate to low levels in the CNS, including the 
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 
Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Panel 4.1D Summary: Ag3941 Highest expression of the CG94692-01 gene is 
seen in primary resting Trl cells (CT=30.9). The transcript is expressed at higher levels in 
lymphocytes, which is consistent with the expression in the thymus and lymph node on 
Panel 1.4. Therefore, therapeutics designed with this sequence or the protein it encodes 
could be important in regulating T cell activation and be important for immune modulation 
and in treating T and B cell mediated diseases such as asthma, allergy, COPD. arthritis, 
psoriasis, lupus and IBD. 

General oncology screening panel_v_2.4 Summary: Ag3941 Highest 
expression of the CG94692-01 gene is seen in kidney cancer (CT=29.5). In addition, 
significant levels of expression are also seen in kidney cancers and lung cancers when 
compared to normal adjacent tissue. Thus, expression of this gene could be used to 
differentiate these samples from the rest of the samples on this panel and as a marker of 
these cancers. Furthermore, therapeutic modulation of the expression or function of this 
gene product may be useful in the treatment of kidney and lung cancers. 

AB. CG94724-01: Acylcarnitine Translocase 

Expression of gene CG94724-01 was assessed using the primer-probe set Ag4071, 
described in Table ABA. Results of the RTQ-PCR runs are shown in Table ABB. 

Table ABA . Probe Name Ag4071 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 1 -ctggtacattcaccacaggc-3 1 


20 


363 


308 


Probe 


TET-5 ' -cctggagaacccatcaagtccttgtt-3 ■ -TAMRA 


26 


392 


309 


Reverse 


5 ? -acttggtttcccctgaagaa-3 1 


20 


430 


310 



Table ABB . Panel 4.1D 
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Tissue Name 


Rel. Exp.(%) | 

Ag4071 , Run Tissue Name 

1718078?** 

1 / I O \J l OZ.J 1 


Rel. Exp.(%) 
Ag4071, Run 
171807825 


Secondary Thi act 


0.0 |HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 jHUVEC I FN gamma 


0.0 


Secondary Tr I act 


00 HUVECTNF alpha + IFN 
(gamma 


0.0 


Secondary Th I rest 


0 0 |HUVEC TNF alpha + IL4 


0.0 | 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 ; 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.0 """ "~j 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha -HL-lbeta 


ft ft 

U.U 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium TNFalpha + 
ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium none 


°J 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha -HL-lbeta 


0.0 | 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest jo.O J 

i 


CD45RO CD4 
lymphocyte act 


ft ft 

u.u 


Coronery artery SMC TNFalpha 
-HL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


3.4 


Astrocytes TNFalpha + IL- 1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 




CD4 lymphocyte none 


4.9 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


< 


try Inl/Th2/Trl anti- 
:D95CH11 1 


D.O 


CCD 1 1 06 (Keratinocytes) none l 


0.0 


] 


^AK cells rest ( 


).0 


CCD 1 1 06 (Keratinocytes) 
TNFalpha + IL-1 beta 


3.0 




.AK cells IL-2 ( 


).0 


Liver cirrhosis ( 


).0 


I 


_AK cells IL-2+IL-1 2 ( 


).0 


NCI-H292none \( 


).0 


I 

£ 


.AK cells IL-2+IFN 
jamma ' 


0.7 1 


MCI-H292IL-4 ( 


).0 




.AK cells IL-2+IL- 18 C 


10 1 


MCI-H292IL-9 ( 


>.o 


I 
F 


.AK cells 

'MA/ionomycin ^ 


K0 I 


MCI-H292 IL-13 C 


1.0 
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NK Cells IL-2 rest 


0.0 


NCI-H292 IFN gamma 


0,0 


Two Way MLR 3 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 beta 


0.0 


Two Way MLR 7 day 


4.2 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-1 
beta 




PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L jO.O 


Lung fibroblast 1L-9 


0.0 


Ramos (B cell) none ;0.0 


Lung Fibroblast 1L-13 


7.9 


Kamos (t$ ceil) 
ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD 1070 rest 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast CCD 1 070 TNF 
alpha 


5.1 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 

Dendritic cells none 


0.0 

o'.o 


Dermal fibroblast IFN gamma 
Dermal fibroblast IL-4 


0.0 

_j 

"o.o : " 


'Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest jO.O 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS |o.O 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


9.5 


Macrophages LPS 


0.0 


Thymus 


6.9 


HUVEC none . 


0.0 


Kidney 


100.0 


HUVEC starved 


3.7 . 







CNS_neurodegeneration_vl.O Summary: Ag4071 Expression of the CG94724- 
01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

5 GeneraI_screening_paneI_vL4 Summary: Ag4071 Expression of the CG94724- 

01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

Panel 4.1D Summary: Ag4071 Expression of the CG94724-01 gene is 
exclusively seen in kidney (CT=32.9). Therefore, expression of this gene can be used to 
10 distinguish kidney sample from other samples in this panel. In addition, therapeutic 

modulation of this gene through the use of small molecule target may be beneficial in the 
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treatment of autoimmune and inflammatory diseases that affect kidney, including lupus 
and glomerulonephritis. 

The CG94724-01 gene codes for carnitine/acylcarnitine translocase (CACT) 
homologue. Carnitine-acylcarnitine translocase is 1 of 10 closely related mitochondrial - 
5 membrane carrier proteins that shuttle substrates between cytosol and the 

intramitochondrial matrix space. Deficiency in CACT causes defect in the co-transport of 
free and esterified carnitine across the inner mitochondrial membrane. Recently, Choong 
et al (2001, Pediatr Dev Pathol 4(6):573-9, PMID: 1 1826365) reported a case of lethal 
cardiac tachyarrhythmia in a newborn who died at 72 h of age from severe, intractable 

10 cardiac tachyarrhythmia, despite an improvement in his neurological and biochemical 

status caused due to CACT deficiency. Postmortem examination showed marked steatosis 
of myocardium, liver, and kidney. Thus, the CG94724-01 gene may also play a role in the 
pathology of disorders associated with CACT deficiency and therapeutic modulation of 
this gene product could be useful in the treatment of these disorders including lethal 

15 cardiac tachyarrhythmia , 

Panel 5 Islet Summary: Ag4071 Expression of the CG94724-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel _v_2.4 Summary: Ag4071 Expression of the 
CG94724-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 

20 (data not shown). 

AC CG94946-01 and CG94946-02 and CG94946-04 and CG94946-05 and CG94946- 
06 and CG94946-07: Agrin Precursor 

Expression of gene CG94946-01 and variants CG94946-02, CG94946-04, 
25 CG94946-05, CG94946-06 and CG94946-07 was assessed using the primer-probe sets 
Ag3605 and Ag3974 ? described in Tables ACA and ACB. Results of the RTQ-PCR runs 
are shown in Tables ACC, ACD, ACE, ACF, ACG and ACH. Please note that variants 
CG94946-02, CG94946-04 and CG94946-07 correspond to the probe and primer set 
. Ag3974 only. 

30 Table ACA . Probe Name Ag3605 



Primers 


Sequences 


jtength 


Start 
Position 


SEQID 
No 


Forward 


5 * -gaccccaagtcagaactgttc-3 1 


i * 


3514 


311 
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Probe |tet-5 • -attgagagcaccctggacgacctctt-3 ' -TAMRA 


26 


3553 


312 


Reverse |s ' -gaaatccttcttgacgtctgaa-3 ' 


22 


3585 


313 



Table ACB . Probe Name Ag3974 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 1 -gacaccaggatcttctttgtga-3 • 


22 


379 


314 


{Probe 


TET-5 1 -catacctgtggccagcccacaag-3 ' -TAMRA 


23 


413 


315 


[Reverse 5 1 -gagttgagcatcagctcgtt-3 » 


20 


436 


316 



Table ACC CNSjneurodegeneration_vl.O 



Tissue Name 


Rel.' 
Exp.(%) 
Ag3605, 
Run 

210997601 


Rel. 

Exp.(%) 
Ag3974, 
tvun 

212348647 


Tissue Name 


Rel. 

Exp.(%) 
Ag3605, 
Kun 

210997601 


Rpl Fvn 

Ag3974, 
Run 

212348647 


AD 1 Hippo 


12.2 

i 


28.7 


Control (Path) 3 Temporal 
Ctx 


10.9 


15.4 


AD 2 Hippo !27.2 
AD 3 Hippo ]l5.3 


36.3 

TO 9 


Control (ratn) 4 I emporai 
Ctx 

au i uccipitai ut\ 


57.0 

ZD. 1 


46.0 


;AD 4 Hippo js4.4 


21.8 


AD 2 Occipital Ctx 
(M issing) 


0.0 


0.0 . 


|AD 5 Hippo 


100.0 


80.7 


AD 3 Occipital Ctx 


12.0 


19.5 


IAD 6 Hippo 


32.3 


42.6 


AD 4 Occipital Ctx 


24.7 


21.0 


Control 2 Hippo 


37.9 


42.3 


AD 5 Occipital Ctx 


44.4 


47.6 


Control 4 Hippo 


20.7 


34.9 


AD 6 Occipital Ctx 


13.5 


14.7 


Control (Path) 3 
Hippo 


7.5 


12.4 


Control 1 Occipital Ctx 


10.0 


204 


AD 1 Temporal 
Ctx 


28.7 


32.1 


Control 2 Occipital Ctx 


51.8 


55.1 


AD 2 Temporal 
Ctx 


31.9 


31.2 


Control 3 Occipital Ctx 


17.3 


22.5 


AD 3 Temporal 
Ctx 


17.4 


20.2 


Control 4 Occipital Ctx 


14.9 


22.2 


AD 4 Temporal 
Ctx 


31.2 


24.0 


Control (Path) 1 Occipital 
Ctx 


90.8 


71.2 


AD 5 Inf 
Temporal Ctx 


87.1 


100.0 


Control (Path) 2 Occipital 
Ctx 


19.8 


17.4 


AD 5 Sup 
Temporal Ctx 


51.4 


58.6 


Control (Path) 3 Occipital 
Ctx 


7.4 


15.4 


AD 6 Inf 
Temporal Ctx 


42;9 


40.6 


Control (Path) 4 Occipital 
Ctx 


44.8 


38.7 


AD 6 Sup 


51.1 


35.1 


Control 1 Parietal Ctx 


14.6 


18.2 
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1 emporai v^ix 












Control 1 

i emporai *^ia 


17.3 


19.8 


Control 2 Parietal Ctx 


53.6 


67.4 


Control 2 

nAi*fl 1 f TV 

l emporai v^ia 


50.3 


48.3 


Control 3 Parietal Ctx 


18.6 


21.2 


Control 3 
Temporal Ctx 


23.7 


17.8 


Control (Path) 1 Parietal 
Ctx 


76.3 


51.4 


Control 3 
Temporal Ctx 


20.3 


25.0 


Control (Path) 2 Parietal 
Ctx 


36.6 


32.3 


Control (Path) 1 
Temporal Ctx 


78.5 


63.7 


Control (Path) 3 Parietal 
Ctx 


12.0 


11.9 


Control (Path) 2 
Temporal Ctx 


50.3 


43.5 

! 

a ... 


Control (Path) 4 Parietal 
Ctx 


64.2 


58.6 



Table ACD . General jscreenin&j>ancl_v 1.4 



I 
1 

Tissue Name > 
1 


lei. jRel. 
Exp.(%) |Exp.(%) 
\g3605, IAg3974, ' 
Hun iRun 
213406184 '217508632 


I 
I 

rissue Name > 
1 


lei. 

£xp.(%) 1 
\g3605. / 
flun '' 
213406184 


lei. Exp.(%) 
\g3974, Run 
217508632 


\ 

Adipose j 




"Tl.5 


Renal ca. TK- 1 0 


19.3 j 


[6.4 


... | 

Melanoma* 
Hs688(A)T 


2.6 


3.2 

i 


Bladder 


8.4 9.0 

i 


Melanoma* 
Hs688(B).T 


4.6 


i 

i< 

i 


4.2 


Gastric ca. (liver 
met.)NCl-N87 


87.7 


80.7 


Melanoma*-M14 


6.7 f 


6.4 jGastric ca. KATO III 


17.8 


17.7 


Melanoma* LOXIMV1 


4.8 


4.0 


Colon ca. SW-948 


9.2 


7.8 


Melanoma* SK-MEL- 
5 


2.6 


4.2 


Colon ca. SW480 


25.0 


32.3 


Squamous cell 
carcinoma SCC-4 


9.0 


8.4 


Colon ca.* (SW480 
met) SW620 


5.0 


4.6 


Testis Pool 


1.3 


1.1 


jColon ca. HT29 


26.8 n 


30.6 


Prostate ca.* (bone 
met) PC-3 


21.0 


24.8 


jcolonca. HCT-116 


4.9 


5.8 


Prostate Pool 


0.9 


0.8 


jColon ca. CaCo-2 


1.1 3-6 


10.4 


Placenta 


0.9 


1.3 


JColon cancer tissue 


10.2 


10.0 


Uterus Pool 


0.4 


0.4 


jColonca.SW1116 


5.1 


3.6 


Ovarian ca. OVCAR-3 


77.4 


66.9 


JColon ca. Colo-205 


1.8 


1.5 


Ovarian ca. SK-OV-3 


42.0 


36.3 


jColon ca. SW-48 


1.2 


0.7 


Ovarian ca. OVCAR-4 


9.5 


12.7 


jColon Pool 


1.9 


1.3 


Ovarian ca. OVCAR-5 


39.2 


44.4 


jsmall Intestine Pool 


0.6 


1.0 


Ovarian ca. IGROV-1 


22.1 


27.7 


jStomach Pool 


1.5 


1.2 


Ovarian ca. OVCAR-8 


18.0 


14.9 


jBone Marrow Pool 


0.6 


0.5 
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Ovary 


1 ^ 


1 O 


retal Heart 


1 A 

1.4 


1.0 


Breast ra MPF-7 


7 7 


O H 

y. / 


Heart Pool 


0.7 


0.8 


Breast ca MDA-MB- 
231 


22.7 


31.2 


Lymph Node Pool 


2.1 


2.0 


Breast ca BT S4Q 


lJ.Z. 


1 U. 1 


retal skeletal Muscle 


0.9 


0.5 


iBreast ca. T47D 


100.0 


100.0 


Skeletal Muscle Pool 


0.4 


0.5 


jBreast ca. MDA-N 


4.8 


4.2 


Spleen Pool 


0.7 


0.7 


j Breast Pool 


1.6 


1.6 


Thymus Pool 


1.8 


2.2 


Trachea 


2.8 


|2.6 


CNS cancer 
(glio/astro) U87-MG 


5.9 


6.0 


Lung 


0.2 


0.1 


CNS cancer 
(glio/astro) U-118- 
MG 


11.7 


11.2 


Fetal Lung 


11.4 


8.3 


CNS cancer 
(neuro;met) SK-N-AS 


1.2 


0.9 


jLung ca. NCI-N417 


1.4 


0.7 


CNS cancer (astro) 
SF-539 


6.7 


5.0 


i 

Lungca. LX-1 

i 


10.5 


! 1 1.0 


1CNS cancer (astro) 
SNB-75 


22.8 


32.3 


Lung ca. NCI-H146 

i 


0.1 


0 I 


CNS cancer (glio) 
SNB-19 


1j. u 


20.2 


jLungca. SHP-77 

i 


1.1 


0.8 


CNS cancer (glio) 
SF-295 


35.1 


38.2 


Lung ca. A549 


15.6 


10.4 


Brain (Amygdala) 

rOOl 


1.7 


1.3 


Lung ca. NCI-H526 


5.4 


1.6 


Brain (cerebellum) 


1.4 


1.0 


Lungca. NCI-H23 


18.9 


20.6 


Brain (fetal) 


7.0 


2.8 


Lung ca. NCI-H460 


1 1.5 


9 3 


Brain (Hippocampus) : 
Pool 


1 .0 


u.y 


Lung ca. HOP-62 


23.7 


23.0 


Cerebral Cortex Pool 


1.9 


0.9 


Lungca. NCI-H522 


1.8 


2.3 


Brain (Substantia 
nigra) Pool 


2.8 


1.7 


Liver 


0.5 


0.6 


Brain (Thalamus) 
Pool 


2.7 


1.6 


Fetal Liver 


0.8 


1-4 


Brain (whole) 


3.4 


1.1 


Liver ca. HepG2 


15.5 


12.6 


Spinal Cord Pool 


1.8 


1.4 


Kidney Pool 


1.5 


2.5 


nui&iiai vJ i u.i i\J 


ft 0 


ft A 


Fetal Kidney 


5.8 


4.6 


Pituitary gland Pool j 


0.3 


0.2 


Renal ca. 786-0 


46.3 


39.5 


Salivary Gland 


1.1 


1.3 


Renal ca. A498 


13.8 


7.9 


Thyroid (female) 


3.3 


3.7 


Renal ca. ACHN 


14.3 


15.9 


Pancreatic ca. 
CAPAN2 


23.7 : 


27.7 
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Renal ca.UO-31 


41.5 


38.7 


Pancreas Pool 


3.0 


4.1 


Table ACE. Panel 2.2 



. |Rel. Exp.(%) 
Tissue Name |Ag3605, Run 

] 173764229 


Ti^ne Name 


Rel. Exp.(%) 

Aa^fiOS Run 

173764229 


Normal Colon j4.7 


rvIUIlVjr lYlalglll ^UlyU*t JtO ) 


i no n 


Colon cancer (OD06064) j6.3 


Kidney malignant cancer 


21.6 ! 

i 


i "■ 

Colon Margin (OD06064) j2.3 

i 


Kidney normal adjacent tissue 


~ - 1 

23.2 


Colon cancer (OD06 1 59) 


2.1 


Kidney Cancer (OD04450-01) 


J J .z 


Colon Margin (OD06159) 


1.8 


Kidney Margin (OD04450-03) 


21.3 


Colon cancer (OD06297- 
04) 


2.0 


Kidney Cancer 8120613 


i 

1.1 


Colon Margin (OD06297- 
05) 


3.0 


Kidney Margin 8120614 


14.1 

... i 


CC Gr.2 ascend colon 
(OD03921) ; J * 6 


Kidney Cancer 9010320 


20.3 ! 


CC Margin (OD03921) 


jl.3 iKidney Margin 9010321 




Colon cancer metastasis 
(OD06104) 


i 

ll.l iKidney Cancer 8120607 

t 


71.7 


Lung Margin (OD06104) 


'1.0 jKidney Margin 8120608 




Colon mets to lung 
(OD04451-01) 


4.5 


Normal Uterus 


7.3 


Lung Margin (OD04451- 
02) 


6.7 


Uterine Cancer 064011 


6.9 


Normal Prostate 


2.1 


Normal Thyroid 




Prostate Cancer (OD044 1 0) 


3.9 


Thyroid Cancer 064010 


27 0 


Prostate Margin 
(OD04410) 


3.4 


Thyroid Cancer A302152 


19.1 


Normal Ovary 


7.6 


Thyroid Margin A302153 


8 1 

O. 1 


Ovarian cancer (OD06283- 
03) 


27.9 


Normal Breast 


14.0 " 


Ovarian Margin 
(OD06283-07) 


2.0 


Breast Cancer (OD04566) 


13.4 


Ovarian Cancer 064008 


16.0 


Breast Cancer 1024 


35.1 


Ovarian cancer (OD06145) 


10.1 


Breast Cancer (OD04590-01) 


31.6 


Ovarian Margin 
(OD06145) 


8.2 


Breast Cancer Mets (OD04590-03) 


8.7 


Ovarian cancer (OD06455- 
03) 


28.9 


Breast Cancer Metastasis (OD04655- 
05) 


13.3 


Ovarian Margin 


1.9 


Breast Cancer 064006 


21.5 
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(OD06455-07) 








Normal Lung 


2.9 


Breast Cancer 91 00266 


17.7 


Invasive poor diff. lung 
adeno(ODO4945-01 


9 4 


oreasi iviargin viuuzoj 


1 c c 

16.5 


Lung Margin (OD04945- 
03) 


7 9 


oicabi i_/diicer /\zii!7U /j 


IJ.Z 


Lung Malignant Cancer 
(OD03 12M 


7.5 


Breast Margin A2090734 


35.4 


1 lino Mnrcrin fOnfni?^ 




breast cancer (OD06083) 


— 

24.5 


Lung Cancer (OD050 1 4A) 


17.0 


Breast cancer node metastasis 
(UDUoOoj) 


21.5 


Lung Margin (OD05014B) 


11.7 


Normal Liver 


5.0 


Lung cancer (OD0608 1 ) 


12.2 


Liver Cancer 1026 


15.5 


Lung Margin (OD06081) 


2.4 


Liver Cancer 1025 


12.2 


Luna: Cancer (OD04237- 
01) 


1.8 


Liver Cancer 6004-T 


7.8 


Lung Margin (OD04237- 
02) 


16.2 


Liver Tissue 6004-N 


6.1 


Ocular Melanoma 
Metastasis 


8.4 jLiver Cancer 6005-T 

! 


25.0 


Ocular Melanoma Margin 
(Liver) 


i 

2.9 ILiver Tissue 6005-N 

! 


12.4 


Melanoma Metastasis 4.0 ILiver Cancer 064003 


12.9 


Melanoma Margin (Lung) 


3.8 


Normal Bladder 


14.2 


Normal Kidney 


10.9 


Bladder Cancer 1023 


9.5 


Kidney Ca, Nuclear grade 2 
(OD04338) 


35.4 


Bladder Cancer A302173 


12.2 


Kidney Margin (OD04338) 


20.4 


Normal Stomach 


8.7 


Kidney Ca Nuclear grade 
1/2 (OD04339) 


52.9 


Gastric Cancer 9060397 


8.9 


Kidney Margin (OD04339) 


16.6 


Stomach Margin 9060396 


7.4 


Kidney Ca, Clear cell type 
(OD04340) 


16.6 


Gastric Cancer 9060395 


7.0 


Kidney Margin (OD04340) 


7.4 


Stomach Margin 9060394 


7.5 


Kidney Ca, Nuclear grade 3 
(OD04348) 


11.2 


Gastric Cancer 064005 


6.9 



Table ACF. Panel 4.1D 





Rel. 


Rel. 




Rel. 


Rel. 


Tissue Name 


Exp.(%) 


Exp.(%) 




Exp.(%) 


Exp.(%) 


Ag3605, 


Ag3974, 


Tissue Name 


Ag3605, 


Ag3974, 




Run 


Run 




Run 


Run 




169943454 


170739806 




169943454 


170739806 
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Secondary Thl act 


1.0 


1.2 


HUVEC IL-lbeta 


15.6 


18.9 


Secondary Th2 act 


5.1 


8.0 


HUVEC IFN gamma 


12.9 


16.7 


Spp Ann on; Ti*l 

ocwonaary i r 1 act 




3.5 


HUVEC TNF alphas 
IFN gamma 


37.6 


34.9 


octuiiudiy 1 11 1 rcbi 


u.u 


0.7. 


HUVEC TNF alpha + 
IL4 


31.4 


31.4 


Secondary Th2 rest 


0.4 


0.2 (HUVEC IL-ll 


14.9 


13.9 

. I 


Secondary Trl rest 


|0.4 


1 2 iLung Microvascular EC 
•none 


< 

79.0 


! 

100.0 


Primary Thl act 


i 

3.6 


3.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


100.0 


97.9 


Primary Th2 act 


1.1 


2.0 


Microvascular Dermal 
EC none 


49.7 


48.3 


Primary Trl act 


3.4 


2.9 


Micros vasular Dermal 
EC TNFalpha + IL-lbeta 


56.6 


47.0 


Primary Thl rest 


0.9 


0.4 


Bronchial epithelium 
TNFalpha+ ILlbeta 


78.5 


90.1 


jPrimarv Th2 rest 

! 


0.5 


0.2 


Small airway epithelium 
none 


31.0 


32.5 


i ! 
jPrimary Trl rest |0.2 

I 1 


0 ^ ISmall airway epithelium 
' jTNFalpha+IL-lbeta 


81.8 


93.3 
28.5 


CD45RA CD4 
lymphocyte act 


43.5 


22.7 


Coronery artery SMC 
rest 


17.2 


CD45RO CD4 
lymphocyte act 


5.0 


5.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


22.2 


28.7 


CD8 lymphocyte act 


3.9 


3.3 


Astrocytes rest 


80.1 


55.1 


Secondary CDS 
lymphocyte rest 


3.7 


3.3 


Astrocytes TNFalpha + 
IL-lbeta 


82.9 


66.4 


Secondary CD8 
lymphocyte act 


3.3 


3.5 


KU-812 (Basophil) rest 


2.6 


1.9 


CD4 lymphocyte 
none 


0.3 


0.1 


KU-812 (Basophil) 
PMA/ionomycin 


0.6 


2.8 


2ry 

Thl/Th2/Trl anti- 
CD95CH11 


0.3 


0.4 


CCD1106 

(Keratinocytes) none 


70.7 


82.4 


LAK cells rest 


5.0 


6.4 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


77.4 


72.7 


LAK cells IL-2 : 


2.8 


1.7 


Liver cirrhosis 


13.2 


14.4 


LAK cells IL-2+IL- 
12 


1.4 


1.8 


NCI-H292 none 


57.8 


54.0 


LAK cells IL-2+IFN . 
gamma 


2.3 




NCI-H292 IL-4 i 


52.4 


78.5 
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LAK ceils IL-2+ IL- 
18 



LAK cells 
PMA/ionomycin 

NK Cells IL-2 rest 



2.9 



6.8 



1.6 



4.6 



NCI-H292 IL-13 



NCI-H292IL-9 



61.6 



53.6 



79.6 



59.9 



1.6 



1.9 



NCI-H292 I FN gamma 



67.4 



71.7 



Two Way MLR 3 day 1 10.7 



12.4 



HPAEC none 



21.5 



Two Way MLR 5 day 



Two Way MLR 7 day 



PBMC rest 



PBMC PWM 



PBMC PHA-L 



Ramos (B cell) none 



6.5 



4.3 
0.0 



5.0 



5.5 



0.4 



21.3 



5.3 
4~0~ 



HPAEC TNF alpha + 1L- 
1 beta 



Lung fibroblast none 



0.6 
4.9 



Lung fibroblast TNF 
alpha + 1L- 1 beta 



Lung fibroblast 1L-4 



3.4 



0.4 



Lung fibroblast ILr9 



Lung fibroblast IL-K 



37.6 
22A 
71.2 



45.4 
]29.3" 
87.7 



16.2 



31.9 



30.4 



18.7 



36.6 



Ramos (B cell) 
ionomycin 



B lymphocytes PWM 



B lymphocytes 
CD40L and IL-4 



EOL-I dbcAMP 



0.2 



2.4 



0.2 



Lung fibroblast I FN 
gamma 



23.0 



29.7 



i.O 



Dermal fibroblast 
CCD 1070 rest 



15.9 



27.2 



1.5 



3.4 



EOL-1 dbcAMP 
PMA/ionomycin 



Dendritic cells none 



Dendritic cells LPS 



Dendritic cells anti- 
CD40 



3.1 



18.3 



14:8 



48.3 



1.7 



Dermal fibroblast 
CCD 1070 TNF alpha 



115.9 



20.6 



8.0 



9.0 



32.8 



9.7 



Monocytes rest 



Monocytes LPS 



Macrophages rest 



Macrophages LPS 



HUVEC none 



HUVEC starved 



0.9 



8.8 



Dermal fibroblast 
CCD 1070 1L-1 beta 



jl7.2 



'22.4 



Dermal fibroblast I FN 



Dermal fibroblast IL-4 



Dermal Fibroblasts rest 



Neutrophils TNFa+LPS 



1.4 



66.4 



162 



54.7 



9.3 



14.4 



81.2 



9.7 



43.8 



12.6 



25.2 



7.2 



7.2 



4.3 



0.0 



Neutrophils rest 



Colon 



Lung 



Thymus 



Kidney 



Table ACG . Panel CNS_1 



0.2 



10.3 



8.0 



6.3 



0.9 



7.0 



23.3 



5.8 



23.2 



1.0 
5.8 



23.3 



7.3 



33.2 



Tissue Name 


Rel. Exp.(%) 
Ag3605, Run 
171648697 


Tissue Name 


Rel. Exp.(%) 
Ag3605, Run 
171648697 


BA4 Control 


23.8 


BA17PSP 


21.9 


BA4 Control2 


44.4 


BA17PSP2 


14.4 


BA4 Alzheimer*s2 


7.5 


Sub Nigra Control 


37.9 
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BA4 Parkinson's 


66.0 


Sub Nigra Control2 


31.0 


BA4 Parkinson's2 


80.7 


Sub Nigra Alzheimer's2 


26.4 


BA4 Huntington's 


23.5 


Sub Nigra Parkinson's2 


80 1 


BA4 Huntington's2 


49.3 


Sub Nigra Huntington's 


76.3 


BA4 PSP 


19.5 


Sub Nigra Huntington's2 


29.7 


BA4 PSP2 


34.9 


Sub Nigra PSP2 


11. 1 


BA4 Depression 


20.6 


Sub Nigra Depression 


34.4 


BA4 Depression2 


21.3 


Sub Nigra Depression2 


18.8 j 


BA7 Control 


53.2 jdob Palladus Control 


40.3 


BA7 Control 


47.6 


Glob Palladus ControI2 J35.8 j 


BA7 Alzheimer's2 


13.6 


Glob Palladus Alzheimer's 


20.0 j 


BA7 Parkinson's 


39.8 


Glob Palladus Alzheimer's2 


21.8 ' 


BA7 Parkinson's2 


60.7 


Glob Palladus Parkinson's 


— 1 

100.0 j 


BA7 Huntington's 


41.8 


Glob Palladus Parkinson's2 


25.0 


BA7 Huntington's2 


62.9 


Glob Palladus PSP 


17.9 


BA7PSP 


36.1 


Glob Palladus PSP2 


. j 

7.2 I 


BA7 PSP2 [25.0 


Glob Palladus Depression 


!:>.:> 


BA7 Depression 120.0 


Temp Pole Control 


15.j 


BA9 Control (36.6 


Temp Pole Control2 


/o.a 


BA9 Contro!2 -83.5 


Temp Pole Alzheimer's 


1 /I "5 \ 


BA9 Alzheimer's 


17.1 


Temp Pole Alzheimer's2 


14.7 


BA9 Alzheimer*s2 


34.4 


Temp Pole Parkinson's 


76.3 


BA9 Parkinson's 


70.7 


Temp Pole Parkinson's2 


77 4 


BA9 Parkinson's2 


74.2 


Temp Pole Huntington's 


39.8 


BA9 Huntington's 


55.5 


Temp Pole PSP 


7.4 


BA9 Huntington's2 


45.1 Temp Pole PSP2 


8.1 


BA9 PSP 


28.7 


Temp Pole Depression2 


31.6 


BA9 PSP2 


8.2 jCing Gyr Control 


82.4 


BA9 Depression 


18 0 |Cing Gyr Control2 


82.4 


BA9 Depression2 


0.0 JCing Gyr Alzheimer's 


27.4 


BA 17 Control 


74.7 jCing Gyr Alzheimer's2 


36.3 


BA17Control2 


86.5 jCing Gyr Parkinson's 


46.3 


BA17AIzheimer's2 


20.3 jCing Gyr Parkinson's2 


42.6 


BA 17 Parkinson's 


75 .3 jCing Gyr Huntington's 


70.7 


BA17Parkinson's2 


85.3 jCing Gyr Huntington's2 


37.6 


BA17 Huntington's 


47.0 CingGyrPSP 


21.6 


BA17Huntington's2 ; 


26.6 ]CingGyrPSP2 


13.9 


BA 17 Depression ; 


24.8 |Cing Gyr Depression : 


21.3 
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BA17 Depression2 



41.2 



Cing Gyr Depression2 



Table ACH , General oncology screening panel_v_2.4 



Tissue Name 


Rel: Exp.(%) 
Ag3605, Run 
260268655 


Tissue Name 


Rel. Exp.(%) 
Ag3605, Run 
260268655 


Colon cancer 1 


11.6 


Bladder cancer NAT 2 


0.2 


Colon cancer NAT ] 


5.3 ^Bladder cancer NAT 3 


1.7 


Colon cancer 2 


18.6 


Bladder cancer NAT 4 


1.8 


Colon cancer NAT 2 


3.2 


Adenocarcinoma of the prostate 1 


2.4 


Colon cancer 3 


22.2 


Adenocarcinoma of the prostate 2 


2.6 


Colon cancer NAT 3 


7.7 


Adenocarcinoma of the prostate 3 


5.2 


Colon malignant 
cancer 4 


23.5 


Adenocarcinoma of the prostate 4 


21.9 


Colon normal adjacent 
tissue 4 


2.0 


Prostate cancer NAT 5 


5.4 


Lung cancer 1 


57.0 [Adenocarcinoma of the prostate 6 


4.4 


Lung NAT 1 


31 lAdcnocarcinoma of the prostate 7 


3.8 


Lung cancer 2 j 1 00.0 j Adenocarcinoma of the prostate 8 


1.8 j 


Lung NAT 2 


3.9 jAdcnocarcinoma of the prostate 9 


10.7 ! 


Squamous cell 
carcinoma 3 


41.2 


Prostate cancer NAT 10 


.... , 1 

0.7 


Lung NAT 3 


1.7 


Kidney cancer 1 


no T 

j8.7 


metastatic melanoma 
1 


5.4 


KidneyNAT 1 


20.6 


Melanoma 2 


3.2 


Kidney cancer 2 


66.4 


Melanoma 3 


3.7 


Kidney NAT 2 


39.8 


metastatic melanoma 
4 


12.5 


— — — 1 111 !1T 

Kidney cancer 3 


64.6 


metastatic melanoma 
5 


7.9 


Kidney NAT 3 


19.6 


Bladder cancer 1 


0.5 


Kidney cancer 4 


16.4 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


15.6 


Bladder cancer 2 


6.0 







CNS_neurodegeneration_vl.O Summary: Ag3605/Ag3974 Two experiments 
with two different probe and primer sets confirm the expression of this gene at moderate 
levels in the brains of an independent group of individuals. However, no differential 
expression of this gene was detected between Alzheimer's diseased postmortem brains and 
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those of non-demented controls in this experiment. Please see Panel 1.4 for a discussion of 
the potential utility of this gene in treatment of central nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3605/Ag3974 Two experiments 
with the same probe and primer set produce results that are in excellent agreement. The 
5 expression of the CG94946-01 gene appears to be highest in a sample derived from a 

breast cancer cell line (T47D) (CTs=22.5-25.3). In addition, there appears to be substantial 
expression in other samples derived from breast cancer cell lines, ovarian cancer cell lines, 
kidney cancer cell lines, lung cancer cell lines, colon cancer cell lines and brain cancer cell 
lines. Thus, the expression of this gene could be used to distinguish T47D cells from other 

10 samples in the panel. Moreover, therapeutic modulation of this gene, through the use of 
small molecule drugs, protein therapeutics, or antibodies could be of benefit in the 
treatment of breast, ovarian, kidney, lung, colon or brain cancer. 

Among metabolic tissues, this gene has low-to-moderate levels of expression in 
adrenal, pituitary, adult and fetal heart, adult and fetal liver, adult and fetal skeletal 

15 muscle, and adipose. This gene product has high levels of expression (CT values = 27) in 
pancreas and thyroid. Thus, this gene product may be important for the pathogenesis, 
diagnosis, and/or treatment of metabolic and endocrine diseases, including obesity. Types 
1 and 2 diabetes and thyroidopathies. In support of this hypothesis, decreased glomerular 
expression of agrin has been observed in diabetic nephropathy (Yard BA, Exp Nephrol 

20 2001;9(3):214-22). 

In addition, this gene is expressed at moderate levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. The CG94946-01 gene encodes a protein 
with homology to agrin, a neuronal aggregating factor that induces the aggregation of 

25 acetylcholine receptors and other postsynaptic proteins on muscle fibers and is crucial for 
the formation of the neuromuscular junction. Agrin also plays an important role in 
defining neuronal responses to excitatory neurotransmitters both in vitro and in vivo 
(Hilgenberg LG,Mol Cell Neurosci 2002 Jan;19(l):97-1 10 and Bixby JL, J Neurobiol 
2002 Feb 5;50(2): 164-79). The CG5984M)1 gene expression in the central nervous 

30 system is consistent with the hypothesis that this protein may have similar functions as 
agrin. Therefore, this gene may play a role in central nervous system disorders such as 
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Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 

Agrin has also been implicated in the formation of senile plaques in Alzheimer's 
disease and in the acetylcholine synapse/neuromuscular junction (van Horssen J, Acta 
Neuropathol (Berl) 2001 Dec;102(6):604-14). In addition, an agrin minigene rescued 
dystrophic symptoms in a mouse model of muscular dystrophy (Moll J, Nature. 2001 Sep 
20;413(6853):302-7). Therefore, this gene product may be used as a treatment or cure for 
congenital muscular dystrophies. Furthermore, this gene product is also an excellent drug 
target in AD or in any disease involving the neuromuscular junction or the acetylcholine 
system. 

Panel 2.2 Summary: Ag3605 Expression of the CG94946-01 gene is highest in a 
sample of normal kidney (CT = 27.4). In addition, expression of this gene appears to be 
upregulated in a number of ovarian and renal cancers when compared to the matched 
control margins. Thus, expression of this gene could be used as a marker for ovarian and 
renal carcinoma. Furthermore, therapeutic modulation of the activity of this gene or its 
protein product, using small molecule drugs, antibodies or protein therapeutics, could be 
of benefit in the treatment of renal and ovarian cancer. 

Panel 4.1D Summary: Ag3605/Ag3974 Two experiments with two different 
probe and primer sets produce results that are in very good agreement. Highest expression 
of the CG94946-01 gene is highest in lung microvascular endothelial cells (CTs=27.3- 
28.5), microvascular dermal endothelial cells, mucoepidermoid cell line NCI-H292, 
astrocytes, and keratinocytes. This gene encodes a protein with homology to agrin. 
Recently, it has been demonstrated that agrin, an aggregating protein crucial for formation 
of the neuromuscular junction, is also important for T cell signaling in the immune system 
(Khan AA, Science 2001 Jun l;292(5522):1681-6). In additin, agrin has been identified as 
a potential disease target for autoimmune disorders at the neuromuscular junction, 
including multiple sclerosis (Liyanage Y, Muscle Nerve 2002 Jan;25(l):4-16)Therefore, 
small molecule drug, antibody or protein therapeutics designed against the protein 
encoded by the CG59841-01 gene could reduce or inhibit inflammation in asthma, 
emphysema, allergy, psoriasis, muscular dystrophy and multiple sclerosis. 

Panel CNS_1 Summary: Ag3605 This panel confirms the expression of the 
CG94946-01 gene at low levels in the brains of an independent group of individuals. 
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Please see Panel L4 for a discussion of the potential utility of this gene in treatment of 
central nervous system disorders 

General oncology screening panel_v_2.4 Summary: Ag3605 Highest 
expression of the CG94946-01 gene is seen in lung cancer (CT=26.8). In addition, higher 
5 levels of expression are seen in lung and kidney cancers when compared to expression in 
normal adjacent tissue. Thus, expression of this gene could be used as a marker of lung 
and kidney cancers. Furthermore, therapeutic modulation of the expression or function of 
this gene product may be useful in the treatment of lung and kidney cancers. 

1 0 AD. CG95165-01: Adenylate Cyclase, Type II 

Expression of gene CG95 165-01 was assessed using the primer-probe set Ag3991 , 
described in Table ADA. Results of the RTQ-PCR runs are shown in Tables ADB, ADC 
and ADD. 

Table ADA , Probe Name Ag3991 



1 

jPrimers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


j Forward 


5 1 -aacgtggcatcctgaagagt-3 ' 


20 


3261 


317 


■Probe 


TET-5 ' -caccttcattttggcaagaagactgt-3 ' -TAMRA 


26 


3281 


318 


(Reverse 


5 ' -gttgcagtcagaaagtgtgtga- 3 • 


22 


3322 


319 



15 Table ADB. CNS neurodegeneration yl.O 



Tissue Name 


Rel. Exp.(%) 
Ag399l, Run 
212349022 


Tissue Name 


Rel. Exp.(%) 
Ag3991, Run 
212349022 


AD 1 Hippo 


27.9 


Control (Path) 3 Temporal Ctx 


15.9 


AD 2 Hippo 


46.0 


Control (Path) 4 Temporal Ctx 


35.4 


AD 3 Hippo 


14.9 


AD 1 Occipital Ctx 


26.2 


AD 4 Hippo 


13.8 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


61.6 


AD 3 Occipital Ctx 


14.9 


AD 6 Hippo 


97.9 


AD 4 Occipital Ctx 


31.2 


Control 2 Hippo . 


43.8 


AD 5 Occipital Ctx 


64.6 


Control 4 Hippo 


26.1 


AD 6 Occipital Ctx 


26.4 


Control (Path) 3 Hippo 


15.9 


Control 1 Occipital Ctx 


9.6 


AD 1 Temporal Ctx 


39.0 


Control 2 Occipital Ctx 


71.2 


AD 2 Temporal Ctx 


55.9 


Control 3 Occipital Ctx 


21.9 


AD 3 Temporal Ctx 


15.0 


Control 4 Occipital Ctx 


16.5 


AD 4 Temporal Ctx 


33.2 


Control (Path) 1 Occipital Ctx 


81.2 
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AD 5 Inf Temporal Ctx 1 


00.0 c 


:ontrol (Path) 2 Occipital Ctx 9 


.3 


AD 5 Sup Temporal 
Ctx 


50.7 C 


:ontrol (Path) 3 Occipital Ctx 7 


A 


AD 6 Inf Temporal Ctx : 


W.7 C 


:ontrol (Path) 4 Occipital Ctx 9 


.6 


AD 6 Sup Temporal ( 
Ctx 


XL1 C 


:ontrol 1 Parietal Ctx 1 


5.0 


IControl 1 Temporal Ctx 


15.1 [Control 2 Parietal Ctx 5 


6.3 


'Control 2 Temporal Ctx < 


54.2 Control 3 Parietal Ctx 2 


3.2 


-Control 3 Temporal Ctx \ 


25.0 iControl (Path) 1 Parietal Ctx S 


)9.3 


Control 3 Temporal Ctx 


] 5.5 -Control (Path) 2 Parietal Ctx 2 


»8.3 


Control (Path) I 
Temporal Ctx 


65.1 < 


Control (Path) 3 Parietal Ctx 1 


1.6 


Control (Path) 2 
Temporal Ctx 


43.5 jControl (Path) 4 Parietal Ctx ' 


14.4 


Table ADC- Panel 4. ID 


\ 

Tissue Name 

• 

1 

i 


Rel. Exp.(%) | 
Ag3991. Run 
170739808 | 

i 


Tissue Name 


Rel. 

Exp.(%) 
Ag3991, 
Run 

170739808 


Secondary Th 1 act 


0.0 HUVEC IL-lbeta 


0.0 1 


|SecondaryTh2 act 


0.0 .HUVEC I FN gamma 


0.0 


jSecondary Trl act 


1 .5 |HU VEC TNF alpha + I FN gamma 


4.5 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC L q 
TNFalpha + IL-lbeta j ' 


Primary Thl rest 


0.0 


Bronchial epithelium TNFalpha + 
ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium TNFalphi 
+ IL-lbeta 


l 0.0 


CD45RACD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC TNFalpha + 
IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 Astrocytes rest 


37.4 
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Secondary CD8 
Ivmohocvte rest 


0.0 


Astrocytes TNFalpha + IL-1 beta 


15.8 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) , 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) none 

t 


0.0 


i-jjTi. IV V^CIlb I Col 


0 0 (CCD 1 1 06 (Keratinocytes) 
!TNFalpha + lL-lbeta 


n (\ 


LAK cells IL-2 


0.0 [Liver cirrhosis 


2.8 


LAK cells IL-2+IL- 12 


0.0 jNCi-H292 none 


1.8 


LAK cells IL-2+IFN 
gamma 


0.0 ' !nCI-H292 IL-4 

1 i 


0.0 


LAK cells 1L-2+ IL-18 


0.0 


NCI-H292 IL-9 


0.0 


| 00 |nCI-H29 2I L.,3 


0.0 


NK Cells IL-2 rest k).0 {NCI-H292 IFN gamma 


0.0 


Two Way MLR 3 day 10.0 |HPAEC none 


0.0 ~ ~ ! 

i 


Two Way MLR 5 day ;0.0 |HPAEC TNF alpha + IL-1 beta 


0.0 i 


Two Way MLR 7 day .0.0 !Lung fibroblast none 


0.0 


PBMC rest 


0 q jLung fibroblast TNF alpha + IL- 1 
jbeta 


i 

u.u 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


2.8 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD 1 070 rest 


o n 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast CCD 1 070 TNF 
alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1 070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


o.2 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


42.3 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


39.5 


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


2.0 


Monocytes LPS 


0.0 


Colon 


17.7 
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Macrophages rest 


2.4 


Lung 


8.8 


Macrophages LPS 


0.0 


Thymus 


15.2 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 






Table ADD. General oncology screening panel v 2.4 


1 

Rel. Exp.(%) 
Tissue Name j Ag399 1 , Run 
•268168218 


Tissue Name 


Rel. 

Exp.(%) 
Ag3991, 
Run 

268168218 


Colon cancer 1 


0.2 


Bladder cancer NAT 2 


0.1 


Colon cancer NAT 1 


1.1 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 |0.3 


Bladder cancer NAT 4 


0.9 


Colon cancer NAT 2 !o.5 


Adenocarcinoma of the prostate 1 


7.0 


Colon cancer 3 0.6 


Adenocarcinoma of the prostate 2 


0.9 


Colon cancer NAT 3 ; 1 .4 


Adenocarcinoma of the prostate 3 


4.8 


Colon malignant cancer » j 
4 


Adenocarcinoma of the prostate 4 


0.8 


Colon normal adjacent . Art ' . . M1XC 
■ . „ J 0.0 i Prostate cancer NAT 5 
tissue 4 • 


2.0 


Lung cancer 1 ,0.1 


Adenocarcinoma of the prostate 6 J 1 .0 


Lung NAT 1 


0.1 


Adenocarcinoma of the prostate 7 j 1 .4 


Lung cancer 2 


0.2 


Adenocarcinoma of the prostate 8 


0.9 


Lung NAT 2 


0.1 


Adenocarcinoma of the prostate 9 


8.3 


Squamous cell 
carcinoma 3 


1.7 


Prostate cancer NAT 10 


0.5 


Lung NAT 3 


0.2 


Kidney cancer 1 


0.6 


metastatic melanoma 1 


3.8 


KidneyNAT 1 


0.4 


Melanoma 2 


2.3 


Kidney cancer 2 


100.0 


Melanoma 3 


2.0 


Kidney NAT 2 


0.0 


metastatic melanoma 4 


4.6 


Kidney cancer 3 


0.1 


metastatic melanoma 5 


1.6 


Kidney NAT 3 


0.1 


Bladder cancer 1 


0.3 


Kidney cancer 4 


0.6 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


0.4 


Bladder cancer 2 


1.2 







CNS_neurodegeneration_vl.O Summary: Ag3991 This panel confirms the 



expression of the CG95 165-01 gene at low levels in the brains of an independent group of 
individuals, but no differential expression of this gene was detected between Alzheimer's 
5 diseased postmortem brains and those of non-demented controls in this experiment. 
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However, expression in the brain suggests that this gene may play a role in central 
nervous system disorders such as Parkinson's disease, epilepsy, multiple sclerosis, 
schizophrenia and depression. 

The CG95 165-01 gene codes for a homologue of rat adenylyl cyclase. In 
5 Drosophila, mutations in adenylyl cyclase and other members of cAMP pathway have 
been shown to reduce the ability of fruit flies to learn or to remember (Waddell S and 
Quinn WG, 2001, Trends Genet 17(12):7 19-26, PMID: 1 1718926). In addition, 
derangement of second messenger, adenylyl cyclase, system closely parallels ischemic 
neuronal damage and persistent enhancement of this cAMP signaling pathway is important 
10 for neuronal survival in acute cerebral ischemia (Tanaka K, 2001, Prog Neurobiol 

65(2): 173-207, PMID: 1 1403878). Therefore, therapeutic modulation of adenylyl cyclase 
encoded by this gene may be useful in the treatment of learning disorders and acute 
cerebral ischemia. 

General_screening_panel_vl. 4 Summary: Ag3991 Results from one 

15 experiment with the CG95 165-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Panel 4.1D Summary: Ag3991 Highest expression of the CG95165-01 gene is 
detected in kidney (CT=32). In addition, moderate to low expression of this gene is seen in 
thymus, colon, dermal fibroblast and astrocytes. Thus, expression of this gene can be used 

20 to distinguish these samples from other samples used in this panel. In addition, therapeutic 
modulation of this gene may be beneficial in the treatment of autoimmune and 
inflammatory diseases that affect brain, colon and kidney, including inflammatory bowel 
diseases, lupus and glomerulonephritis. 

General oncology screening panei_v_2.4 Summary: Ag3991 Highest 

25 expression of the CG95 165-01 gene is detected in kidney cancer (CT=27.8). Interestingly, 
expression of this gene is higher in cancer sample compared to the adjacent control sample 
(CT=40). In addition, moderate expression of this gene is seen in prostate adenocarcinoma 
and melanoma. This expression is low/undetectable in the normal samples used in this 
panel. Therefore, therapeutic modulation of this gene product through the use of small 

30 molecule drug or antibodies could be beneficial in the treatment of melanoma, prostate 
and kidney cancers. 
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AE. CG95175-01: Ephrin Type-A Receptor 7 Precursor 

Expression of gene CG95 175-01 was assessed using the primer-probe sets Ag3992 
and Ag612, described in Tables AEA and AEB. Results of the RTQ-PCR runs are shown 
in Table AEC. 

Table AEA . Probe Name Ag3992 



Primers 


Sequences 


Length 


Start 
Position 


SEQIDNo 


Forward 


5 1 -accactatggtgaggctacaga-3 ' 


22 


2427 


320 


Probe 


TET-5 ' -ctatgggccgctcccgagacact-3 1 -TAMRA 


23 


2466 


321 


Reverse 


5 1 -agagctgaagtggccaaact-3 1 


20 


2491 


322 



Table AEB . Probe Name Ag612 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 ' -gccgctcccgagacactt-3 1 


18 


2472 


323 


Probe 


TET-5 1 -ccacttcagctctgccagtgacgtg-3 ' -TAMRA 25 


2498 


324 


Reverse 


5 1 -cccacatgatgatgccgaa-3 • I 19 


2529 


325 



Table AEC. Panel 4D 



Tissue Name 


;ReI. Exp.(%) 
;Ag612, Run 
145645058 


Tissue Name 


Rel. Exp.(%) 
Ag6I2, Run 
145645058 


Secondary Th 1 act 


0.0 


HUVEClL-lbeta 


0.0 


Secondary Th2 act 


12.2 


HUVEC I FN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL- 11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium TNFalpha 
+ ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium I 
TNFalpha + IL-lbeta ] 


4.6 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest j 


0.0 
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CD45ROCD4 
lymphocyte act 


4.6 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


0.0 


CD8 lymphocyte act 


5.3 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


5.3 


Astrocytes TNFalpha + IL-lbeta 


0.0 


Secondary CDS 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 




0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1 106 (Keratinocytes) none 


0.0 


LAK cells rest 


9.7 


CCD1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


1.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


61.6 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


5.1 


NCI-H292 none 


32.5 


LAK cells IL-2+IL- 18 


0.0 


NCI-H292 IL-4 


46.7 
58.6 


LAK cells j 
PMA/ionomycin j 


NCI-H292 JL-9 


NK Cells IL-2 rest 10.0 


NCI-H292 IL-I3 


89.5 


Two Way MLR 3 day lO.O 


NCI-H292 I FN gamma 


100.0 


Two Way MLR 5 day jO.O 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + 1L-1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF alpha + IL- 
1 beta 


0.0 


PBMC PHA-L 


8.9 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL- 1 3 


9.0 


B lymphocytes PWM 


8.5 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1070 
TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast CCD1070 IL-1 
beta 


0.0 


Dendritic cells none 


5.3 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells LPS 


4.5 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti-CD40 


0.0 


IBD Colitis 2 


10.7 
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ivionocyies rest 




idu v^runn s 


n n 


Monocytes LPS 


1.2 


Colon 


51.8 


Macrophages rest 


0.0 


Lung 


24.7 


Macrophages LPS 


0.0 


Thymus 


4.1 


HUVEC none 


0.0 


Kidney 


4.6 


HUVEC starved 


0.0 







CNS_ncurodegeneration_vl.O Summary: Ag3992 Expression of the CG951 75- 
01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 



General_screening_panel_vl.4 Summary: Ag3992 Expression of the CG95175- 
5 01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

Panel 4.1D Summary: Ag3992 Expression of the CG95 175-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
Panel 4D Summary: Ag612 Highest expression of the CG95 175-01 gene is 
10 detected in IFN gama treated NCI-H292 cells (CT=33). Moderate to low expression of this 
gene is also seen in cytokine treated and untreated NCI-H292 cells, liver cirrhosis and 
colon tissue samples. Therefore, expression of this gene can be used to distinguish these 
samples from other samples used in this panel. In addition, therapeutic modulation of this 
gene can be used for the treatment of chronic obstructive pulmonary disease, asthma, 
15 allergy, and emphysema, liver cirrhosis, autoimmune and inflammatory disease affecting 
colon including Crohn's disease and ulcerative colitis. 

Panel CNS_1 Summary: Ag3992 Expression of the CG95 175-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel _v_2.4 Summary: Ag3992 Expression of the 
20 CG95 175-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

AF. CG95693-01 : DPY-19 Protein I Like Protein 

Expression of gene CG95693-01 was assessed using the primer-probe set Ag4026, 
25 described in Table AFA. Results of the RTQ-PCR runs are shown in Tables AFB, AFC, 
AFD and AFE. 

Table AFA . Probe Name Ag4026 
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Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 • -tggcattctaacagtgatgtca-3 1 


22 


54 


326 


Probe 


TET-5 ' -tgcaaacctctgtaatcaatggagca-3 ' -TAMRA 


26 


87 


327 


Reverse 


5 • -aaagttcttcctgaggcaaatc- 3 ' 


22 


130 


328 



Table AFB . CNS_neurodegencration_vl.O 



Tissue Name 


ReK Exp.(%) 
Ag4026, Run 
212391720 


Tissue Name 


Rel. Exp.(%) 
Ag4026, Run . 
212391720 


AD 1 Hippo 


7.6 


Control (Path) 3 Temporal Ctx 


2.0 


AD 2 Hippo 


24.0 


Control (Path") 4 Temooral Ctx 

V V-/ 1 1 11 KJ I 1 A ULI J J T I will YJ\J I CI 1 V^* C 


27.7 


AD 3 Hippo 


12.4 


AD 1 Occipital Ctx 


15.3 


AD 4 Hippo 


9.3 


AD 2 Occioital Ctx (Missing 


0.0 


AD 5 hippo 


77.4 


AD 3 Occipital Ctx 


4.8 


AD 6 Hippo 


38.7 


AD 4 Occipital Ctx 


24.5 


Control 2 Hippo 


11.2 


AD 5 Occipital Ctx 


23.8 


Control 4 Hippo 


23.8 


AD 6 Occipital Ctx 


30.6 


Control (Path) 3 Hippo 


12.8 


Control 1 Occipital Ctx 


1 6 


AD 1 Temporal Ctx 


12.2 


Control 2 Occipital Ctx 


47.0 


AD 2 Temporal Ctx 


26.4 


Control 3 Occipital Ctx 


13.2 


au .) I emporai ctx 


5.6 


Control 4 Occipital Ctx 


7.9 


AD 4 Temporal Ctx 


12.5 


Control (Path) 1 Occipital Ctx 


52.1 


AD 5 Inf Temporal Ctx 


84.7 


Control (Path) 2 Occipital Ctx i 


13.4 


AD 5 SupTemporal Ctx 


52.5 


Control (Path) 3 Occipital Ctx 1 


4.0 


AD 6 Inf Temporal Ctx 


55.5 


Control (Path) 4 Occipital Ctx 


14.7 


AD 6 Sup Temporal Ctx 


100.0 


Control 1 Parietal Ctx 


5.1 


Control 1 Temporal Ctx 


7.9 


Control 2 Parietal Ctx 


47.6 


Control 2 Temporal Ctx 


18.0 


Control 3 Parietal Ctx 


6.6 


Control 3 Temporal Ctx 


12.9 


Control (Path) 1 Parietal Ctx ! 


51.1 


Control 4 Temporal Ctx 


6.0 


Control (Path) 2 Parietal Ctx 


17.9 


Control (Path) 1 Temporal 
Ctx 


47.0 


Control (Path) 3 Parietal Ctx j 


4.8 


Control (Path) 2 Temporal 
Ctx 


22.8 


Control (Path) 4 Parietal Ctx 


33.4 



Table AFC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4026, Run 
218425757 


Tissue Name 


Rel. Exp.(%) 
Ag4026, Run 
218425757 


Adipose 


0.9 


Renal ca.TK- 10 


12.7 
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Melanoma* Hs688(A)T 


0.6 


Bladder 


7.6 


Melanoma* Hs688(B).T 


0.9 


Gastric ca. (liver met.) NCI-N87 


14.1 


Tk AT _ 1 *u % M * A 

Melanoma* Ml 4 


2.5 


Gastric ca. KATO III 


31.0 


Melanoma* LOXIMVI 


5.4 


Colon ca. SW-948 


4.3 


Melanoma* SK-MEL-5 


16.8 


Colon ca. SW480 


49.0 


Squamous cell carcinoma 
SCC-4 


0.8 


Colon ca * mptl ^w^fl 


25.9 


i estis Pool 


91.4 


jColon ca. HT29 


i 

1.6 


Prostate ca.* (bone met) PC- 
3 


5.6 


r * u ' — 

1 

Colon ca. HCT-116 


0.9 




Prostate Pool 


2.6 


^oion ca. \-.3\^o-z 


6.3 


Placenta 


0.4 


Colon cancer tissue 


20.4 


Uterus Pool 


0.8 


■ ■ - 

Pnlnn Q\A/ ] 1 1 /C 
v>UlUU Cd. oW I 1 io 


0.9 


Ovarian ca. OVCAR-3 




Colon ca. Colo-205 


6.2 


Ovarian ca. SK-OV-3 


31.2 


Colon ca. SW-48 


6.5 


Ovarian ca. OVCAR-4 |2.0 
i — — — L — 


Colon Pool 


5.5 


Ovarian ca. OVCAR-5 


10.2 


Small Intestine Pool 


7.2 




Ovarian r»5i IfJPOW 1 
kJ vdl tall Cd.. 1VJ i\\J V - I 


11.6 


Stomach Pool 


3.8 


Ovarian ca. OVCAR-S 


3.8 Bone Marrow Pool 


0.0 




Ovary 


2.5 J 


Fetal Heart 


5.2 




Breast ca. MCF-7 


12.4 


Heart Pool 


2.1 


Breast ca. MDA-MB-23 1 


10.9 


Lymph Node Pool 


7.7 


Breast ca. BT 549 


23.7 


Fetal Skeletal Muscle 


,.9 


Breast ca. T47D 


10.2 


Skeletal Muscle Pool 


2.0 


Breast ca. MDA-N 


7.2 


Spleen Pool j 


6.2 


Breast Pool 


7.6 jThymus Pool 


7.6 


Trachea 


8.4 ] 


CNS cancer (glio/astro) U87-MG j 


7.6 


Lung 


5 ~) |CNS cancer (glio/astro) U-l 1 8- 
jMG 


27.9 


Fetal Lung 


1 5 6 cancer (neuro;met) SK-N- 
|AS 


6.8 


Lungca.NCI-N417 


0.4 |CNS cancer (astro) SF-539 


10.7 


Lung ca. LX-1 


29.9 |CNS cancer (astro) SNB-75 


60.7 


Lungca. NCI-H146 


18 jCNS cancer (glio) SNB-19 


14.9 


Lung ca. SHP-77 


1 .6 |CNS cancer (glio) SF-295 


100.0 


Lung ca. A549 j 


59 jBrain (Amygdala) Pool 


4.8 


Lungca.NCI-H526 ; 


2-1 jBrain (cerebellum) 


0.4 


Lung ca. NCI-H23 ; 


52.9 1 


3rain (fetal) 


3.8 


Lung ca. NCI-H460 


3.8 I 


3rain (Hippocampus) Pool < 


5.2 
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Lung ca. HOP-62 


2.4 


Cerebral Cortex Pool 


6.0 


Lungca.NCI-H522 


4.2 


Brain (Substantia nigra) Pool 


6.6 


Liver 


0.0 


Brain (Thalamus) Pool 


8.8 


Fetal Liver 


14.8 


Brain (whole) 


2.9 


Liver ca. HepG2 


1.8 


Spinal Cord Pool 


9.3 


Kidney Pool 


7.9 


Adrenal Gland 


1.6 


Fetal Kidney 


jl0.6 


Pituitary gland Pool 


3-4 ! 


Renal ca. 786-0 


|14.4 [Salivary Gland 


0.7 | 


Renal ca. A498 


3.3 |Thyroid (female) 


1 

2.5 


Renal ca. ACHN 


3.1 


Pancreatic ca. CAPAN2 


9.0 


Renal ca. UO-3 1 


2.9 Pancreas Pool 


5.1 


Table AFD, Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4026, Run 
171613290 


Tissue Name 


Rel. Exp.(%) 
Ag4026, Run 

l MUI JZ7U i 


Secondary Thl act 


1.3 


HUVEClL-lbeta 


0.7 | 


Secondary Th2 act |4.2 


HUVEC I FN gamma 


0.9 


Secondary Trl act 
Secondary Th I rest 


2.0 
2.9 


HUVEC TNF alpha + IFN 
gamma 

HUVEC TNF alpha + IL4 


1.2 j 

o.o ; 


Secondary Th2 rest 


3.3 


HUVEC 1L-11 


0.0 j 


Secondary Trl rest 


2.6 


Lung Microvascular EC none 


i.o 


Primary Thl act 


3.4 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.0 


Primary Th2 act 


9.2 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


3.9 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


1.1 


Bronchial epithelium TNFalpha + 
ILlbeta 


0.5 


Primary Th2 rest 


0.5 


Small airway epithelium none 


0.0 


Primary Trl rest 


2.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


1.5 


CD45RA CD4 lymphocyte 
act 


7.3 


Coronery artery SMC rest 


0.0 


CD45ROCD4 lymphocyte 
act 


7.7 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


0.4 


CD8 lymphocyte act 


8.1 


Astrocytes rest 


2.0 


Secondary CD8 lymphocyte 
rest 


2.9 


Astrocytes TNFalpha + IL- 1 beta 


1.4 


Secondary CD8 lymphocyte , 
act 


2.7 


KU-812 (Basophil) rest 


74.2 
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CD4 lymphocyte none 


1.3 


KU-812 fBasonhin 
PMA/ionomycin 


100.0 


2ryThinWTrl anti-CD95 
CH11 


7.6 


K^y^LJi iuo ^iveraunocyiesj none 


"} 7 


LAK cells rest 


3.4 


CCDl 1 06 (Keratinocytes) 
I Nr alpha + IL-1 beta 


2.8 


LAiSw cells 




Liver cirrhosis 


a a 

0.0 


I AI/ oilier 11 04-11 10 
LAIS. ceilS IL-_^1L-1/ 


Z.J. 


INLI-H292 none 


3.2 

A A 

4.0 


lak cells iL-z-ririN gamma 


5.0 

5.7 


NC I-H292 IL-4 


1 A IS s% A ltr~ II O-i- 11 1 0 

LAK cells IL-Z-r IL-Io 


NCI-H292 IL-9 


3.0 


LAK cells PMA/ionomycin 


1.4 


NCI-H292 IL-13 


3.0 


NK Cells IL-2 rest 


12.9 


NCI-H292 I FN gamma 


2.5 


Two Way MLR 3 day 


4.8 


HPAEC none 


0.5 


Two Way MLR 5 day 


2.1 


HPAEC TNF alpha + IL-1 beta 


1.8 


Two Way MLR 7 day 


3.1 


Lung fibroblast none 


0.0 


PBMC rest 


0.5 


Lung fibroblast TNF alpha + IL-1 

i <. 
beta 


1.4 


PBMC PWM 


5.6 


Lung fibroblast IL-4 


0.4 


PBMC PHA-L 


5.3 


Lung fibroblast IL-9 


1.0 


Ramos (B cell) none j23.0 


Lung fibroblast IL-13 


2.2 


Ramos (B cell) ionomycin |24.0 


Lung fibroblast I FN gamma 


2.3 


B lymphocytes PWM 


3.3 


Dermal flhrnhlfiQt PPni07D rpsf 


j .j 


B lymphocytes CD40L and 
IL-4 


12.2 


normal fihrnhlfl<;t CCT}]()1() TNF 

JL/CIIIlal HUlUUldSl ^V./U 1 U / v I INT 

alpha 


12.9 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCDl 070 IL-1 
beta 


U.U 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


1.4 


Dendritic cells none 


1.0 


Dermal fibroblast IL-4 


1.2 


Dendritic cells LPS 


0.8 


Dermal Fibroblasts rest 


3.7 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNra+LPS 


A A 

0.0 


Monocytes rest 


0.6 


Neutrophils rest 


A A 

0.0 


Monocytes LPS 


0.0 


Colon 


11.7 


Macrophages rest 


4.0 


Lung 


6.6 


Macrophages LPS 


0.0 


Thymus 


22.4 


HUVEC none 


0.9 


Kidney 


49.3 


HUVEC starved 


1.7 






Table AFE. General oncology screening panel v 2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4026, Run 


Tissue Name 


Rel. Exp.(%) 
Ag4026, Run 
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f (268362901 




268362901 


(Colon cancer 1 |23.8 


Bladder cancer NAT 2 


0.6 


[Colon NAT 1 jo.7 


Bladder cancer NAT 3 


0.0 


j^oioii cancer z 


|3.9 


Bladder cancer NAT 4 


2.2 


p^/Uiuii cancer jna i z 


J2.0 


Adenocarcinoma ot tne prostate 1 


30.8 


jL>oion cancer 3 


|63.3 


/\aeiiocarcmoma ot ine prostate z 


1 *5 


^uion cancer in a i j 


|4.9 


Adenocarcinoma or tne prostate j 


j. 5 

t 


v^uiuii iiidiignani cancer 4 


16.8 


Adenocarcinoma oi tne prostate 4 


38.7 


^uion normal aojacent tissue 

4 ' 


1.2 


Prostate cancer NAT 5 


0.4 


Lung cancer 1 


13.9 


Adenocarcinoma of the prostate 6 


1.6 


Lung NAT 1 


0.9 


Adenocarcinoma of the prostate 7 


0.4 


Lung cancer 2 


16.3 


Adenocarcinoma of the prostate 8 


O.O 


Lung NAT 2 


0.3 


Adenocarcinoma of the prostate 9 


6.1 


Squamous cell carcinoma 3 


4.0 


Prostate cancer NAT 1 0 


0.0 


Lung NAT 3 


0.5 
30.1 


Kidney cancer I 


12.8 


jmetastatic melanoma 1 


KidneyNAT 1 


5.0 


Melanoma 2 [o.O 


Kidney cancer 2 


100.0 


Melanoma 3 


1.5 


Kidney NAT 2 |6.3 


metastatic melanoma 4 


12.4 


Kidney cancer 3 


52.9 


metastatic melanoma 5 


26.8 


Kidney NAT 3 


4.2 


Bladder cancer 1 


0.9 


Kidney cancer 4 


12.1 


Bladder cancer NAT 1 


O.O 


Kidney NAT 4 


2.6 


Bladder cancer 2 


1.0 







CNS_neurodegeneration_vl.O Summary: Ag4026 This panel confirms the 
expression of the CG95693-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 



5 experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 

treatment of central nervous system disorders. 

General_screeningjpanel_vl.4 Summary: Ag4026 Highest expression of the 

CG95693-01 gene is detected in CNS cancer SF-295 cell line (CT=28.3). Significant 

expression of this gene is associated with cluster of cancer cell lines including CNS, colon, 
0 renal, lung, breast, ovarian, pancreatic, melanoma and prostate cancer cell lines. 

Therefore, therapeutic modulation of this gene could be beneficial in the treatement of 

these cancers. 
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Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adrenal gland, thyroid, pituitary gland, skeletal muscle, 
heart, fetal liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

Interestingly, this gene is expressed at much higher levels in fetal (CT=3 1 ) when 
compared to adult liver (CT=40). This observation suggests that expression of this gene 
can be used to distinguish fetal from adult liver. In addition, the relative overexpression of 
this gene in fetal liver suggests that the protein product may enhance liver growth or 
development in the fetus and thus may also act in a regenerative capacity in the adult. 
Therefore, therapeutic modulation of the protein encoded by this gene could be useful in 
treatment of liver related diseases. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease. Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4026 Highest expression of the CG95693-01 gene is 
detected in PMA/ionomycin treated basophils (CT=3 1). In addition, this gene is expressed 
to a lesser extent in untreated KU-812 cells. Therefore, antibody or small molecule 
therapies designed with the protein encoded for by this gene could block or inhibit 
inflammation or tissue damage due to basophil activation in response to asthma, allergies, 
hypersensitivity reactions, psoriasis, and viral infections. 

Moderate to low levels of expression of this gene is also detected in Ramos B cells, 
activated lymphocytes, IL-2 treated LAK cells, TNF alpha treated dermal fibroblasts, and 
normal tissues respresented by colon, lung, thymus and kidney. Therefore, modulation of 
the gene product with a functional therapeutic may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 
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General oncology screening panel_v_2.4 Summary: Ag4026 In this panel the 
highest expression of the CG95693-01 gene is detected in a kidney cancer sample 
(CT=31.4). Interestingly, expression of this gene is higher in kidney, prostate, and lung 
cancer samples as compared to adjacent control samples as well as the melanoma samples 
in this panel. Therapeutic modulation of this gene, through the use of small molecule 
drugs, protein therapeutics or antibodies could be of benefit in the treatment of melanomas 
as well as kidney, prostate and lung cancers. 



10 



AG. CG95814-01 : Rho-GTPase-Activating Protein 4 

Expression of gene CG95814-01 was assessed using the primer-probe set Ag4031, 
described in Table AGA. Results of the RTQ-PCR runs are shown in Tables AGB, AGC, 
AGD, AGE and AGF. 

Table AGA . Probe Name Ag4031 



Primers 


Sequences 


(Length 


Start 
Position 


SEQ ID ! 

No I 


Forward 


5 ' -aaccaatgcatctgtcttcaag-3 1 


! 22 


1053 


329 1 


Probe 


TET-5 ' ^tccatgacctatctgaccttattgatca-3 


• -tamra; 28 


1082 


330 ! 


Reverse 


5 ' -tgcatggtagcctaagtcaca-3 ■ 


| 21 


1 114 


331 | 



Table AGB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4031,Run 
212396115 


Tissue Name 


Rel. Exp.(%) 
Ag4031,Run 
212396115 


AD 1 Hippo 


18.2 


Control (Path) 3 Temporal Ctx 


10.4 


AD 2 Hippo 


42.0 


Control (Path) 4 Temporal Ctx 


29.3 


AD 3 Hippo 


13.2 . 


AD 1 Occipital Ctx 


25.9 


AD 4 Hippo 


11.4 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


96.6 


AD 3 Occipital Ctx 


7.7 


AD 6 Hippo 


83.5 


AD 4 Occipital Ctx 


19.9 


Control 2 Hippo 


42.9 


AD 5 Occipital Ctx 


21.8 


Control 4 Hippo 


20.4 


AD 6 Occipital Ctx 


55.9 


Control (Path) 3 Hippo 


5.0 


Control 1 Occipital Ctx 


6.5 


AD 1 Temporal Ctx 


26.2 


Control 2 Occipital Ctx 


68.3 


AD 2 Temporal Ctx 


29.1 


Control 3 Occipital Ctx 


15.1 


AD 3 Temporal Ctx 


8.9 


Control 4 Occipital Ctx 


10.9 


AD 4 Temporal Ctx 


0.0 


Control (Path) I Occipital Ctx i 


88.3 


AD 5 Inf Temporal Ctx 


100.0 Control (Path) 2 Occipital Ctx |6.5 
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au 3 bup i emporai ctx 


Oi.l 


Control (ratn) 3 Occipital Ctx 


O.O 


au o mi 1 emporai ctx 


/l.Z 


control (ratn ) 4 uccipital ctx 


zu.y 


AD 6 Sup Temporal Ctx 


66.9 


Control 1 Parietal Ctx 


7.4 


Control 1 Temporal Ctx 


5.8 


Control 2 Parietal Ctx 


48.6 


Control 2 Temporal Ctx 


47.6 


Control 3 Parietal Ctx 


13.4 


Control 3 Temporal Ctx 


12.9 


Control (Path) 1 Parietal Ctx 


68.3 


Control 4 Temporal Ctx 


9.9 


Control (Path) 2 Parietal Ctx 


15.4 


Control (Path) 1 Temporal 
Ctx 


61.1 


Control (Path) 3 Parietal Ctx 


6.7 


Control (Path) 2 Temporal 
Ctx 


26.2 


Control (Path) 4 Parietal Ctx 


45.7 



Table AGC General_screening_panel vl.4 



Ticcnp Mnmp 

1 IjjIIC lNulllC 


Rel. Exp.(%) 
218525121 




Rel. Exp.(%) 

AoAH^I Run 
M.g'fUJ) 1 , ixun 

218525121 


Adipose 


10.7 


Renal ca. TK- 10 


34.9 j 


Melanoma* Hs688(A).T 


28.1 


Bladder 


13.4 | 


ivieianoma risooo^Dj.i 




Gastric ca. (liver met.) NCI- 
N87 


ZJ.U J 


Melanoma* M 1 4 


50.3 


Gastric ca. KATO 111 


27.9 i 


Melanoma* LOXIMVI 


25.9 


Colon ca. SW-948 


1.9 


Melanoma* SK-MEL-5 


66.0 


Colon ca. SW480 


66.0 


Squamous cell carcinoma 
SCC-4 


14.3 


Colon ca.* (SW480 met) 
SW620 


14.1 


Testis Pool 


4.5 


Colon ca. HT29 


6.1 


Prostate ca.* (bone met) 
PC-3 


34.4 


Colon ca.HCT-1 16 


26.8 


Prostate Pool 


3.1 


Colon ca. CaCo-2 


6.8 


Placenta 


15.6 


Colon cancer tissue 


8.0 


Uterus Pool 


4.5 


Colon ca. SW1116 


1.4 


Ovarian ca. OVCAR-3 


15.8 


Colon ca. Colo-205 


3.5 


Ovarian ca. SK-OV-3 


34.6 


Colon ca. SW-48 


3.4 


Ovarian ca. OVCAR-4 


15.7 


Colon Pool 


12.9 . 


Ovarian ca. OVCAR-5 


25.3 


Small Intestine Pool 


9.6 


Ovarian ca. IGROV-1 


28.9 


Stomach Pool 


5.8 


Ovarian ca. OVCAR-8 


14.7 


Bone Marrow Pool 


3.2 


Ovary 


9.8 


Fetal Heart 


8.2 


Breast ca. MCF-7 


17.1 


Heart Pool 


3.9 


Breast ca. MDA-MB-231 


55.1 


Lymph Node Pool 


8.4 
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Breast ca. BT 549 


42.6 


Fetal Skeletal Muscle 


12.3 


Breast ca. T47D 


43.8 


Skeletal Muscle Pool 


3.7 


Breast ca. MDA-N 


26.8 


Spleen Pool 


16.8 


Breast Pool 


10.2 


Thymus Pool 


8.8 


Trachea 


6.7 


CNS cancer (glio/astro) U87- 
MG 


"1*7 A 

37.4 


Luna 

i ~ 


3.1 


CNS cancer (glio/astro) U-l 1 8- 
MG 


48.0 


j 

jFetal Lung 


37.4 


CNS cancer (neuro;met) SK-N- 
AS 


30.1 


Lung ca. NCI-N417 


4.6 


CNS cancer fastrcA SF-539 


45 1 


jLung ca. LX-1 


15.5 


CNS cancer Castro^ SNB-75 


83 5 


Lung ca. NCI-HI 46 


7.0 


CNS cancer (elio^ SNB-19 


32.5 


iLun^ca SHP-77 


73.2 


CNS cancer folio^ SF-295 


46.7 


iLung ca. A549 


20.7 


Rrain ^ Anwodala^ Pnol 

Ul Hill 1 /VI 1 1 J gURIH I 1 V-/vjf 1 


14.5 


Lun<* ca NCI-H526 


4.2 


Rrain fpprphplliiiYi^ 




'Luntr ca NCI-I-P3 




Rrain tTpral , i 
lji d ill ^ itiai j 




Lun^ca NCI-H460 


°0 0 


Rrain f Hinnnramnn^ Ppjfi! 


16 3 


>Lun» ca HOP-62 


?4 1 


Cprphra! Cnrtpx Ponl 


15 5 


Lun« ca NCI-H5'>2 


31 0 


Rrain i^nrKtantia niora^ PriPil 

Lfl alll ^OUL/oLCtllLICI 1 1 1 I a j I \J\J \ 


18 0 


I 1 

Liver 


0.9 


Rrain fThalamiK^ Pnol 

lii alii y i iiciiaiiiLijy i i 


23.7 


Fetal Liver 

ft \* ^ J w i ▼ W 1 


12.6 


Rrain fwhnle^ 

uiaiii i »¥iiviv / 


32 J 


Liver ca HenG2 


15 4 


Sninal Cord Pool 

OL/II IQI V_^V_Jl VJ 1 \J\J I 


18 9 


Kidnev Poo! 


18 0 


AHrpnal CiIanH 


93 


Fetal Kidney 


13.7 


Pituitary gland Pool 


2.2 


Renal ca. 786-0 


54.0 


Salivary Gland 


i.i 


Renal ca. A498 


8.0 


Thyroid (female) 


2., 


Renal ca. ACHN 


19.1 


Pancreatic ca. CAPAN2 


31.9 


Renal ca. UO-31 


24.5 


Pancreas Pool 


8.5 


Table AGD, Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4031, Run 
171616912 


Tissue Name 


Rel. Exp.(%) 
Ag4031,Run 
171616912 


Secondary Thl act 


49.3 


HUVEClL-lbeta 


49.7 


Secondary Th2 act 


100.0 


HUVEC I FN gamma 


43.5 


Secondary Trl act 


62.0 


HUVEC TNF alpha + 1FN 
gamma 


32.5 


Secondary Thl rest 


13.2 


HUVEC TNF alpha +IL4 


28.3 


Secondary Th2 rest 


13.9 


HUVEC IL- 11 


25.2 
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Secondary Trl rest 


11.9 


Lung Microvascular EC none 


55.9 


Primary Thl act 


14.2 


Lung Microvascular EC 
TNFalpha+IL-lbeta 


31.9 


Primary Th2 act 


23.0 


Microvascular Dermal EC 
none 


. 4z.o 


Primary Trl act 


22.5 


Microsvasular Dermal EC 
TNFalpha+IL-lbeta 


27.5 


Primary Till rest 


13 6 


Bronchial epithelium 
TNFalpha + ILlbeta 


|19.8 

i 


Primary Th2 rest 


9.2 


Small airway epithelium none 


6.9 


Primary Trl rest 


16 3 


Small airway epithelium 
TNFalpha + IL-lbeta 


13.3 


CD45RA CD4 lymphocyte 
act 


26.8 


v^uiuiici y dricry oiviv^ rest 


11.2 


CD45RO CD4 lymphocyte 
act 


22 7 


Coronery artery SMC 
TNFalpha + IL-lbeta 


24.7 


CD8 lymphocyte act 


19.3 


Astroc\tes rest 


38.4 


Secondary CD8 
lymphocyte rest 


14.4 


Astrocxtes TNFalpha + IL- 
lbeta 


26.8 


Secondary CD8 
lymphocyte act 


21.5 


KU-8 12 (Basophil) rest 


,3 


CD4 lymphocyte none 


4 Q 


KU-8 12 (Basophil) 
PMA/ionomycin 


_ j 

49.7 


2ryThl/Th2/Trl anti- 
CD95CH11 


21.9 


CCD 11 06 (Keratinocytes) 
none 


38.2 


LAK cells rest 


24.7 


CCD 1 1 06 (Keratinocytes) 
TNFalpha+IL-lbeta 1 


33.2 


LAK cells IL-2 


19.3 


Liver cirrhosis 


5.0 


LAK cells 11 -2+11 -17 


12.4 


NCI-H292 none 


20.6 


LAK cells IL-2+IFN 
gamma 


17.0 


NCI-H292 IL-4 


24.5 


LAK cells IL-2+ IL-I8 


22.8 


NCI-H292 IL-9 


37.1 


LAK cells PMA/ionomycin 


8.4 NCI-H292IL-13 


25.2 


NK Cells IL-2 rest 


30-1 NCI-H292 I FN gamma 


28.9 


Two Wav MI R 1 Hav 


22.2 HPAECnone 


34.9 


Two Wav MI R S rlav 


19.5 j] 


HPAEC TNF alpha + IL-1 beta < 


43.8 


Two Way MLR 7 day : 


22.5 ] 


Lung fibroblast none : 


60.0 


PBMC rest ( 


! 


Lung fibroblast TNF alpha + , 
L-l beta 


24.7 


PBMC PWM 


13.7 ] 


^ung fibroblast IL-4 


13.6 


PBMC PHA-L 


19.6 ] 


„ung fibroblast IL-9 : 


>3.0 


Ramos (B cell) none : 


>0.2 I 


-ung fibroblast IL-1 3 


14.9 
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Ramos (B cell) ionomycin 


23.7 


Lung fibroblast IFN gamma 


24.8 


B lymphocytes PWM 


22.7 


Dermal fibroblast CCD1070 
rest 


37.9 


B lymphocytes CD40L and 
IL-4 


99 7 


Dermal fibroblast CCD1070 
TNF alpha 


<A "7 
->4. / 


EOL-1 dbcAMP 


25.9 


Dermal fibroblast CCD 1070 
1L-I beta 


21.5 


tvJU-l aDCAlvir 
PMA/ionomycih 


27.2 


Dermal fibroblast IFN gamma 


29.7 


Dendritic cells none 


36.1 


Dermal fibroblast IL-4 


46.3 


Dendritic cells LPS 


76.8 


Dermal Fibroblasts rest 


26.4 


Dendritic cells anti-CD40 


32.5 


Neutrophils TNFa+LPS 


2.6 


Monocytes rest 


33.4 


Neutrophils rest 


9.7 


Monocytes LPS 


42.0 


Colon 


4.7 


Macrophages rest 


18.6 


Lung 


27.0 


Macrophages LPS 


26.8 


Thymus 


20.6 


HUVEC none 


37.1 


Kidney 


16.6 


HUVEC starved 


45.1 







Table AGE. Panel CNS 1 



Tissue Name 


Rel. Exp.(%) 
Ag403K Run 
180912028 


Tissue Name 


Rel. Exp.(%) 
Ag4031. Run 
180912028 


BA4 Control 


24.1 


BA17PSP 


30.1 


BA4 Control2 


50.7 


BA17PSP2 


7.1 


BA4 Alzheimer's2 


3.5 


Sub Nigra Control 


37.9 


BA4 Parkinson's 


68.3 


Sub Nigra ControI2 


34.4 


BA4 Parkinson's2 


75.8 


Sub Nigra Alzheimei J s2 


25.5 


BA4 Huntington's 


42.3 


Sub Nigra Parkinson's2 


77.9 


BA4 Huntington's2 


6.5 


Sub Nigra Huntington's 


100.0 


BA4PSP 


8.2 


Sub Nigra Huntington's2 


69.7 


BA4 PSP2 


30.4 


Sub Nigra PSP2 


12.8 


BA4 Depression 


49.7 


Sub Nigra Depression 


20.7 


BA4 Depression2 


15.5 


Sub Nigra Depression2 


14.9 


BA7 Control 


31.9 


Glob Palladus Control 


20.2 


BA7 Control2 


31.6 


Glob Palladus Control2 


15.1 


BA7 Alzheimer's2 


7.4 


Glob Palladus Alzheimer's 


18.6 


BA7 Parkinson's 


27.5 


Glob Palladus Alzheimer's2 


8.3 


BA7 Parkinson's2 


47.6 


Glob Palladus Parkinson's 


81.8 


BA7 Huntington's 


75.3 


Glob Palladus Parkinson's2 


19.9 
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RA7 WiintinrrtruVeO 

Dr\ 1 ouniingion sz 


4o.O 


Glob Palladus PSP 


9.5 


RA7 P^P 

LfJTX I I Of 


JO./ 


Glob Palladus PSP2 


8.0 


RA7 PSP? 


Z.3.J 


Glob Palladus Depression 


10.2 


RA7 DpnrPQcirin 


i i a 


I emp Pole Control 


12.4 


RAO Pontrnl 


lO A 


1 emp role Control2 


40.1 


RAO Cnntrnl? 


OJ.J 


Temp Pole Alzheimer's 


5.5 


BA9 Al7lieimer r Q 




Temp Pole Alzheimer's2 


6.1 


RAQ A^hpimprV 


I 7.6 


Temp Pole Parkinson's 


35.1 


RAQ Parkincnn'c 


47 £ 


I emp Pole Parkinson s2 


38.4 


RAO Parkincnn'c9 
\jt\y ralKlilbUII 2>Z 


^A 1 


Temp Pole Huntington s 


49.0 


RAO J-Ii intinrrt/^rk'c 

Orvy jnunungion s 




Temp Pole PSP 


3.3 


RAO Hnntt n citrm'cO 

D/A7 nuiiuiigiun sz 


11.8 


Temp Pole PSP2 


5.3 


RAO PSP 

da 7 ror 


15.4 


Temp Pole Depress ion2 


1 1.2 


RAO PSP? 


4.4 


Cing Gyr Control 


57.8 


DA7 repress ion 


10.7 


Cing Gyr Control2 


32.1 


RAO TlAnropcirtnO 

D/\y L/epressionz 

r 


8.2 


Cing Gyr Alzheimer's 


20.9 


IRA I 7 PnntrrJ 


44.8 jCing Gyr AIzheimer's2 


i 

10.6 


R A 1 7 PontrnIO 

da j / ^oniroiz 


46.0 JCing Gyr Parkinson's 


38.7 


iRA17 A I'yh^im^r'e') 


12.6 jCing Gyr Parkinson's2 


46.0 


R A 1 7 Part' incnn'c 

d/\ i / rarKinson s 


40.9 


Cing Gyr Huntington's 


76.3 


BA17Parkinson's2 


57.8 


Cing Gyr Huntington's2 


38.2 


BA17 Huntington's 


44.4 


Cing Gyr PSP 


26.2 


BA17Huntington f s2 


24.7 


Cing Gyr PSP2 


9.5 


BA1 7 Depression 


12.7 


Cing Gyr Depression 


10.2 


BA17 Depression2 


32.5 I 


Cing Gyr Depression2 


19.6 


Table AGF. Genera 


oncology screening panel_v_2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4031,Run 
268362913 


Tissue Name 


Rel. Exp.(%) 
Ag4031,Run 
268362913 


Colon cancer 1 


13.4 


Bladder cancer NAT 2 


1.0 


Colon NAT 1 


2.2 


B ladder cancer NAT 3 


1.3 


Colon cancer 2 


27.4 


B ladder cancer NAT 4 


3.8 


Colon cancer NAT 2 


5.6 


Adenocarcinoma of the 
prostate 1 


9.3 


Colon cancer 3 


15.2 


Adenocarcinoma of the 
prostate 2 


2.1 


Colon cancer NAT 3 


15.8 


Adenocarcinoma of the 
prostate 3 


9.0 


Colon malignant cancer 4 


35.4 


Adenocarcinoma of the 


8.8 
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prostate 4 




Colon normal adjacent 
tissue 4 


2.7 


Prostate cancer NAT 5 


6.3 


Lung cancer 1 


13.9 


Adenocarcinoma of the 
prostate 6 


3.1 


Lung NAT 1 


4.3 


Adenocarcinoma of the 
prostate 7 


6.1 


Lung cancer 2 


100.0 


Adenocarcinoma of the 
prostate 8 


1.2 


Lung NAT 2 


9.3 


Adenocarcinoma of the 


25.2 


Squamous cell carcinoma 3 


29.5 


Pmctntp panrpr 1\IAT 1ft 


1 7 


Lung NAT 3 


1.6 


K. irlnpv rancer 1 

l V1U1 l^j v>Clltl*WI 1 


34.2 


metastatic melanoma 1 


20.2 


KidnevNAT 1 


19 1 


Melanoma 2 


8.5 


Kidney cancer 2 


42.0 


Melanoma 3 


8.0 


Kidney NAT 2 


38.4 


metastatic melanoma 4 j93 .3 


Kidney cancer 3 


33.4 


metastatic melanoma 5 j65.1 


Kidney NAT 3 


19.1 


Bladder cancer 1 1.6 


Kidney cancer 4 




Bladder cancer NAT 1 jO.O 


Kidney NAT 4 |8.4 


Bladder cancer 2 |6.0 





CNS_neurodegeneration_vl.O Summary: Ag4031 This panel confirms the 
expression of the CG95814-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
5 Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screeningjpanel_vl.4 Summary: Ag4031 Highest expression of the 
CG95814-01 gene is detected in brain (cerebellum) sample (CT=25.6). High expression of 
10 this gene is seen in all the region of the central nervous system examined, including 
amygdala, hippocampus, substantia nigra, thalamus, cerebellum, cerebral cortex, and 
spinal cord. Therefore, this gene may play a role in central nervous system disorders such 
as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 

1 5 The CG958 1 4-0 1 gene codes for a homolog of RHO-GTPase activating protein 

(RHOGAP). RHOGAP stimulats the GTPase activity of Rho protein. Since Rho proteins 
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have been implicated in ischemia (Trapp et al., 2001, Mol Cell Neurosci 17(5):883-94, 
PMID: 1 1358485), inhibitors for the RHOGAP encoded by this gene may have a 
therapeutic utility in the treatment of pathological aspects following brain damage 
including neuronal death. 
5 High expression of this gene is also observed in cluster of cancer cell lines 

including pancreatic, CNS, colon, renal lung, breast, ovarian, prostate, squamous cell 
carcinoma, and melanoma cancer cell lines. Therefore, therapeutic modulation of this gene 
can be useful in treatment of these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 

1 0 to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

Interestingly, this gene is expressed at much higher levels in fetal (CT=27-28) 

15 when compared to adult lung and liver (CTs=30-32). This observation suggests that 
expression of this gene can be used to distinguish fetal from adult lung and liver. In 
addition, the relative overexpression of this gene in fetal tissue suggests that the protein 
product may enhance growth or development of lung and liver in the fetus and thus may 
also act in a regenerative capacity in the adult. Therefore, therapeutic modulation of the 

20 protein encoded by this gene could be useful in treatment of liver and lung related 
diseases. 

Panel 4.1D Summary: Ag403 1 Highest expression of the CG95814-01 gene is 
detected in activated secondary Th2 cells (CT=27). This gene is expressed at high to 
moderate, levels in a wide range of cell types of significance in the immune response in 

25 health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 

macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 

30 tissues. This pattern is in agreement with the expression profile in 

General_screening_panel_yl.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
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therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 
5 Panel CNS_1 Summary: Ag4031 This panel confirms the expression of the 

CG958 14-01 gene at low levels in the brains of an independent group of individuals. 
Please see Panel 1 A for a discussion of the potential utility of this gene in treatment of 
central nervous system disorders. 

General oncology screening panel_v_2.4 Summary: Ag4031 Highest 
10 expression of the CG95814-01 gene is detected in lung cancer sample (CT=27). Higher 
expression of this gene is also seen in melanoma, colon, kidney, prostate cancers and this 
expression is lower in corresponding control samples. Thus, expression of this gene can be 
used as diagnostic marker and therapeutic modulation of this gene product may be useful 
in the treatment of these cancers. 

15 

AH. CG95824-01: Rho GAP 

Expression of gene CG95824-01 was assessed using the primer-probe set Ag4032. 
described in Table AHA. Results of the RTQ-PCR runs are shown in Tables AHB, AHC, 
AHD and AHE. 

20 Table AHA . Probe Name Ag4032 



Primers 


Sequences 


I Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 ' -ctcctacctggaaggctacatc-3 ' 


| 22 ; 


2475 


332 


Probe 


TET-5 1 -atggaccccgtcaatgacactaaagg-3 1 


-tamra| 26 


2506 


333 


Reverse 


5 • -cagtacttggcatacgtgctta-3 1 


) 22 


2540 


334 



Table AHB . CNSjaeurodegenerationjvLO 





Rel. Exp.(%) 




Rel. Exp.(%) 


Tissue Name 


Ag4032, Run 


Tissue Name 


Ag4032, Run 




212396260 




212396260 


AD 1 Hippo 


9.9 


Control (Path) 3 Temporal Ctx 


5.9 


AD 2 Hippo 


14.8 


Control (Path) 4 Temporal Ctx 


27.0 


AD 3 Hippo 


5.2 


AD 1 Occipital Ctx 


14.9 


AD 4 Hippo 


3.5 


AD 2 Occipital Ctx (Missing) 


0.0 
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1UU.U 




S.2 


AHA Uinnrh ' 

IWJ O olppO j 




ADA Orrinital Ptv 


18.2 


L/Omroi z riippo ^ 




An ^ Orrinital Ctx 1 


15.4 


control *f riippo * 


> < 
C.J 


AHA OrrinitaJ Ptx 


16.0 


L^oniroi \rox\\) j riippo 




Pontml 1 Orrinital Ctx 


1.4 


AU l I emporai v_,ix 


1 ft ^ 

1 U.J 


Pnntrnl ? Orrinital Ctx 


84.7 


■ AL/ z i emporai cix 


1 Q 7 


Pr.ntrr.1 ^ Orrinital Ctx 


21.8 


AH * Tpm nr\rtx 1 Ptv , 

AU -> i emporai v_i\ * 


1 7 


Pr>ntrr»l 4 Orrinital Ctx 


1.6 


t\\j h l emporai ctx 


1 1 7 
II./ 


Prmtrnl fPathl 1 Occinital Ctx 


53.2 


AL/ d ini i emporai ctx 


73 0 


Pnntrr.1 fPatM ? Orrinital Crx 


14.2 


/\u j oiip i emporai v^tx 


11 ft 


Pnntml fPatM ^ Occinital Ctx 

\_U1IUU1 \Y duly ~J \J\^\^\y\\.<x\ v> L<\ 


33 


AU o ini 1 emporai ctx 


32.3 


Control (Path) 4 Occipital Ctx 


17 2 


AD 6 Sup Temporal Ctx 


39.2 


Control 1 Parietal Ctx 


5.6 


iControl 1 Temporal Ctx 


4.2 


Control 2 Parietal Ctx 


32.8 


Control 2 Temporal Ctx 


35.6 


Control 3 Parietal Ctx 


23.0 


jControl 3 Temporal Ctx 


15.5 


Control (Path) 1 Parietal Ctx 


62.0 


jControl 3 Temporal Ctx 


6.7 jControl (Path) 2 Parietal Ctx 


18.8 \ 

_j 


Control (Path) 1 Temporal 
Ctx 


41.8 


Control (Path) 3 Parietal Ctx 


3.0 1 

i 


Control (Path) 2 Temporal 

jCtx 


32.8 


Control (Path) 4 Parietal Ctx 


37.4 


Table AHC. General screening_panel_vl.4 


Tissue Name 


Rel. Exp.(%) 
Ag4032, Run 
218525131 


Tissue Name 


Rel. Exp.(%) 
Ag4032, Run 
218525131 


Adipose 


0.5 


Renal ca. TK-10 


22.7 


Melanoma* Hs688(A).T 


2.7 


Bladder 


4.2 


Melanoma* Hs688(B).T 


2.4 


Gastric ca. (liver met.) NCI-N87 


31.4 


Melanoma* Ml 4 


4.4 


Gastric ca. KATO III 


41.8 


Melanoma* LOXIMVI 


12.4 


Colon ca. SW-948 


8.6 


Melanoma* SK-MEL-5 


25.3 


Colon ca. SW480 


17.2 


Squamous cell carcinoma 
SCC-4 


5.0 


Colon ca.* (SW480 met) SW620 


11.3 


Testis Pool 


19.6 


Colon ca. HT29 


5.8 


Prostate ca.* (bone met) 
PC-3 


30.8 


Colon ca.HCT-1 16 


40.9 


Prostate Pool 


2.0 


Colon ca. CaCo-2 


9.7 


Placenta 


2.5 

- „ 


Colon cancer tissue 


8.8 


Uterus Pool 


0.0 


Colon ca.SW1116 


2.5 
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Ovarian ra OVPAR-T. 

VVdl I all va. v V V-^xAIV-J 


90 7 
zu. / 


coion ca. v-»oio-zi/j 


Z.J 


Ovflrifln ra ^K-OV-3 

Wuiiaii va.. oiv v v j 


16 5 

1 U.J 


v^oion ca. ow-^o 


5 9 i 
j.Z 


Ovarian ca OVCAR-4 

w entail va. v ^rnv*t 


J U* 1 


\^UIUII rUOl 


9 1 


Ovarian ca OVCAR-5 


35 1 


oiiiaii iiiicoiine ruui 


1 1 
1 . 1 


Ovarian ca IGROV-1 


4 2 


OiUIUavll ruui 


1 .o 


Ovarian ca OVCAR-8 


15.8 


Ron** IMsirmw/ Pah 1 
ij vj i j c tviai 1 vw ruui 


0 S 

U.J 


Ovary 


2 6 


Frtal Heart 
i VoLai rival I 


7 1 


Breast ca MCF-7 


37 9 


1-1 part Ponl 
rival l ruui 


i <: 
l . j 


Breast ca MDA-MB-23 1 


18 6 


L-yiupu inuuc ruui 


9 ^ 
Z.J 


Breast ca BT 549 


84 7 


rcial OKClCLal IVlUbLIC 




Breast ca T47D 


58 2 


oKciviai iviuDvie ruoi 


0 7 
U.J 


Breast ca MDA-N 


J .V/ 


opiccii ruui 


1 7 
1 . / 


Breast Pool 


j .\j 


Tlivmnc Pr\r^l 
i iiYiiiuo ruui 


7 Q 


Trachpa 

1 1 uvl j vcl 


10 2 


i^ino cancer ^giio/asuo^ uo/-ivi.vj 


0 O 


1 lin ty 


0 1 


v^ino cancer 'gi 10/ asiro^ u-i lo-ivio 




Ff*tal I liner 


1 7 0 


v^ino cancer ^neuro,mei^ orv-iN-Ao 


90 Q 
ZU.V 


1 una ra NCI-T\J4 J 7 


1 9 
j.z 


i^ino cancer ^astioj or-Djv 


J.U 


f nncr ra I Y- 1 


0 9 


l^ino cancer 'asiroj o1Nd-/j 


19 A ' 
12.6 | 


I una ca NCI-H 146 


10 4 

1 v.'-r 


viNo cancel ^gnoj oiND-iy 


1 9 n 


I una ca SHP-77 


50 0 

JU.U 


viNo cancer ^gi 10 ) or-iyj 


119 i 

1 1.2 | 


I una ra A549 


lu.O 


ordin ^/vmygaaia^ rooi 


J J.U 


I una ca NOI-H596 


1R 4 


Prom ^P0p0nolliim\ 

Drain ^cereueuum^ 


87 7 


Lunaca NP1-H23 


5 3 


Drain ^ieiai^ 


100 0 
1UU.U 


T una ra NPI-H460 


5 5 
j. J 


Drain ^riippocanipusj rooi 


jj.^f 


I una ca HOP-fi? 


S 1 
j. i 


v^ereorai v^onex. rooi 


oy.j 


Luneca NCI-H522 


33 7 
j j . / 


Dram ^oUDSianna nigra ) rooi 


j j.j 


Liver 


0 2 


DIalll y 1 UdlalUUb^ lUUl 


51 1 
j i.i 


Fetal Liver 


1.6 


Rrain Twlinlp^ 
jtjiaui ^wiiviv^ 


69 3 


Liver ca HenG2 


4 8 


opillal \^/UlU ruui 


93 9 


Kidnev Pool 


2 0 


r\UIClld.l vJlallU 


4 9 


Fetal Kidney 


15.8 


Pituitary gland Pool 


2.7 


Renal ca. 786-0 


4.5 


Salivary Gland 


2.5 


Renal ca. A498 


1.6 


Thyroid (female) 


3.1 


Renal ca. ACHN 


4.9 


Pancreatic ca. CAPAN2 


10.2 


Renal ca. UO-31 


6.5 


Pancreas Pool 


6.8 


Table AHD. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4032, Run 


Tissue Name j 


Rel. Exp.(%) 
Ag4032, Run 
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171616913 




171616913 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


11.1 


Secondary Th2 act 


1.4 


HUVEC IFN gamma 


24.7 


Secondary Trl act 


0.0 


nu v cv^ i iNr aipna ^ iri\ gamma 


1.1 


Secondary Thl rest 


0.0 


HUVFP TNF a\nha -+■ If A 


i "7 n 
1 7.9 


Secondary Th2 rest 


1.1 


HUVEC IL- 11 


14.3 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


11.0 | 


Primary Thl act 


0.0 


Lung Microvascular EC TNFalpha + 
IL-1 beta 


ZU. / 

i 


Primary Th2 act 


1.3 


Microvascular Dermal EC none 


20.2 


. Primary Trl act 


1.0 


j Micros vasular Dermal EC TNFalpha 
j+ IL-lbeta 


in o 
I u.z 


Primary Thl rest 


0.0 


[Bronchial epithelium TNFalpha + 
jl LI beta 




Primary Th2 rest 


0.0 


'Small airway epithelium none 


6.0 


Primary Trl rest 


0.0 


Small airway epithelium TNFalpha + 
IL-lbeta 


I U.J 


CD45RA CD4 lymphocyte 
act 


15.4 


Coronery artery SMC rest 


1 J.O j 


|CD45ROCD4 lymphocyte 
jact 


0 0 Coronery artery SMC TNFalpha + 
jlL-lbeta 


6.7 j 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


31.0 


Secondary CD8 
lymphocyte rest 


6.4 


Astrocytes TNFalpha + IL-1 beta 


99 9 
ZZ.Z 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


1 1 7 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) PMA/ionomycin 


14.5 


2ryThl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD1 106 (Keratinocytes) none 


40.9 


LAK cells rest 


0.0 


CCD! 106 (Keratinocytes) TNFalpha 
+ IL-1 beta 


22.1 


LAK cells IL-2 


0.0 


Liver cirrhosis 


8.8 


LAK eel Is IL-2+IL- 1 2 


0.0 


NCI-H292 none 


40.3 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


56.6 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-9 


51 8 


LAK cells 
PMA/ionomycin 


O.O 


NCI-H292 IL-13 


31.9 


NK Cells IL-2 rest 


5.0 


NCI-H292 IFN gamma 


100.0 


Two Way MLR 3 day 


5.0 


HPAEC none 


19.2 


Two Way MLR 5 day 


9.0 


HPAEC TNF alpha + IL-1 beta 


32.3 
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Two Way MLR 7 day 


0.0 


Lung fibroblast none 


■AS 1 
j j. i 


PRN/fP root 

r diviv^ rest 


1.3 


Lung fibroblast TNF alpha + IL-1 
beta 


13.1 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


17.8 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


34 ft 


Ramos (B cell) none 


10.8 


Lung fibroblast 1L-13 


36.6 


R/imnQ 1 R P<=*IM innnm\/r«in 
*\.ciiiiv/o yu HJliUIIiyCllI 


|I2.9 (Lung fibroblast I FN gamma 


31.2 


|B lymphocytes PWM 

i j - — 


0.0 


Dermal fibroblast CCD 1070 rest 


24.5 


|B lymphocytes CD40L and 
IL-4 


0.0 


Dermal fibroblast CCD 1070 TNF 
alpha 


15.7 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1070 IL-1 
beta 


12.6 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


11.0 


Dendritic cells none 


3.5 


Dermal fibroblast IL-4 


40.6 


jDendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


25.2 


(Dendritic cells anti-CD40 [0.0 


Neutrophils TNFa+LPS 


0.0 


[Monocytes rest jo.0 


Neutrophils rest 


o.o .; 


jMonocytes LPS 


0.0 jColon 


13.9 


'Macrophages rest 


0.0 t j Lung 


6.0 


jMacrophages LPS 


0.0 JThymus 


15.5 


jHUVEC none 


15.2 jKidney 


26.6 


HUVEC starved 


10.7 f~ 





lableAHE. Panel CNS 1 



Tissue Name 


Rel. Exp.(%) 
Ag4032, Run 
180912030 


Tissue Name 


Rel. Exp.(%) 
Ag4032, Run 
180912030 


BA4 Control 


19.1 


BA17PSP 


31.6 


BA4 Control2 


36.9 


BA17PSP2 


17.1 


BA4 AIzheimer's2 


13.0 


Sub Nigra Control 


10.1 


BA4 Parkinson's 


56.6 


Sub Nigra Control2 


24.3 


BA4 Parkinson's2 


88.9 


Sub Nigra Alzheimer's2 


3.8 


BA4 Huntington's 


69.3 


Sub Nigra Parkinson's2 


21.0 


BA4 Huntington's2 


11.9 


Sub Nigra Huntington's 


23.8 


BA4 PSP 


13.8 


Sub Nigra Huntington's2 


19.3 


BA4 PSP2 


29.7 


Sub Nigra PSP2 


6.6 


BA4 Depression 


14.0 


Sub Nigra Depression 


3.0 


BA4 Depression2 


13.1 


Sub Nigra Depression2 


2.8 


BA7 Control 


71.7 


Glob Palladus Control 


0.9 
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BA7 Control 


39.8 Glob Palladus Control 


0.0 


BA7 Alzheimer's2 


15.0 


Ctloh PflllaHiic A ]'?}*if*\mf*r 1 Q 
vjiuu jraiiauiia /\lZ4lCllllcr b 


2.5 


BA7 Parkinson's 


38.4 


VJIUU JTallaUUb /A.l£IlCJIHCr SZ 


2.9 


BA7 Parkinson's2 


64.6 




76.3 


BA7 Huntington's 


53.2 


VJIUU rdlldUUb rarKlliSOn SZ 


3.5 


BA7 Hnntington's2 


42.9 


Glob Palladia P<sP 


0.0 


BA7 PSP 


44.1 


Olnh PflllaHiiQ P^P? 


3.2 


BA7 PSP2 


37.6 


liloW Pilllnrliic lip nrpcci An 


0.0 


BA7 Depression 


8.8 


1 cinp i uic v^oniroi 


10.2 


BA9 Control 


17.7 


Temp Pole Control2 


36.1 


BA9 Control2 


100.0 


Temp Pole Alzheimer's 


8.7 


BA9 Alzheimer's 


7.1 


Temp Pole Alzheimer's2 


6.4 


BA9 Alzheimer's2 


25.9 


Temp Pole Parkinson's 


32.8 


BA9 Parkinson's 


65.1 
49.3 


Temp Pole Parkinson's2 


18.7 


BA9 Parkinson's2 


Temp Pole Huntington's 


31.0 ! 


BA9 Huntington's 


41.5 jTemp Pole PSP js.1 


BA9 Huntington*s2 


34.9 (Temp Pole PSP2 (3.8 


! B ^ PSP .. _l U A iTemp Pole Depression! 


5.9 


BA9 PSP2 


3 .6 |Cing Gyr Control 


37.4 


BA9 Depression 


18.8 


Cing Gyr Control2 


28.5 


BA9 Depression2 


13.7 


Cing Gyr Alzheimer's 


9.9 


BA 17 Control 


92.7 


Cing Gyr Alzheimer's2 


7.2 


BA17Control2 


57.8 


Cing Gyr Parkinson's 


26.2 


BA17 AIzheimer's2 


27.2 


Cing Gyr Parkinson's2 


8.5 


BA 17 Parkinson's 


62.9 


Cing Gyr Huntington's 


43.5 


BA17 Parkinson's2 


74.7 I 


Cing Gyr Huntington's2 


5.6 


BA1 7 Huntington's 


77.4 


Cing Gyr PSP 


2.8 


BA17Huntington's2 


26.1 | 


Cing Gyr PSP2 


1.4 


BA17 Depression 


18.4 ;< 


Cing Gyr Depression 


0.0 | 


BA17Depression2 


39.2 i 


Cing Gyr Depression2 


10.8 



CNS_neurodegeneration_vl.O Summary: Ag4032 This panel confirms the 
expression of the CG95824-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 
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General_screeningjpanel_vl.4 Summary: Ag4032 Highest expression of the 
CG95824-01 gene is detected in fetal brain (CT=29.8). High expression of this gene is 
seen all the regions of the central nervous system examined, including amygdala, 
hippocampus, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord. 
Therefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease. Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression and therapeutic modulation of this gene product may be beneficial in the 
treatment of these CNS disorders. 

The CG95824-01 gene codes for a RHO-GTPase activating protein (RHOGAP). 
RHOGAP stimulats the GTPase activity of Rho proteins. Since Rho proteins have been 
implicated in ischemia (Trapp et al., 2001, Mol Cell Neurosci 17(5):883-94, PMID: 
1 1358485), inhibitors for the RHOGAP encoded by this gene may have a therapeutic 
utility in the treatment of pathological aspects following brain damage including neuronal 
death. 

Moderate levels of expression of this gene is also seen in cluster of cancer cell 
lines including melanoma, CNS. colon, breast, ovarian, lung, gastric, renal, pancreatic and 
prostate cancer cell lines. Therefore, therapeutic modulation of this gene product through 
the use of small molecule target may be useful in the treatment of these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adrenal gland, thyroid, fetal heart, and fetal skeletal 
muscle. Therefore, therapeutic modulation of the activity of this gene may prove useful in 
the treatment of endocrine/me tabolically related diseases, such as obesity and diabetes. 

Interestingly, expression of this gene is higher in fetal (CTs=32-33) as compared to 
adult lung, heart and skeletal muscle (CTs>36). Therefore, expression of this gene can be 
used to distinguish fetal from these adult tissue samples. In addition, the relative 
overexpression of this gene in fetal suggests that the protein product may enhance growth 
or development of these tissues in the fetus and thus may also act in a regenerative 
capacity in the adult. Therefore, therapeutic modulation of the RHOGAP encoded by this 
gene could be useful in treatment of muscle, lung and heart related diseases. 

Panel 4.1D Summary: Ag4032 Highest expression of the CG95824-01 gene is 
detected in IFN gamma treated NCI-H292 cells (CT=33.4). Moderate to low expression of 
this gene is seen in NCI-H292 cells, HPAEC, lung fibroblasts, IL-4 treated dermal 
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fibroblasts. Therefore, therapeutics designed with the protein encoded by this gene may 
reduce or eliminate symptoms caused by inflammation in lung epithelia, in chronic 
obstructive pulmonary disease, asthma, allergy, and emphysema. 

Panel CNS_1 Summary: Ag4032 This panel confirms the expression of the 
CG95824-01 gene at low levels in the brains of an independent group of individuals. 
Please see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of 
central nervous system disorders. 



10 



15 



AI. CG96231-01 and CG96231-02: OTU-Like Cysteine Protease 

Expression of gene CG96231-01 and full length physical clone CG96231-02 was 
assessed using the primer-probe set Ag4043, described in Table AIA. Results of the RTQ- 
PCR runs are shown in Tables AIB, AIC, AID and AIE. Please note that CG96231-02 
represents a full-length physical clone of the CG9623 1-01 gene, validating the prediction 
of the gene sequence. 

Table AIA . Probe Name Ag4043 



jPrimers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


; Forward 


5 1 -agcgtaacttccctgatcca-3 ' 


j 20 


815 


335 


•Probe 


TET-5 1 -cacctcctctgaccattttctcctct-3 


-TAMRAj 26 


839 


336 


(Reverse 


5 ' -tcatctgctaattccagtgctt-3 ' 


22 • 


887 


337 



Table AIB , CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4043, Run 
214151952 


Tissue Name 


Rel. Exp.(%) 
Ag4043, Run 
214151952 


AD 1 Hippo 


13.8 


Control (Path) 3 Temporal Ctx 


4.5 


AD 2 Hippo 


15.4 


Control (Path) 4 Temporal Ctx 


28.3 


AD 3 Hippo 


8.6 


AD 1 Occipital Ctx 


40.6 


AD 4 Hippo 


8.1 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


98.6 


AD 3 Occipital Ctx 


10.2 


AD 6 Hippo 


72.2 


AD 4 Occipital Ctx 


19.8 


Control 2 Hippo 


20.6 


AD 5 Occipital Ctx 


41.2 


Control 4 Hippo 


8.2 


AD 6 Occipital Ctx 


31.4 


Control (Path) 3 Hippo 


6.3 


Control 1 Occipital Ctx 


3.3 


AD 1 Temporal Ctx 


16.6 


Control 2 Occipital Ctx 


43.8 


AD 2 Temporal Ctx 


19.3 


Control 3 Occipital Ctx 


11.1 


AD 3 Temporal Ctx 


8.5 


Control 4 Occipital Ctx 


7.4 
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AD 4 Temporal Ctx |20.6 


Control (Path) 1 Occipital Ctx 


100.0 


AD 5 Inf Temporal Ctx 


74.2 


Control (Path) 2 Occipital Ctx 


8.7 


AD 5 SupTemporal Ctx 


20.4 


Control (Path) 3 Occipital Ctx 


2.0 


AD 6 Inf Temporal Ctx 


87.7 


Control (Path) 4 Occipital Ctx 


12.9 


AD 6 Sup Temporal Ctx 


70.2 


Control 1 Parietal Ctx 


6.2 


Control I Temporal Ctx 

i — ~ — 


4.5 


Control 2 Parietal Ctx 


44 1 


! Control 2 Temporal Ctx 


16.7 




Control 3 Parietal Ctx 


14.3 


jControl 3 Temporal Ctx 


11.5 


^uiiLiui ^raiii^ i I ariciai l.c\ 




[Control 4 Temporal Ctx {5.0 


Control (Path) 2 Parietal Ctx 


20.6 


jControl (Path) 1 Temporal L _ , 
|Ctx i 35 ' 1 


Control (Path) 3 Parietal Ctx 


1.4 


jControl (Path) 2 Temporal L n A 

jCtx | 2 ™ 


Control (Path) 4 Parietal Ctx 


40.3 



Table AIC General_screening_panel_vL4 



i 
i 

j i issue jName 


Rel. Exp.(%) 
Ag4043, Run 
218426211 


Tissue Name 


Rel. Exp.(%) 
Ag4043, Run 


jAdipose 


4.4 


'Renal ca TK- 1 0 


I 7.0 


jMeianoma* Hs688(A).T 


5.9 


jBladder 


10 4 


jMeianoma* Hs688(B)T 


4.5 


[Gastric ca. (liver met.) NCI-N87 


33.9 


jMeianoma* M14 


8.8 


Gastric ca. KATO 111 


27.9 j 


Melanoma* LOXIMVI 


15.6 


Colon ca. SW-948 


6.5 


Melanoma* SK-MEL-5 


19.1 


Colon ca. SW480 


27.2 


Squamous cell carcinoma 
SCC-4 


13.7 


Colon ca.* (SW480 met) 
SW620 


11.0 


Testis Pool 


4.9 


Colon ca. HT29 


10.5 


Prostate ca.* (bone met) 
PC-3 . 


46.3 


Colon ca. HCT-1 16 


27.7 


Prostate Pool 


2.2 


Colon ca. CaCo-2 


18.0 


Placenta 


3.5 | 


Colon cancer tissue 


6.5 


Uterus Pool 


2.3 


Colon ca.SW1116 


2.5 


Ovarian ca. OVCAR-3 


17.3 ] 


Colon ca. Colo-205 


4.7 


Ovarian ca. SK-OV-3 


37.1 | 


Colon ca. SW-48 


4.0 


Ovarian ca. OVCAR-4 


3.7 


Colon Pool 


4.5 


Ovarian ca. OVCAR-5 


17.3 


Small Intestine Pool 


5.3 


Ovarian ca. IGROV-1 


9.7 


Stomach Pool 


5.5 


Ovarian ca. OVCAR-8 


5.0 


Bone Marrow Pool 


2.6 


Ovary 


4.3 


Fetal Heart 


5.3 


Breast ca. MCF-7 ii 


3.7 


Heart Pool j; 


2.3 
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Breast ca. MDA-MB-23 1 


29.9 


Lymph Node Pool 


6.7 


Breast ca. d i j**y 


1 A 1 

14./ 


retal Skeletal Muscle 


1.8 


Breast ca. T47D 


36.3 


Skeletal Muscle Pool 


4.4 


Breast ca. MDA-N 


4.1 


Spleen Pool 


7.9 


Breast Pool 


4.7 


Thymus Pool 


11.5 


Trachea 


6 7 


CNS cancer (glio/astro) U87- 
MG 


1 C A 

15.4 


Lung 


0.9 


CNS cancer (glio/astro) U-1 18- 
MG 


11.2 


Fetal Lung 


34.4 


CNS cancer (neuro;met) SK-N- 
AS 


16.3 


Lung ca. NCI-N417 


3.2 


CNS cancer (astro) SF-539 


5.8 


Lung ca. LX-1 


27.5 


CNS cancer (astro) SNB-75 


14.5 


Lung ca. NCI-HI 46 


5.0 


CNS cancer (glio) SNB-19 


8.9 


Lung ca. SHP-77 


12.2 


CNS cancer (glio) SF-295 


18.3 


Lung ca. A549 


14.3 


Brain (Amygdala) Pool 


4.5 


Lung ca. NCI-H526 


2.3 |Brain (cerebellum) 


4.2 : 


Lung ca. NCI-H23 


10.7 j Brain (fetal) ]8.l 


jLung ca. NCI-H460 


9.9 


Brain (Hippocampus) Pool s5.5 j 


jLung ca. HOP-62 


6.8 


Cerebral Cortex Pool ;7.6 


;Lung ca. NCI-H522 


7-3 


Brain (Substantia nigra) Pool 


8.0 


Liver 


0.5 


Brain (Thalamus) Pool 


8.8 


Fetal Liver 


100.0 


Brain (whole) 


9.3 


Liver ca. HepG2 


5.0 


Spinal Cord Pool 


8.3 


Kidney Pool 


7.6 


Adrenal Gland 


4.2 


Fetal Kidney 


7.9 


Pituitary gland Pool 


2.3 


Renal ca. 786-0 


15.9 


Salivary Gland 


1.6 


Renal ca. A498 


2.3 


Thyroid (female) 


3.5 


Renal ca. ACHN 


20.7 


Pancreatic ca. CAPAN2 


16.8 


Renal ca. UO-31 


8.3 


Pancreas Pool 


6.0 


Table AID. Panel 4.1D 


|Rel. Exp.(%) 

Tissue Name Ag4043, Run 

|171616960 


Tissue Name 


Rel. Exp.(%) 
Ag4043, Run 
171616960 


Secondary Thl act jl9.2 


HUVEClL-lbeta 


9.2 


Secondary Th2 act j 


25.5 


HUVEC I FN gamma 


8.9 


Secondary Trl act j 


14.3 


HUVEC TNF alpha + IFN 
gamma 


9.3 


Secondary Thl rest ]5.2 


HUVEC TNF alpha. + IL4 


5.3 
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Secondary Th2 rest 


4.9 


HUVECIL-11 


3.4 


Secondary Trl rest 


8.5 


Lung Microvascular EC none 


7.2 


Prtmarv Th 1 art 


10 7 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


j.j 


Primary Th2 act 


12.8 


Microvascular Dermal EC none 


5.0 


Prim 51 rv Trl cirt 
Jill Hal y ill u^i 


1 £ & 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


*> C 
J.O 


Primarv Th 1 rp^t 

i i iiiiaij' iiii i toi 


J.z. 


Bronchial epithelium TNFalpha 
+ ILlbeta 


0.4 


Primary Th2 rest 


27~~~ 


Small airway epithelium none 


3.2 


Primp rv Trl r**Qt 
j iiiiiCLiy ill i to i 




Small airway epithelium 
TNFalpha + IL-1 beta 


y. / 


CD45RA CD4 lymphocyte 
act 


1 1 1 

i j . i 


i^oronery artery oiviv^ resr 


J.U 


CD45ROCD4 lymphocyte 
act 




Coronery artery SMC TNFalpha 
+ IL-1 beta 


4.9 


jCD8 lymphocyte act 


12.8 


Astrocytes rest 


3.2 


'Secondary CD8 lymphocyte 
rest 


11.4 


Astrocytes TNFalpha + IL-1 beta 


3.3 


jSecondary CDS lymphocyte 
|act 


8.0 


! 

KU-812 (Basophil) rest !5L4 

! 


YTj4 ivmnliof^vt^ nnnp 
V^JJ'-t jyilipilULj IC IlUflL 


60 fc; 2(Basophil) (mo 

jPMA/ionomycin j 


2ry Thl/Th2/Trl anti- 
CD95CH11 


11.6 


CCD1 1 06 (Keratinocytes) none 


13.6 


LAK cells rest 


10.4 


CCD 1 106 (Keratinocytes) 
TNFalpha + IL-1 beta 


6.7 


LAix ceils 


14.9 


Liver cirrhosis 


5.6 


L*AlV CeilS 1L-Z+1L-1Z 


11.8 


NCI-H292 none 


13.9 


ceiis iJL-z^iriN 
gamma 


5.1 


NC1-H292 IL-4 


14.5 


LAK cells IL-2+ IL-1 8 


6.7 


NCI-H292 IL-9 


15.6 


LAK cells PMA/ionomycin 


10.7 


NCI-H292 IL-1 3 


14.0 


iNJv i^eiis iLi-z rest 


12.2 


NCI-H292 IFN gamma 


13.3 


i wo way mlk j day 


10.1 


HPAEC none 


3.1 


Two Wav MLR 5 dav 


8.6 


HPAEC TNF alpha + IL-1 beta 


7.4 


Two Way MLR 7 day 


8.2 


Lung fibroblast none 


3.7 


PBMC rest 


6.7 


Lung fibroblast TNF alpha + IL- 
1 beta 


2.8 


PBMC PWM 


10.0 


Lung fibroblast IL-4 


5.1 


PBMC PHA-L 


5.8 


Lung fibroblast IL-9 


6.7 
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Ramos (B cell) none 


18.3 


Lung fibroblast IL-13 


4.2 


xvdnios \t> ceuj lonomycin 


1 *7 O 

17.8 


Lung fibroblast IFN gamma 


6.3 


B lymphocytes PWM 


10.7 


Dermal fibroblast CCD 1070 rest 


7.0 


B lymphocytes CD40L and 

II -4 


10.4 


Dermal fibroblast CCD 1070 
TNF alpha 


12.9 


EOL-1 dbcAMP 


5.7 


Dermal fibroblast CCD 1070 IL- 
1 beta 


8.4 


EOL-I dbcAMP 
PMA/ionomycin 


4.8 


Dermal fibroblast IFN gamma 


3.8 

■ 


Dendritic cells none 


8.0 


Dermal fibroblast IL-4 |7.6 


Dendritic cells LPS 


6.6 


Dermal Fibroblasts rest 


5.2 


Dendritic cells anti-CD40 


11.1 


Neutrophils TNFa+LPS 


4.1 


Monocytes rest 


17.1 


Neutrophils rest 


12.2 


Monocytes LPS 


9.1 


Colon 


1.6 


Macrophages rest 


6.9 


Lung 


3.5 


Macrophages LPS 


4.2 j 


Thymus 


51.1 


HUVEC none 


3.9 i 


Kidney 


14.8 


jHUVEC starved 


5.5 ! 

i 







Table AIE . General oncology screening panel_v_2.4 



Tissue Name 


Rel. Exp.(%) 
Ag4043, Run 
268362932 


(Tissue Name 


Rel. Exp.(%) 
Ag4043, Run 
268362932 


Colon cancer 1 


22.2 


Bladder cancer NAT 2 


0!4 


Colon NAT 1 


5.1 


Bladder cancer NAT 3 


0.6 


Colon cancer 2 


29.7 


Bladder cancer NAT 4 


11.6 


Colon cancer NAT 2 


7.9 


Adenocarcinoma of the prostate 
1 


20.0 


Colon cancer 3 


67.8 


Adenocarcinoma of the prostate 
2 


2.8 


Co Ion cancer NAT 3 


17.6 


Adenocarcinoma of the prostate 
3 


9.2 


Colon malignant cancer 4 


91.4 


Adenocarcinoma of the prostate 
4 


95.9 


Colon normal adjacent 
tissue 4 


5.3 


Prostate cancer NAT 5 


3.3 


Lung cancer 1 


9.7 


Adenocarcinoma of the prostate 
6 


4.2 


Lung NAT 1 


3.8 


Adenocarcinoma of the prostate 
7 


3.8 


Lung cancer 2 


100.0 


Adenocarcinoma of the prostate 
8 


0.6 
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Lung NAT 2 


23.5 


A Hp»n n r* nrr* in oiriiJ rvf flip* *imcf"£it*» 

9 


19.1 


Squamous cell carcinoma 3 


42.3 


Prostate cancer NAT 10 


1.8 


Lung NAT 3 


1.1 


Kidney cancer 1 


17.1 


metastatic melanoma 1 


12.9 


KidneyNAT 1 


8.8 


Melanoma 2 


11.7 


Kidney cancer 2 


74.2 


Melanoma 3 


18.2 


Kidney NAT 2 i20.0 


metastatic melanoma 4 


47.6 


Kidney cancer 3 43.2 


metastatic melanoma 5 


42.9 


Kidney NAT 3 |8.4 


Bladder cancer 1 


2.6 


Kidney cancer 4 1 1-4.6 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


10.9 


Bladder cancer 2 


12.2 


1 



CNS_neurodegencration_vl.O Summary: Ag4043 This panel does not show 
differential expression of the CG9623I-01 gene in Alzheimer's disease. However, this 
expression profile confirms the presence of this gene in the brain. Please see Panel 1 A for 
discussion of utility of this gene in the central nervous system. 

Gencral_screcning_panel_vl.4 Summary: Ag4043 Expression of the CG96231- 
01 gene is highest in fetal liver (CT=25.6). This gene is also expressed at moderate levels 
in other metabolic tissues, including pancreas, thyroid, adrenal, pituitary, adipose, and 
adult and fetal heart and skeletal muscle. This widespread expression among these tissues 
suggests that this gene product may play a role in normal neuroendocrine and metabolic 
function and that disregulated expression of this gene may contribute to neuroendocrine 
disorders or metabolic diseases, such as obesity and diabetes. 

In addition, this gene is expressed at much higher levels in fetal liver when 
compared to expression in the adult counterpart (CT=33.1). Thus, expression of 'this gene 
may be used to differentiate between the fetal and adult source of this tissue. In addition, 
the relative overexpression of this gene in fetal liver suggests that the protein product may 
act in a regenerative capacity in the adult. Therefore, therapeutic modulation of the protein 
encoded by this gene could be useful in treatment of liver disease. 

This gene is also expressed at moderate levels in the CNS, including the 
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 
Therefore, therapeutic modulation of the expression or function of this gene may be useful 
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in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Expression is also slightly higher in cell lines derived from gastric, breast, ovarian, 
and prostate cancers. Overall, this gene is ubiquitously expressed in this panel suggesting a 
role for this gene product in cell growth and/or proliferation. 

Panel 4.1D Summary: Ag4043 Expression of the CG96231-01 gene is highest in 
the KU-812 basophil cell line treated with EMA/ionomycin (CT=27.2), with ubiquitous 
expression in this panel. Basophils release histamines and other biological modifiers in 
reponse to allergens and play an important role in the pathology of asthma and 
hypersensitivity reactions. Therefore, therapeutics designed against the putative protein 
encoded by this gene may reduce or inhibit inflammation by blocking basophil function in 
these diseases. In addition, these cells are a reasonable model for the inflammatory cells 
that take part in various inflammatory lung and bowel diseases, such as asthma. Crohn's 
disease, and ulcerative colitis. Therefore, therapeutics that modulate the function of this 
gene product may reduce or eliminate the symptoms of patients suffering from asthma, 
Crohn's disease, and ulcerative colitis. 

General oncology screening panel_v_2.4 Summary: Ag4043 Highest 
expression of the CG96231-01 gene on this panel is seen in lung cancer (CT=28.9). In 
addition, this gene is overexpressed in lung and colon cancer when compared to 
expression in the normal adjacent tissue. Thus, expression of this gene could be used as a 
marker of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of lung and colon cancer. 

AJ. CG96260-01: Phospholipase A2 

Expression of gene CG96260-01 was assessed using the primer-probe set Ag4048, 
described in Table AJA. Results of the RTQ-PCR runs are shown in Tables AJB, AJC and 
AJD. 



Table AJA . Probe Name Ag4048 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 1 -atgaaggtcattgccatcct-3 1 


20 


85 


338 


Probe 


TET-5 1 -cacccacagcagtttctggcagtt-3 • -TAMRA 


24 


129 


339 
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(Reverse js ' -atgtgtttgaccctcctctga- 3' * j 21 j 153 


| 340 


Table AJB. CNS neurnrle^eneratmn vl 0 


Tissue Name 


Rel. Exp.(%) 
Ag4048, Run 

1 HZ7ZZOZ 


Tissue Name 


Rel. Exp.(%) 
Ag4048, Run 
214292262 


AD 1 Hippo 


0 0 


control (rath) 3 I emporal Ctx 


30.6 


|AD 2 Hippo 


Ju. 1 


control (Path) 4 i emporal Ctx 


61.1 


AD 3 Hippo 


0 0 

V/.V/ 


AD i occipital Ctx 


137.1 


AD 4 Hippo 


0 0 

V/.V/ 


AU 2 Occipital Ctx (Missing) 


jO.O 


AD 5 Hinno 


u.u 


AD j Occipital Ctx 


0.0 


AD 6 Hippo 


47 6 


AD 4 Occipital Ctx 


0.0 


Control 2 Hinno 




AD 5 Occipital Ctx 


L . 

jO.O 


Control 4 Hippo 


1.A A 


AD 6 Occipital Ctx 


0.0 


Control (Path) 3 Hinno 


.JU.U 


Control I Occipital Ctx 


40.3 


tAD I TeniDoral Ctx 




Control 2 Occipital Ctx 


24.8 


AD 2 Temnoral Ctx 


J7.J 


Control 3 Occipital Ctx 


0.0 


!AD 3 Temnoral Ptv 


u.u 


jControl 4 Occipital Ctx 


>0.0 


!AD 4 TemDoral Ctx 


0 0 
u.u 


[Control (Path) I Occipital Ctx 


1 

[58.2 


lAD 5 I nf Temnoral Ctx 

I * * —f III! 1 Will r l«\ 


u.u 


Control (Path) 2 Occipital Ctx 


!o.o 


AD 5 Sup Temporal Ctx 


0.0 


Control (Path) 3 Occipital Ctx jO.O 


AD 6 Inf Temporal Ctx 


0.0 


Control (Path) 4 Occipital Ctx jO.O 


AD 6 Sup Temporal Ctx 


59.9 


Control 1 Parietal Ctx 


84.7 


Control 1 Temporal Ctx 


0.0 


Control 2 Parietal Ctx 


28.7 


Control 2 Temporal Ctx 


100.0 


Control 3 Parietal Ctx 


0.0 


Control 3 Temporal Ctx 


0.0 


Control (Path) 1 Parietal Ctx 


18.7 


Control 3 Temporal Ctx 


27.0 


Control (Path) 2 Parietal Ctx 


60.3 


Control (Path) 1 Temporal 
Ctx 


0.0 


Control (Path) 3 Parietal Ctx 


0.0 


Control (Path) 2 Temporal 
Ctx 


45.4 


Control (Path) 4 Parietal Ctx 


0.0 


Table AJC. General screeningjianel vl d 




] 

Tissue Name 4 


Rel. Exp.(%) 
<Vg4048, Run ' 
218535022 


Tissue Name 


Rel. Exp.(%) 
Ag4048, Run 
218535022 


Adipose < 


U ] 


^enalca.TK-10 j( 


).0 


Melanoma* Hs688(A).T ( 


).0 1 


31adder ( 


).0 


Melanoma* Hs688(B).T ( 


).0 ( 


jastric ca. (liver met) NCI-N87 ( 


).0 


Melanoma* M 14 ( 


1.0 < 


jastricca. KATO III ( 


).0 


Melanoma* LOXIM VI ( 


>-0 jColon ca. SW-948 ( 


).0 
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Melanoma* SK-MEL-5 


0.0 


Colon ca. SW480 


23.0 


Squamous cell carcinoma 
SCC-4 


0.0 


Colon ca.* (SW480 met) SW620 


0.0 


Te^fi^ Pnnl 




Pnlrtn na HT9Q 

v^oion ca. n i 


n n 


Prostate ca * fbone met^ 

* J VJIULV Vila \ UvllV ItlX/L/ 

PC-3 


0.0 


Colon ca. HCT-1I6 


0.0 


Prostate Pool 


52.9 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. OVCAR-3 


0.0 


Colon ca. Coio-205 


25.5 


Ovarian ca. SK-OV-3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


0.0 


Colon Pool 


51.4 


Ovarian ca. OVCAR-5 


0.0 


Small Intestine Pool 


92.0 


Ovarian ca. 1GROV-1 


0.0 


Stomach Pool 


19.2 


Ovarian ca. OVCAR-8 


0.0 


Bone Marrow Pool 


12.2 


Ovary 


9.9 


Fetal Heart 


0.0 ! 


IBreast ca. MCF-7 


0.0 


Heart Pool 


0.0 ; 


jBreast ca. MDA-MB-23 1 jO.O 


Lymph Node Pool 


35.6 ! 

i 


jBreast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


[Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


o.o" 1 


jBreast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


55.9 


Thymus Pool 


32.3 


Trachea 


22.7 


CNS cancer (glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U-l 18-MG j 


0.0 


Fetal Lung 


27.9 


CNS cancer (heuro;met) SK-N-AS 


0.0 


Lung ca.NCi-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


4.0 


CNS cancer (astro) SNB-75 


0.0 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) SNB-19 


0.0 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF-295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lungca.NCI-H526 


0.0 


Brain (cerebellum) 


15.4 


Lung ca.NCI-H23 


0.0 


Brain (fetal) 


0.0 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) Pool 


22.5 


Lung ca. HOP-62 


7.2 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


0.0 


Brain (Substantia nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


48.3 



606 



WO 03/010327 



PCT7US02/14199 



Kidney Pool 


62.9 . 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


100.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


11.3 


(Renal ca. UO-31 


0.0 


Pancreas Pool 


49.3 


Table AJD. General oncolosv screening nanel v 2:4 


j 

! 

I 

jTissue Name 

j 


Rel E\d(%) 
Ag4048, Run 
268362939 


Tissue Name 


Rel Fvn (%\ 
Ag4048, Run 
268362939 


jCoIon cancer 1 


0.0 


Bladder cancer NAT 2 


3.9 


jColon cancer NAT 1 


16.2 


Bladder cancer NAT 3 


0.0 


JColon cancer 2 


41.8 


Bladder cancer NAT 4 


0,0 


jColon cancer NAT 2 


39.8 


Adenocarcinoma of the prostate 1 


76.3 


jColon cancer 3 


4.6 


Adenocarcinoma of the prostate 2 


20.7 


jColon cancer NAT 3 


20.6 


Adenocarcinoma of the prostate 3 


0.0 


!CoIon malignant cancer 4 


0.0 


Adenocarcinoma of the prostate 4 


0.0 


Colon normal adjacent 
tissue 4 


58 " 


r rosiaie cancer in a i j 


c c 
J.J 


Lung cancer 1 


0.0 


Adenocarcinoma of the prostate 6 


5.4 


Lung NAT 1 


0.0 


Adenocarcinoma of the prostate 7 


19.9 


Lung cancer 2 


0.0 


Adenocarcinoma of the prostate 8 


0.0 


Lung NAT 2 


0.0 


Adenocarcinoma of the prostate 9 


100.0 


Squamous cell carcinoma 
3 


0.0 


Prostate cancer NAT 1 0 


0.0 


Lung NAT 3 


0.0 


Kidney cancer 1 


1.9 


metastatic melanoma 1 


0.0 


KidneyNAT 1 


0.0 


Melanoma 2 


0.0 


Kidney cancer 2 


21.6 


Melanoma 3 


7.7 


Kidney NAT 2 


0.0 


metastatic melanoma 4 


6.0 


Kidney cancer 3 


0.0 


metastatic melanoma 5 


5.6 


Kidney NAT 3 


0.0 


Bladder cancer 1 


4.4 


Kidney cancer 4 


0.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


0.0 


Bladder cancer 2 j 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag4048 This panel confirms the 



expression of the CG96260-01 gene at low levels in the temporal cortex of brain 
(CT=34.3). However, no differential expression of this gene was detected between 
5 Alzheimer's. diseased postmortem brains and those of non-demented controls in this 
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experiment. The the CG96260-01 gene codes for a homolog of rat phospholipase A2 
(PLA(2)). Mouse deficient in cytosolic PLA(2), a member of PLA(2) family, has been 
shown to suffer smaller infarcts and fewer neurological deficits after transient occlusion of 
the middle cerebral artery and have less injury after administration of a dopaminergic 
5 selective neurotoxin (Sapirstein A, and Bonventre JV, 2000, Biochim Biophys Acta 2000 
Oct 31 ; 1488(1 -2): 139-48, PMID: 1 1080683). Thus, PLA(2) encoded by this gene could 
also play a role in inflammation and injuries of brain and pharmacological targeting of this 
enzyme may have important therapeutic benefits. 

General_screening_panel_vl.4 Summary: Ag4048 Highest expression of the 
CG96260-01 gene is detected in renal cancer A498 cell line (CT=33). Therefore, 
therapeutic modulation of this gene product may be beneficial in the treatment of renal 
cancer. 

In addition, moderate to low expression of this gene is seen in samples derived 
from normal tissues including testis, prostate, breast, kidney, lymphnode. thymus and 
spinal cord. Therefore, therapeutic modulation of this gene product may be useful in 
treatment of diseases associated with these tissues. 

Among tissues with metabolic or endocrine function, this gene is expressed at 
moderate to low levels in pancreas, and the gastrointestinal tract. Therefore, therapeutic 
modulation of the activity of this gene may prove useful in the treatment of 
endocrine/metabolically related diseases, such as obesity and diabetes. 

The the CG96260-01 gene codes for a homolog of rat phospholipase A2 (PLA(2)). 
PLA(2) has been implicated in the pathogenesis and pathophysiology of acute pancreatitis 
(Nevalainen et aL 1999, Hepatogastroenterology 46(29):2731-5, PMID: 10576338). In 
addition, in blood plasma, PLA(2) modifies the circulating lipoproteins and so induce 
formation of small dense LDL particles, which are associated with increased risk for 
cardiovascular disease (Hurt-Camejo et al, 2001, Circ Res 89(4):298-304, PMID: . 
1 1509445). Furthermore, the cytoplasmic PLA(2) knockout mouse has revealed important 
roles of cPLA(2) in normal fertility, generation of eicosanoids from inflammatory cells, 
brain injuries and allergic responses (Sapirstein A, and Bonventre JV, 2000, Biochim 
Biophys Acta 2000 Oct 31;1488(l-2):139-48, PMID: 1 1080683). Therefore, therapeutic 
modulation of PLA(2) encoded by this gene may be useful in the treatment of pancreatitis, 
neurological disorders, allergies and cardiovascular diseases. 
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Panel 4.1D Summary: Ag4048 Expression of the CG96260-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel CNS_1 Summary: Ag4048 Expression of the CG96260-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
5 General oncology screening panel_v_2.4 Summary: Ag4048 Highest 

expression of the CG96260-01 gene is detected in prostate adenocarcinoma samples 
(CTs=33). In addition, low expression of this gene is also seen in a single colon cancer and 
the adjacent normal tissue sample. The CG96260-01 gene codes for a homolog of rat 
phospholipase A2 (PLA(2)). Phospholipase A2 has recently been recognized to be 
10 involved in a wide number of pathophysiological situations, ranging from systemic and 
acute inflammatory conditions to cancer (Balsinde et al., 1999, Annu Rev Pharmacol 
Toxicol 39:175-89, PMID: 10331081). Therefore, therapeutic modulation of this gene 
product through the use of small molecule drug may be useful in the treatment of colon 
and prostate cancers, [mpattu, 29-Mar-02] 

15 

AK. CG96364-01: ADP/ATP Translocase 2 Like protein 

Expression of gene CG96364-01 was assessed using the primer-probe set Ag4073. 
described in Table AKA. Results of the RTQ-PCR runs are shown in Tables AKB, AKC, 
AKD, AKE and AKF. 
20 Table AKA . Probe Name Ag4073 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -tcctcagatacttccccacc-3 ' 


20 


263 


341 


Probe 


TET-5 • -aacacaagcagatcttcctgggtggt-3 ' -TAMRA 


26 : 


311 


342 


Reverse 


5 1 -aactgggtcctcttgtccac-3 ' 


20 


337 


343 



Table AKB , AI_comprehensive panel_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4073, Run 
248081917 


Tissue Name 


Rel. 

Exp.(%) 
Ag4073, 
Run 

248081917 


110967COPD-F 


11.3 


1 12427 Match Control Psoriasis-F 


40.6 


110980COPD-F 


8.1 


1 12418 Psoriasis-M 


17.6 


110968COPD-M 


12.1 


1 12723 Match Control Psoriasis-M 


8.8 


1 10977 COPD-M 


20.9 


112419 Psoriasis-M 


24.1 
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110989Emphysema-F 


39.0 


1 1 2494 Matrh Pontrnl PcrwiocSe \A 


1 A 1 

14./ 


110992Emphysema-F . 


16.4 


1 12420 P<;oria<;K-M 


1A O 


1 10993 Emphysema-F 


12.3 


1 12425 Matrh Pnntrnl Pcnriacie \A 


07 A 


l]0994Emphysema-F 


5.9 


1 04689 (MF^ OA Rnnp-Rprknc 


J 1 .u 


110995 Emphysema-F 


39.5 


104690 (MF) Adj "Normal" Bone- 
Backus 


8.4 


110996 Emphysema-F 


8.5 


104691 (MF'lOA ^vnAvium RapL.- 


Zo.z 


110997 Asthma-M 


7.6 


104692 (BA) OA CArtilaof*-Rarl/,ic 


I o.V 


111001 Asthma-F 


18.7 


104694 (BA) OA Bone-Backus 


38.7 


111002 Asthma-F 


24.7 


104695 (BA) Adj "Normal" Bone- 


17.6 


111003 Atopic Asthma-F 


29.1 


104696 (BA'l OA Svnnvinm Ra^L'nc 


OZ.J 


111004 Atopic Asthma-F 


40.6 


1 04700 CSS^ OA Rnnp Rarlnc 
iU "'vu \JO j \Jr\ DOIIC-DaCKllS 


16.2 


1 1 1 005 Atopic Asthma-F 


20.6 


104701 fSS^ Adi "Nnrmnl" Rnn*» 

Backus 


19.3 


111006 Atopic Asthma-F 


6.2 


104702 (SS) OA Synovium-Backus 


34.2 


111417 Allergy-M 


13.0 


117093 OA Cartilage Rep7 


1 1.6 


112347 Allergy-M 


0.2 


112672 OA Bone5 


27.4 


112349 Normal Lung-F 


0.2 


1 12673 OA Synoviums 


12.j 


1 12357 Normal Lung-F 


26-6 jl 12674 OA Synovial Fluid cellsS 


12.6 


1 12354 Normal Lung-M 


12.4 


117100 OA Cartilage Rep 14 


8.1 


1 1 9 11A Prnline F 


19.3 


112756 OA Bone9 


100.0 


i ujoy iviatcn L/Ontroi 
Crohns-F 


21.6 


112757 OA Synovium9 


3.8 


j i z j / d i»,ronns-r 


15.5 


1 12758 OA Synovial Fluid Cells9 


11.8 


1 197^9 \Aatr>U r*r\n+-*.i 

i iz/jz iviaicii control 
Crohns-F 


42.6 


117125 RA Cartilage Rep2 


15.3 


1 1979^ Crr\Unc \A 

i IZ./ZD ^ronns-ivi 


7.5 


113492 Bone2 RA 


13.2 


1 19^87 Matnh Prinfr^t 

i Ijldo/ iviaicn V-^OlltrOI 

Crohns-M 


13.3 


113493 Synovium2RA 


6.0 


1 19*378 frnhnc \A i 

i izj)/o i^ronns-ivi 


3.1 


113494 Syn Fluid Cells RA 


12.2 


1 19^Q0 Mfltrh Cnntml 

* 1Z-J7V ividicn v^ontroi 
Crohns-M 


23.3 


1 13499 CartiIage4RA : 


23.5 


i iz/zo i^ronns-ivi ; 4 


>4.0 


1 13500 Bone4RA j: 


22.4 


112731 Match Control 
Crohns-M 


11.8 


1 13501 Synovium4RA 


17.2 


112380 Ulcer Col-F 


4.4 


1 13502 Syn Fluid Cells4 RA 


0.4 


112734 Match Control 
Ulcer Col-F " 


M.7 ] 


13495 Cartilage3RA ] 


3.4 


112384 Ulcer Col-F 2 


.3.7 1 


13496 Bone3RA 1 


3.3 
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1 12737 Match Control 
Ulcer Col-F 


9.9 


1 13497 Synovium3 RA 


7.1 


112386 Ulcer Col-F 


i .0 


1 13498 Syn Fluid Cells3 RA 


16.0 


112738 Match Control 
Ulcer Col-F 


38.4 


1 1 71 06 Normal Cartilage Rep20 


5.3 


112381 Ulcer Col-M 




1 1 Jooj BoneJ Normal 


0.6 


1 12735 Match Control 
Ulcer Col-M 


9.3 


1 13664 Synovium3 Normal 


0.0 


112382 Ulcer Col-M 


~\26 4 

[ 


1 1 jooi byn Fluid Cells3 Normal 


0.3 


1 12394 Match Control 
Ulcer Col-M 


3.6 . 

i 


1 1 7107 Normal Cartilage Rep22 


3.0 


112383 Ulcer Col-M 


19.5 


1 13667 Bone4 Normal 


6.1 


112736 Match Control 
Ulcer Col-M 


14.2 


1 13668 Synovium4 Normal 


10.2 


112423 Psoriasis-F 


19.3 


1 13669 Syn Fluid Cells4 Normal 


i 


13.6 


Table AKC CNS neurnH^npr^mn () 


Tissue Name 


Rel. Exp.(%) 
Ag4073, Run 
214294826 


Tissue Name 


Rel. Exp.(%) 
Ag4073. Run 
214294826 j 


AD 1 Hippo 


8.4 


Control (Path) 3 Temporal Ctx 


3.8 ! 


AD 2 Hippo 


18.3 


Control (Path) 4 Temporal Ctx 


^ 

14.2 i 


AD 3 Hippo 


2.1 


AD 1 Occipital Ctx 


8.9 




AD 4 Hippo 


3.4 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital Ctx 


3.9 


AD 6 Hippo 


28.1 


AD 4 Occipital Ctx 


12.4 


Control 2 Hippo 


12.9 


AD 5 Occipital Ctx 


25.2 


Control 4 Hippo 6.5 


AD 6 Occipital Ctx 


77.4 


Control (Path) 3 Hippo ; 


5.2 


Control 1 Occipital Ctx 


3.3 


AD 1 Temporal Ctx : 


5.5 


Control 2 Occipital Ctx 


58.2 


AD 2 Temporal Ctx { } 


17.4 


Control 3 Occipital Ctx 


12.8 


AD 3 Temporal Ctx ; 


1.3 


Control 4 Occipital Ctx 


3.9 


AD 4 Temporal Ctx ] 


1.1 | 


Control (Path) 1 Occipital Ctx 


49.3 


AD 5 Inf Temporal Ctx 1 


'3 7 jControl (Path) 2 Occipital Ctx 


6.5 


AD 5 SupTemporal Ctx J37.9 (control (Path) 3 Occipital Ctx 


1.6 


AD 6 Inf Temporal Ctx 2 


7 -0 Control (Path) 4 Occipital Ctx 


8.6 


AD 6 Sup Temporal Ctx 2 


8.9 jControl 1 Parietal Ctx 


3.4 


Control 1 Temporal Ctx 4 


-1 jControl 2 Parietal Ctx 


26.8 


Control 2 Temporal Ctx 2 


9.5 jControl 3 Parietal Ctx 


14.5 


Control 3 Temporal Ctx 8 


• 4 jControl (Path) 1 Parietal Ctx ( 


56.0 
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Control 4 Temporal Ctx 


6.3 


Control (Path) 2 Parietal Ctx 


15.7 


Control (Path) 1 Temporal 
Ctx 


26.6 


Control (Path) 3 Parietal Ctx 


3.7 


Control (Path) 2 Temporal 
Ctx 


26.4 


Control (Path) 4 Parietal Ctx 


26.8 


Table AKD. General «rrPPmj n <T panel vl 1 




Tissue Name 


Rel. Exp.(%) 
Ag4073, Run 
218906344 


Tissue Name 


Rel. Exp.(%) 
Ag4073, Run 


jAdipose 


1.4 


Renal ca. TK-10 


21.0 


(Melanoma* Hs688(A).T 


5.9 


Bladder 


11.8 


{Melanoma* Hs688(B).T 


6.7 


Gastric ca. (liver met.) NC1-N87 


22.5 


(Melanoma* Ml4 


72 7 {Gastric ca. KATO III 


100.0 


(Melanoma* LOXIMVI 


34.6 


Colon ca. SW-948 


21.6 


(Melanoma* SK-MEL-5 


67.8 


Colon ca. SW480 


36.9 


Squamous cell carcinoma j 

(SCC-4 P 2 - 7 

I — — - ! 


Colon ca.* (SW480 met) 
SW620 


66.0 


jTestis Pool h 0 

i ■ ; 


Colon ca. HT29 


34.6 


jProstate ca.* (bone met) 
jPC-3 


i 

140.9 


Colon ca. HCT-I 16 


68.3 


(Prostate Pool 


1.2 


Colon ca. CaCo-2 [24.3 


(Placenta 


3.1 


Colon cancer tissue 


35.1 


Uterus Pool 


0.4 


Colon ca.SWl 116 7.5 


Ovarian ca. OVCAR-3 


36.6 


Colon ca. Colo-205 


23.8 


Ovarian ca. SK-OV-3 


32.8 


Colon ca. SW-48 


25.9 


Ovarian ca. OVCAR-4 


37.9 


Colon Pool 


3.2 


Ovarian ca. OVCAR-5 


241 (Small Intestine Pool 


1.7 


Ovarian ca. IGROV-1 


J 8.9 (stomach Pool 


2.7 


Ovarian ca. OVCAR-8 


1 4 2 (Bone Marrow Pool 


0.8 


Ovary \* 


2 -5 jFctal Heart I 


4.2 


Breast ca. MCF-7 


^9.2 jHeart Pool 


1.5 


Breast ca. MDA-MB-23 1 : 


*9- 2 jLymph Node Pool 


5.4 


Breast ca. BT 549 f 


> 2 :' jFetal Skeletal Muscle ( 


).9 


Breast ca. T47D ] 


J9 - 8 jSkeletal Muscle Pool ] 


.2 


Breast ca. MDA-N 5 


2.9 S 


>pleen Pool |; 


!.4 


Breast Pool 2 


..7 i 


Tiymus Pool 3 


.6 


Trachea 4 


.1 < 
N 


:NS cancer (glio/astro) U87- 
4G 3 


2.1 


Lung 0 


.8 C 


!NS cancer (glio/astro) IM 1 8- ' 
1G 2 


7.2 
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Fetal Lung 


4.9 


LNo cancer (neuro,met) oiv-in- 
AS 


25.7 


Lung ca. NCI-N417 


17.4 


CNS bancer (astro) SF-539 


31.4 


Lung ca. LX-1 


48.3 


CNS cancer (astro) SNB-75 


50.3 


Lung ca. NCI-H146 


11.6 


CNS cancer (glio) SNB-19 


20.7 


Lung ca. SHP-77 


30.6 


CNS cancer (glio) SF-295 


16.5 , 


Lung ca. A549 


52.1 


Brain (Amygdala) Pool 


3.8 


Lungca.NCI-H526 jl2.4 


Brain (cerebellum) 


6.7 


Lung ca. NCI-H23 


31.6 


Brain (fetal) 


3.2 


Lungca. NCI-H460 


15.3 


Brain (Hippocampus) Pool 


4.2 


Lung ca. HOP-62 


11.8 


Cerebral Cortex Pool 


5.1 


Lungca.NCI-H522 


17.7 


Brain (Substantia nigra) Pool 


6.6 


Liver 


2.0 


Brain (Thalamus) Pool 


6.3 


Fetal Liver 


8.0 


Brain (whole) 


4.1 


Liver ca. HepG2 


21.8 


Spinal Cord Pool 


2.6 


Kidney Pool 


3.0 


Adrenal Gland 


4.6 


Fetal Kidney 


3.3 jPituitary gland Pool 


0.8 


Renal ca. 786-0 


15.2 jSalivary Gland 


4.4 


! |Renal ca. A498 15.7 iThyroid (female) 


2.1 


Renal ca. ACHN j 1 8.9 iPancreatic ca. CAPAN2 


30.4 


Renal ca. UO-3 1 


12.9 jPancreas Pool 


4.5 


Table AKE. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4073, Run 
171808865 


Tissue Name 


Rel.Exp.(%) 
Ag4073, Run 
171808865 


Secondary Thl act 


88.3 


HUVECIL-lbeta 


33.7 


Secondary Th2 act 


82.9 


HUVEC I FN gamma 


44.1 


Secondary Trl act 


59.5 


HUVECTNF alpha + 1FN 
gamma 


24.3 


Secondary Thl rest 


12.8 


HUVECTNF alpha + IL4 


32.1 


Secondary Th2 rest 


16.6 


HUVEC 1L- 11 


17.1 


Secondary Trl rest 


26.2 


Lung Microvascular EC none 


50.7 


Primary Thl act 


86.5 


Lung Microvascular EC 
TNFaIpha+ IL-lbeta 


31.6 


Primary Th2 act 


95.3 


Microvascular Dermal EC none 


29.3 


Primary Trl act 


100.0 


Microsvasular Dermal EC 
TNFalpha+ IL-lbeta 


28.1 


Primary Thl rest 


25.7 


Bronchial epithelium TNFalpha 
+ ILlbeta 


44.4 
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Primary Th2 rest 


13.9 


Small airway epithelium none 


20.3 


Primary Trl rp^t 

A J ii ncii y ill 1 vol 




Small airway epithelium 
TNFalpha + IL-1 beta 


C 1 A 

51.4 


CD45RA CD4 lymphocyte 
act 


77 A 


Coronery artery bMC rest 


17.7 


CD45RO CD4 lymphocyte 
act 


QA (k 


Coronery artery SMC TNFalpha 
+ IL-1 beta 


21.3 


CD8 lymphocyte act 


90.1 


Astrocytes rest 


14.5 


Secondary CDS 
lymphocyte rest 


80.1 


! 

Astrocytes TNFalpha + IL-1 beta 


12.2 


Secondary CDS 
lymphocyte act 


45.4 


KU-8 12 (Basophil) rest 


24.5 


i^rnpnocyie none 


7 ") 

/.Z 


KU-81 2 (Basophil) 
PMA/ionomycin 


59.5 


2ry Thl/Th2/Trl anti- 
CD95CH11 


42.3 


CCD1 3 06 (Keratinocytes) none 


64.6 


LAK cells rest 


57.0 


CCD 1 106 (Keratinocytes) 
TNFalpha + IL-1 beta 


46.0 


LAK cells IL-2 


42.9 


Liver cirrhosis 


6.8 


LAK cells IL-2+IL-12 


31.0 


NCI-H292 none 


61.6 


LAK cells IL-2+IFN 
gamma 


23.2 


NC1-H292 IL-4 


79.0 


LAK cells IL-2+IL-18 


28.7 


NC1-H292 IL-9 


95 3 


LAK cells PMA/ionomycin 


55.9 


NCI-H292 IL-1 3 


79.6 


NK Cells IL-2 rest 


59.0 


NCI-H292 I FN gamma 


65.5 


Two Way MLR 3 day 


27.5 


HPAEC none 


24.7 


Two Way MLR 5 day 


57.4 


HPAEC TNF alpha + IL-1 beta 


44.4 


1 wo Way MLR 7 day 


38.2 


Lung fibroblast none 


16.7 


PBMC rest 


14.2 


Lung fibroblast TNF alpha + IL- 
1 beta 


14.2 


PBMC PWM 


55.9 


Lung fibroblast IL-4 


23.2 


DDAyfr 1 r>T_T a t 

roMl PHA-L 


72.2 


Lung fibroblast IL-9 


35.4 


Ramos (B cell) none 


65.5 


Lung fibroblast IL-1 3 


23.5 


Ramos (B cell) ionomycin 


100.0 


Lung fibroblast IFN gamma 


28.3 


B lymphocytes PWM 


82.4 


Dermal fibroblast CCD 1070 rest 


36.1 


B lymphocytes CD40L and 
IL-4 


57.8 


Dermal fibroblast CCD1 070 
TNF alpha 


65.1 


EOL-1 dbcAMP 


51.4 


Dermal fibroblast CCD 1070 IL-1 
beta 


37.6 


EOL-1 dbcAMP 
PMA/ionomycin 


33.2 


Dermal fibroblast IFN gamma 


26.8 
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Dendritic cells none 46.0 


Dermal fibroblast IL-4 


31.2 


Dendritic cells LPS 


31.6 


Dermal Fibroblasts rest 


19.6 


Dendritic cells anti-CD40 


48.0 


Neutrophils TNFa+LPS 


3.1 


Monocytes rest 


25.3 


Neutrophils rest 


4.2 


Monocytes LPS 


18.0 


Colon 


35.4 


Macrophages rest 


70.2 


Lung 


17.6 


Macrophages LPS 


14.7 


Thymus 


32.8 


HUVEC none 


39.2 


Kidney 


47.6 


HUVEC starved 


43.8 






Table AKF. General oncologv screening panel v ? i 




Rel. Exp.(%) 

A ciA f\T2 Dim 

Ag^u / j, Kun 
268389975 


Tissue Name 


iRel. 
Exp.(%) 
Ag4073, 
Run 

1268389975 


Colon cancer 1 


50.3 


Bladder cancer NAT 2 jo. 1 


Colon cancer NAT 1 
Colon cancer 2 


17.7 
17.8 


Bladder cancer NAT 3 
Bladder cancer NAT 4 


|0J_ 
1.7 


Colon cancer NAT 2 


22.4 


Adenocarcinoma of the prostate 1 


6.9 


Colon cancer 3 


100.0 


Adenocarcinoma of the prostate 2 


0.3 


Colon cancer NAT 3 


23.8 


Adenocarcinoma of the prostate 3 


2.5 


Colon malignant cancer 4 


47.6 


Adenocarcinoma of the prostate 4 


7.8 


Colon normal adjacent 
tissue 4 


14.5 


Prostate cancer NAT 5 


1.4 


Lung cancer 1 


8.2 


Adenocarcinoma of the prostate 6 


1.1 


Lung NAT 1 


0.5 


Adenocarcinoma of the prostate 7 


1.3 


Lung cancer 2 


14.8 


Adenocarcinoma of the prostate 8 


0.4 


Lung NAT 2 


0.7 


Adenocarcinoma of the prostate 9 


3.7 


Squamous cell carcinoma 3 


23.0 


Prostate cancer NAT 1 0 


0.1 


Lung NAT 3 


0.4 


Kidney cancer 1 


6.3 


metastatic melanoma 1 


3.8 


KidneyNAT 1 


3.3 . 


Melanoma 2 


3.3 


Kidney cancer 2 


21.2 


Melanoma 3 


2.3 


Kidney NAT 2 


17.7 


metastatic melanoma 4 


8.8 


Kidney cancer 3 


4.5 


metastatic melanoma 5 


14.7 


Kidney NAT 3 


6.7 


Bladder cancer 1 l 


3.6 


Kidney cancer 4 


11.6 


Bladder cancer NAT 1 1 


3.0 


Kidney NAT 4 


17.4 


Bladder cancer 2 1.9 
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AI_comprehensive panel_vl.O Summary: Ag4073 Highest expression of the 
CG96364-01 gene is detected in osteoarthritic bone sample (CT=27). Low to moderate 
levels of expression of this gene are detected in samples derived from osteoarthritic (OA) 
bone and adjacent bone as well as OA cartilage, OA synovium and OA synovial fluid 
samples. Moderate level expression is also detected in cartilage, bone, synovium and 
synovial fluid samples from rheumatoid arthritis patients. Low level expression of this 
gene is also detected in normal samples derived from cartilage, synovium, bone or 
synovial fluid cells, normal lung samples. COPD lung, emphysema, atopic asthma, 
asthma, allergy, Crohn's disease (normal matched control and diseased), ulcerative 
colitis(normal matched control and diseased), and psoriasis (nonnal matched control and 
diseased). Therefore, therapeutic modulation of this gene product may ameliorate 
symptoms/conditions associated with autoimmune and inflammatory disorders including 
psoriasis, allergy, asthma, inflammatory bowel disease, rheumatoid arthritis and 
osteoarthritis 

CNS_neurodegencration_vl.O Summary: Ag4073 This panel confirms the 
expression of the CG96364-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screeningjpanel_vl.4 Summary: Ag4073 Highest expression of the 
CG96364-01 gene is detected in a gastric cancer KATO III cell line (CT=22). High 
expression of this gene is seen in cluster of breast, ovarian, colon, gastric, renal, lung, 
pancreatic, CNS, hepatic, prostate cancer cell lines and melanoma cell lines. Thus, 
therapeutic modulation of this gene product could be beneficial in the treatment of these 
cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 
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The CGI 00466-01 gene codes for adenine nucleotide translocator (ANT 2) " 
homolog. Dysfiinctioning of the ANT2 have been shown to to induce myopathies in 
mouse and in humans (Fiore et ah, 2001, Clin Chim Acta 31 1(2): 125-35, PMID: 
1 1566172). ANT has a role in mtDNA maintenance and mutation in ANT has been 
5 implicated in autosomal dominant progressive external ophthalmoplegia and other 
mitochondrial diseases (Kaukonen et al., 2000, Science 289(5480):782-5, PMID: 
10926541). Mice deficient in the heart/muscle specific isoform of the ANT1 exhibit many 
of the hallmarks of human oxidative phosphorylation (OXPHOS) disease, including a 
dramatic proliferation of skeletal muscle mitochondria (Murdoch et al., 1999, J Biol Chem 

10 274(20): 14429-33, PMID: 10318868). Therefore, therapeutic modulation of the ANT 
protein encoded by the CGI 00466-01 gene through the use of small molecule drug could 
be useful in the treatment mitochondria related diseases. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus. 

15 cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease. Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression- 
Panel 4.1D Summary: Ag4073 Highest expression of the CG96364-01 gene is 
detected in activated primary Trl and ionomycin treated Ramos B cell (CTs=25.4). This 

20 gene is expressed at high to moderate levels in a wide range of cell types of significance in 
the immune response in health and disease. These cells include members of the T-cell, B- 
cell, endothelial cell, macrophage/monocyte, and peripheral blood mononuclear cell 
family, as well as epithelial and fibroblast cell types from lung and skin, and normal 
tissues represented by colon, lung, thymus and kidney. This ubiquitous pattern of 

25 expression suggests that this gene product may be involved in homeostatic processes for 
these and other cell types and tissues. This pattern is in agreement with the expression 
profile in General_screeningjpanel_y 1 .4 and also suggests a role for the gene product in 
cell survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 

30 to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 
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General oncology screening panel_v_2.4 Summary: Ag4073 Highest 
expression of the CG96364-01 gene is detected in colon cancer OD06297-04 sample 
(CT=24.3). Higher expression of this gene is seen in colon cancer and lung cancer 
compared to the normal adjacent tissues. Therapeutic modulation of this gene product 
could be beneficial in the treatment of these cancers. Please see Panel 1 A for additional 
discussion of the potential utility of this gene in treatment of cancers. 

AL. CG96422-01: ADP/ATP Translocase 3 

Expression of gene CG96422-01 was assessed using the primer-probe set Ag4057, 
described in Table ALA. Results of the RTQ-PCR runs are shown in Tables ALB and 
ALC. 



Table ALA . Probe Name Ag4057 



jPrimers (Sequences 

•Forward 15 ■ -gccaatgtcatccgctact-3 ' 


Length 
19 


Start 
Position 

232 


SEQ ID 
No 

344 


iProbe 


TET-5 ' -agccctcaatttcgccttcaaggata-3 1 -TAMRAj 26 


261 


345 


; Reverse ;5 ' -gccaggaagatctgcttgtact-3 • | 22 


287 


346 



Table ALB. Panel 4.1 D 



Tissue Name 


Rel. Exp.(%) 
Ag4057, Run 
171620020 


Tissue Name 


Rel. 

Exp.(%) 
Ag4057, 
Run 

171620020 


Secondary Thl act 


33.9 


HUVECIL-lbeta 


51.1 


Secondary Th2 act 


32.5 


HUVEC IFN gamma 


56.6 


Secondary Trl act 


29.9 


HUVEC TNF alpha + IFN gamma 


29.9 


Secondary Th 1 rest 


14.9 


HUVEC TNF alpha + IL4 


42.9 


Secondary Th2 rest 


21.5 


HUVEC IL-1 1 


43.5 


Secondary Trl rest 


22.7 


Lung Microvascular EC none 


81.8 


Primary Thl act 


33.7 


Lung Microvascular EC TNFalpha 
+ IL-1 beta 


60.3 


Primary Th2 act 


54.7 


Microvascular Dermal EC none 


53.6 


Primary Trl act 


42.6 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


36.3 


Primary Thl rest 


16.4 


Bronchial epithelium TNFalpha + 
ILlbeta 


65.1 


Primary Th2 rest 


8.6 


Small airway epithelium none 


21.8 
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miliary in rest 


zz.z 


Small airway epithelium TNFalpha 
+ IL-l beta 


36.9 


CD45RA CD4 
lymphocyte act 


11 A 

-53.1) 


Coronery artery SMC rest 


46.7 


CD45RO CD4 
lymphocyte act 


/I "7 A 


Coronery artery SMC TNFalpha + 
IL-l beta 


39.5 


CD8 lymphocyte act 


[55.1 


Astrocytes rest 


29.3 


Secondary CDS 
lymphocyte rest 


1 

;76.3 

! 


Astrocytes TNFalpha + 1 L- ! beta 


26.8 


Secondary CD8 
lymphocyte act 


25.5 


KU-8 12 (Basophil) rest 


72.7 j 


k^uh lympnocyie none 


ZO.Z 


KU-8 12 (Basophil) 
PMA/ionomycin 


100.0 


2ryThl/Th2/Trl anti- 
CD95CHII 


34.6 


CCDI 106 (Keratinocytes) none 


45.4 


LAK. cells rest 


26.8 


CCD 1106 (Keratinocytes) 
TNFalpha + IL-l beta 


27.4 

i 


LAK cells IL-2 


0.0 


Liver cirrhosis 


8.1 i 


LAKcellsIL-2+IL-12 


29.5 


INCI-H292 none 


90.1 ' 


LAK cells IL-2+IFN 
gamma 


22.1 


NCI-H292 IL-4 


81.8 


jLAK cells IL-2+ IL-l 8 


17.7 


NCI-H292 IL-9 


92.7 | 


LAK cells 
PMA/ionomycin 


41.5 


NCI-H292 IL-l 3 


63.3 j 

! 


XII/ 1 1 ■¥ o . 

NK Cells IL-2 rest 


28.7 


NCI-H292 I FN gamma 


57.4 


1 wo Way MLR 3 day 


24.0 


HPAEC none 


50.0 


T"...^. ti/„. . x jit r> c j _ 

1 wo Way MLR 5 day 


33.0 


HPAEC TNF alpha + IL-l beta 


65.5 


i wo Way MLR 7 day 


28.1 


Lung fibroblast none 


49.0 


PBMC rest 


32.1 


Lung fibroblast TNF alpha + IL-l 
beta 


14.2 


PBMC PWM 


33.0 


Lung fibroblast IL-4 


37.6 


roML JrHA-L 


30.1 


I C~ 1 11 IT /X 

Lung fibroblast IL-9 


57.4 


Ramos (B cell) none 


36.1 


Lung fibroblast IL-l 3 


84.7 


Ramos (B cell) ionomycin 


51.8 


Lung fibroblast IFN gamma 


51.1 


B lymphocytes PWM 


42.9 


Dermal fibroblast CCD 1 070 rest 


46.7 


B lymphocytes CD40L 
and IL-4 


^0 1 

->Z. 1 


Dermal fibroblast CCD 1070 TNF 
alpha 


SI 1 


EOL-1 dbcAMP 


46.7 


Dermal fibroblast CCD 1 070 IL-l 
beta 


34.4 


EOL-1 dbcAMP 
PMA/ionomycin 


53.2 


Dermal fibroblast IFN gamma 


25.5 


Dendritic cells none 


57.0 


Dermal fibroblast IL-4 


55.5 
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Dendritic cell*; I PS 

J^SlsllVll fill* 1 1 O l_fl KJ 




Dermal Fibroblasts rest 


Zl .z 


Dendritic cel1<5 anti-(~T)4f) 


DO,7 


iNeutropniis i fNr'a+JLro 


4.2 


Monocytes rest 


51.4 


Neutroohils rest 


6.5 


Monocytes LPS 


16.7 


Colon 


14.8 


Macrophages rest 


45.7 


Lung 


23.7 


Macrophages LPS 


12.7 


Thymus 


40.3 


HUVEC none 


38.7 


Kidney 


36.3 


HUVEC starved 


57.0 







Table ALC . General oncology screening panel_v_2.4 



Tissue Name 


Rel. Exp.(%) 
Ag4057, Run 


Tissue Name 


Rel. 

Exp.(%) 
Ag4057, 
Run 

268390123 


Colon cancer 1 


9.2 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


6.5 


Bladder cancer NAT 3 


0.1 


jColon cancer 2 


12.8 


Bladder cancer NAT 4 


3.4 


jCoIon cancer NAT 2 


12.7 


Adenocarcinoma of the prostate 1 


11.6 


jColon cancer 3 


43.8 


Adenocarcinoma of the prostate 2 jo.9 


Colon cancer NAT 3 


16.2 


Adenocarcinoma of the prostate 3 |4.4 


Colon malignant cancer 4 


100.0 


Adenocarcinoma of the prostate 4 


14.4 


Colon normal adjacent tissue 
4 


8.7 


Prostate cancer NAT 5 


1.9 


Lung cancer 1 


8.0 


Adenocarcinoma of the prostate 6 


3.3 


Lung NAT 1 


0.4 


Adenocarcinoma of the prostate 7 


4.0 


Lung cancer 2 


19.1 


Adenocarcinoma of the prostate 8 


0.6 


Lung NAT 2 


0.6 


Adenocarcinoma of the prostate 9 


17.6 


Squamous cell carcinoma 3 


39.0 


Prostate cancer NAT 10 


0.3 


Lung NAT 3 


0.3 


Kidney cancer 1 


6.5 


metastatic melanoma 1 


16.4 


KidneyNAT 1 


3.3 


Melanoma 2 


2.7 


Kidney cancer 2 


42.3 


Melanoma 3 


1.9 


Kidney NAT 2 


11.3 


metastatic melanoma 4 


44.1 


Kidney cancer 3 


13.1 


metastatic melanoma 5 


38.7 


Kidney NAT 3 


4.7 


Bladder cancer 1 


0.2 


Kidney cancer 4 


16.8 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


15.4 


Bladder cancer 2 


2.5 
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CNS_neurodegeneration_vl.O Summary: Ag4057 Results from one experiment 
with the CG96422-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

General_screening_panel_vl.4 Summary: Ag4057 Results from one 
5 experiment with the CG96422-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Panel 4. ID Summary: Ag4057 Highest expression of the CG96422-01 gene is 
detected in PMA/ionomycin treated basophils (CT27.4). This gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 

1 0 health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 

macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 

1 5 tissues. This pattern is in agreement with the expression profile in 

General_screening_panel_vl.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 

20 diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4057 Highest 
expression of the CG96422-01 gene is detected in colon malignant cancer sample 
(CT=27.5). In addition, significant expression of this gene is associated with kidney 

25 cancer, prostate adenocarcinoma, bladder cancer, melanoma, lung and colon cancers. 

Therefore, therapeutic modulation of this gene product through the use of small molecule 
target may be beneficial in the treatment of these cancers. 

AM. CG96442-01: Acyl CoA-Domain Protein 

30 Expression of gene CG96442-01 was assessed using the primer-probe set Ag4058, 

described in Table AMA. Results of the RTQ-PCR runs are shown in Tables AMB and 
AMC. 
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Table AMA . Probe Name Ag4058 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 • -tcaaaggagattgtgccaatta-3 ' 


22 


703 . 


347 


Probe 


TET-5 • -tgcacctgtcagtgttcaacaacaga-3 ' -TAMRA 


26 


726 


348 


Reverse 


5 ' -ctgtttgggtttccgaattaat-3 ' 


22 


756 


349 



Table AMB , General_screcning_panel_vl.4 



Tissue Name 


|Kei. bxp.(yo) 
jAg<4058, Run 
|2 18902477 


Tissue Name 


Rel. Exp.(%) 
Ag4058, Run 
218902477 


Adipose 


|2.8 


Renal ca. TK-10 


9.9 


Melanoma* Hs688(A).T 


io.o 


Bladder 


33.2 


Melanoma* Hs688(B).T 




Gastric ca. (liver met.) NCI-N87 


26.2 


Melanoma* M 1 4 


Io.o 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


jo.o 


Colon ca. SW-948 


0.0 


Melanoma* SK-MEL-5 


jl 7.3 


Colon ca. SW480 


0.0 


Squamous cell carcinoma 
SCC-4 


Io.o 


Colon ca * CSW480 met) SW6">0 


1.6 


Testis Pool 


|66.4 


Colon ca. HT29 


0.0 


Prostate ca * (bone met) PC-3 


0.0 


Colon ca. HCT-116 


2.6 ' 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


19.9 


Uterus Pool 


2.0 


Colon ca. SW1116 


0.0 


Ovarian ca. OVCAR-3 


6U 


Colon ca. CoIo-205 


0.0 


Ovarian ca. SK-OV-3 


3.2 


Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


49.7 


Colon Pool 


16.0 


Ovarian ca. OVCAR-5 


1.8 


Small Intestine Pool 


5.2 


Ovarian ca. IGROV-1 


0.0 


Stomach Pool 


19.9 


Ovarian ca. OVCAR-8 


5.4 


Bone Marrow Pool 


6.7 


Ovary 


6.3 


Fetal Heart 


8.2 


Breast ca. MCF-7 


3.8 


Heart Pool 


2.4 


Breast ca. MDA-MB-23 1 


7.4 


Lymph Node Pool 


15.3 


Breast ca. BT 549 


4.0 


Fetal Skeletal Muscle 


10.4 


Breast ca. T47D 


11.3 


Skeletal Muscle Pool 


1.2 


Breast ca. MDA-N 


0.0 


Spleen Pool 


100.0 


Breast Pool 


28.3 


Thymus Pool 


5.8 


Trachea 


1.1 


CNS cancer (glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U- 11 8- 
MG 


0.0 
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Fetal Lung 


13.5 




JCNS cancer (neuro;met) SK-N-AS 


0.0 


Lungca. NCI-N417 


0.0 




|CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


3.4 




]CNS cancer (astro) SNB-75 


0.0 


Lung ca.NCI-H 146 


12.7 




|CNS cancer (glio) SNB-19 


0.0 


Lung ca. SHP-77 


1.9 




|CNS cancer (glio)SF-295 


3.4 


Lung ca. A549 


0.0 




[Brain (Amygdala) Pool 


|0.0 


Lung ca. NCI-H526 


0.0 




Brain (cerebellum) 


0.0 


Lung ca. NC1-H23 


j 1 4.7 




Brain (fetal) 


5.8 


Lung ca. NC1-H460 


jo.o . 




Brain (Hippocampus) Pool 


6.6 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


8.0 


Lungca.NCl-H522 


3.2 


jBrain (Substantia nigra) Pool 


11.7 


Liver 


3.0 




Brain (Thalamus) Pool 


9.0 


Fetal Liver 


4.2 


Brain (whole)" 


0.0 


Liver ca. HepG2 


0.0 


! 


Spinal Cord Pool 


0.0 


Kidney Pool 


20.0 


j Adrenal Gland 


16.4 


Fetal Kidney 


23.8 


[Pituitary gland Pool 


8.0 


Renal ca. 786-0 


jo.o 


i 
i 


Salivary Gland i8.3 


Renal ca. A498 j 


0.0 


jThyroid (female) [ 


0.0 


Renal ca. ACHN 


11.9 ~ 


iPancreatic ca. CAPAN2 ;7.4 j 


Renal ca. UO-3 1 


=21.2 


jPancreas Pool [ 


8.7 | 



Table AMC. Panel 4.1 D 



Tissue Name 


Rel. Exp.(%) 
Ag4058, Run 
171620034 


Tissue Name 


Rel. Exp.(%) 
Ag4058, Run 
171620034 


Secondary Thl act 


0.0 


HUVEClL-lbeta 


7.7 


Secondary Th2 act 


0.0 


HUVEC 1FN gamma 


2.9 


Secondary Trl act 


0.0 


HUVEC TNF alpha + I FN gamma 


2.9 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


2.5 


Secondary Th2 rest 


0.0 


HUVEC IL- 11 


5.8 


Secondary Trl rest 


3.1 


Lung Microvascular EC none 


7.5 


Primary Thl act 


1.7 


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium TNFalpha + 
ILlbeta 


3.7 


Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 


0.0 
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TNFalpha* IL-lbeta 




CD45RA CD4 lymphocyte 
act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 lymphocyte 
act 


0.0 


Coronery artery SMC TNFalpha + 
IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 lymphocyte 
rest 


0.0 


Astrocytes TNFalpha + IL-lbeta 


4.0 
0.0 


Secondary CD8 lymphocyte 
act 


0.0 


KU-812 (Basophil) rest 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2ATr1 anti-CD95 
CH11 


0.0 


CCD 1 106 (Keratinocytes) none 


0.0 


LAK cells rest 


9.5 


CCD1 1 06 (Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


LAK cells IL-2 


1.8 


Liver cirrhosis i9.3 

s 


LAK cells IL-2+IL-I2 


2.0 


NCI-H292 none jo.O 


|LAK cells IL-2+IFN gamma 


o7o" 


NCI-H292 IL-4 '0.0 


(LAK cells IL-2+IL- 18 


0.0 jNCI-H292 IL-9 jo.O 


jLAK cells PMA/ionomycin jo.0 ;NCI-H292 IL-13 io.O 


INK Cells IL-2 rest 


0.0 


NCI-H292 I FN gamma JO.O 


Two Way MLR 3 day 


6.5 


HPAEC none 


5.2 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha 4- IL-1 beta 


48.3 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-1 
beta 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-1 3 


0.0 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma 


2.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD 1070 rest 


0.0 


B lymphocytes CD40L and 
IL-4 


0.0 


Dermal fibroblast CCD1070 TNF 
alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1 070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells LPS 


3.5 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS 


4.0 
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Monocytes rest 


38.4 


Neutrophils rest 


0.0 


Monocytes LPS 


52.5 


Colon 


0.0 


Macrophages rest 


9.9 


Lung 


0.0 


Macrophages LPS 


5.9 


Thymus 


6.1 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


2.5 







AI_comprehensive panelvl.O Summary: Ag4058 Expression of the CG96442- 
01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 



CNS_neurodegeneration_vl.O Summary: Ag4058 Results from one experiment 
5 with the CG96442-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

General_screening_panel_vl.4 Summary: Ag4058. Highest expression of the 
CG96442-01 gene is detected in spleen (CT=31.4). In addition, moderate to low 
expression of this gene is seen in pancreas, adrenal gland, gastrointestinal tract, kidney, 
10 and testis. Therefore, therapeutic modulation of this gene can be useful in the treatment of 
diseases associated with these tissues, including diabetes, obesity, inflammatory bowel 
. disease, lupus and glomerulonephritis. 

Low expression of this gene is also detected in brain thalamus and substantia nigra 
samples. Therefore, therapeutic modulation of this gene product can be beneficial in 
1 5 neurological disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple 
sclerosis, schizophrenia and depression. 

Low expression of this gene is associated with cluster of cancer cell lines including 
renal cancer, gastric cancer, breast cancer, ovarian cancer, lung cancer and melanoma cell 
lines and colon cancer tissue. Therefore, therapeutic modulation of this gene product can 
20 be beneficial in the treatment of these cancers. 

Interestingly, this gene is expressed at much higher levels in fetal (CT=34.5) when 
compared to adult skeletal muscle and lung (CT>38). Thus, expression of this gene can be 
used to distinguish fetal from adult skeletal muscle and lung. In addition, the relative 
overexpression of this gene in fetal skeletal muscle suggests that the protein product may 
25 enhance growth or development of muscle and lung in the fetus and thus may also act in a 
regenerative capacity in the adult. Therefore, therapeutic modulation of the protein 
encoded by this gene could be useful in treatment of muscle and lung related diseases. 
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Panel 4.1D Summary: Ag4058 Highest expression of the CG96442-01 gene is 
detected in kidney (CT=3 1.4). Therefore, expression of this gene can be used to 
distinguish this sample from other samples in the panel. In addition, small molecule 
therapies designed with the protein encoded for by this gene could modulate kidney 
function and be important in the treatment of inflammatory or autoimmune diseases that 
affect the kidney, including lupus and glomerulonephritis. 

Low level expression of this gene is also detected in TNF alpha + IL-1 beta treated 
HP AEC, liver cirrhosis, and monocytes. Therefore, therapeutic modulation of this gene 
product can be useful in the treatment of inflammatory and autoimmune diseases such as 
asthma, IBD, and psoriasis and liver cirrhosis. 

General oncology screening panel_v_2.4 Summary: Ag4058 Expression of the 
CG96442-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

AN. CG96501-01: Brain Mitochondrial Carrier Protcin-1 Like protein 

Expression of gene CG96501-01 was assessed using the primer-probe set Ag4059, 
described in Table ANA. Results of the RTQ-PCR runs are shown in Tables ANB, ANC 
and AND. 



Table ANA . Probe Name Ag4059 



Primers 


Sequences 


Length 


Start j SEQID 
Position No 


Forward 


5 1 -ctcagtttcccagtcattgatg-3 ' 


22 


53 j 350 


Probe 


TET-5 1 -cctgagcagcagcaaagtaccactgt-3 1 -TAMRA 


26 


77 j 351 


Reverse 


5 1 -ggcgtcatatacaaagggttt-3 ' 


21 


131 j 352 



Table ANB . General screening panel vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
218902504 


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
218902504 


Adipose 


0.1 


Renal ca.TK-10 


1.4 


Melanoma* Hs688(A).T 


0.6 


Bladder 


1.2 


Melanoma* Hs688(B).T 


0.3 


Gastric ca. (liver met.) NCI-N87 


4.3 


Melanoma* M14 


0.6 


Gastric ca. KATO III 


1.8 


Melanoma* LOXIMVI 


0.4 


Colon ca. SW-948 


0.4 


Melanoma* SK-MEL-5 


0.7 


Colon ca. SW480 


1.7 


Squamous cell 


0.3 


Colon ca.* (SW480 met) SW620 


2.6 
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(carcinoma SCC-4 








[Testis Pool 


1.1 


Pnlnn osk RT9Q 


1 .3 


[Prostate ca.* (bone met) 
PC-3 


1.0 


Colon ca. HCT-116 


2.6 


jProstate Pool 


0.5 


Colon ca. CaCo-2 


2.8 


jpiacenta 


0.7 


Colon cancer tissue 


1.7 


(Uterus Pool 


0.3 


Colon ca. SW1116 


|0.9 


{Ovarian ca. OVCAR-3 


0.6 


Colon ca. CoIo-205 


0.3 


jOvarian ca. SK-OV-3 


3.6 


Colon ca. SW-48 


0.1 


Ovarian ca. OVCAR-4 


0.5 


Colon Pool 


3.3 


Ovarian ca. OVCAR-5 


2.1 


Small Intestine Pool 


4.0 


Ovarian ca. IGROV-1 


0.6 


Stomach Pnnl 

*J LVI 1 IHV 1 1 I UUI 


I ._> 


Ovarian ca. OVCAR-8 


0.6 


Bone Mflrrnw PnrJ 


1 ~> 
I .J 


Ovary 


2.4 


Fetal Heart 


1.2 


Breast ca. MCF-7 


2.0 


Hf»art Pnnl 
i it-ai l ruui 


i \ •"> 


Breast ca. MDA-MB- 
231 


3.3 


Lymph Node Pool 


3.7 ! 


Breast ca. BT 549 


|5.4 


Fetal Skeletal Muscle 




Breast ca. T47D 
Breast ca. MDA-N 


5.7 

~ ' 

0.6 


orvcicicll ivjuscie rOOl 

Spleen Pool 


1 1 i 

r * 


[Breast rOOl 


3.9 


Thymus Pool 




jTrachea 


1.3 


CNS cancer (glio/astro) U87-MG 


3.4 


Lung 


0.7 


CNS cancer (glio/astro) U-l 1 8- 
MG 


2.7 


Fetal Lung 


3.8 


CNS cancer fnf»iirrvm At ^ TvT 
AS 


3.8 


Lungca. NCI-N417 


0.4 j 


CNS cancer (astro) SF-539 


1.0 


Lung ca. LX-1 


2.1 j 


CNS cancer (astro) SNB-75 


3.5 


Lung ca. NCI-H146 


0.0 ( 


CNS cancer (glio) SNB-19 


0.7 


Lungca. SHP-77 


j, 


CNS cancer (glio) SF-295 


3.3 


Lung ca. A549 : 


2.6 |] 


Brain (Amygdala) Pool I 


D.6 


Lungca.NCI-H526 < 


).2 ] 


Brain (cerebellum) { 


3.9 


Lungca.NCI-H23 : 


1-9 1 


Brain (fetal) 


1.2 


Lungca.NCI-H460 ; 


U I 


Brain (Hippocampus) Pool 


1.2 


Lung ca. HOP-62 ; 


5.3 ( 


Cerebral Cortex Pool ( 


).8 


Lungca.NCI-H522 ] 


.5 I 


irain (Substantia nigra) Pool 


1.5 


Liver c 


1.1 i 


irain (Thalamus) Pool 1 


.2 


Fetal Liver 2 


.2 i 


irain (whole) 1 


.2 


Liver ca. HepG2 2 


•1 S 


Ipinal Cord Pool 1 


.3 
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Kidney Pool 


3.1 


Adrenal Gland 0.7 


Fetal Kidney 


3.9 


Pituitary gland Pool jo.5 


Renal ca. 786-0 


2.1 


Salivary Gland |o.4 


Renal ca. A498 


1.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


1.3 


Pancreatic ca. CAPAN2 


2.1 


Renal ca. UO-3 1 


2.0 


Pancreas Pool 


100.0 


Table ANC. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
171620235 


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
171620235 


Secondary Th 1 act 


14.4 • 


HUVECIL-lbeta 


15.7 


Secondary Th2 act 


39.2 


HUVEC I FN gamma 


26.1 


SeconHarv TV1 ant 

o wvji iuai y ill aLL 


34.4 


HUVECTNF alpha + IFN 
gamma 


16.8 


Secondary Th 1 rest 


18.8 


HUVECTNF alpha + IL4 


10.6 , 


Secondary Th2 rest 


6.1 jHUVEClL-11 


10.7 


Secondary Trl rest 

j *—* WW V/ • ■ w) III J All I w «3 1 


36.6 jLung Microvascular EC none 


19.5 


Primary Thl act 


13.4 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


14.4 


Primary Th2 act 

i — — — — — — 


3 1 2 j|Vl icrovascular Dermal EC none 


18.8 


Primary Trl act 


2 jMicrosvasular Dermal EC 
|TNFalpha + IL-lbeta 


12.2 


Primary Thl rest 


6.4 


Bronchial epithelium TNFalpha + 
ILlbeta 


4.6 


Primary Th2 rest 


16.5 


Small airway epithelium none j3.0 


Primary Trl rest 


12.1 


Small airway epithelium 
TNFalpha + IL-lbeta 


4.4 


CD45RA CD4 
lymphocyte act 


21.9 


Coronery artery SMC rest 


2.8 


CD45ROCD4 
lymphocyte act 


39.5 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


4.5 


CD8 lymphocyte act 


40.3 


Astrocytes rest 


4.3 


Secondary CD8 
lymphocyte rest 


23.2 


Astrocytes TNFalpha + IL- 1 beta 


10.5 


Secondary CD8 
lymphocyte act 


7.2 


KU-812 (Basophil) rest 


33.9 


CD4 lymphocyte none 


27.7 


KU-8 12 (Basophil) 
PMA/ionomycin 


34.6 


2ryThl/Th2/Trl anti- 
CD95 CHI 1 


45.7 


CCD 1 106 (Keratinocytes) none 


9.2 


LAK cells rest 


17.9 


CCD 1 1 06 (Keratinocytes) 


8.7 
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rNFalpha + IL-1 beta 




LAK cells IL-2 


27.7 1 


Liver cirrhosis ' 


1.3 


LAK cells IL-2+IL- 12 


20.2 ~Jl 


NCI-H292 none 


11.0 


LAK cells IL-2+1FN 
gamma 


12.0 


NCI-H292 IL-4 


11.5 


LAK cells IL-2+IL- 18 


29.3 


NCI-H292 IL-9 


24.3 


LAK cells 
PMA/ionomycin 


5.6 

i 


NCI-H292 IL-13 


12.3 


NK Cells IL-2 rest 


37.6 | 


NC1-H292 IFN gamma 1 1 3.8 


Two Way MLR 3 day 


52.1 


HPAEC none 


18.8 


Two Way MLR 5 day 


22.8 


HPAEC TNF alpha + IL-1 beta 


20.0 


Two Way MLR 7 day 


30.8 


Lung Fibroblast none 


23.8 


PRMC rest 


4.6 


Lung fibroblast TNF a|pha + IL-1 
beta 


9.2 


PBMC PWM 


24.5 


Lung Fibroblast IL-4 j 


10.5 


PBMC PHA-L 


29.1 (Lung fibroblast IL-9 


23.8 


Ramos (B cell) none 


41.8 jLung fibroblast IL-1 3 


19.1 


Ramos (B cell) 
ionomycin 


37.9 
76~4 


Lung Fibroblast IFN gamma 


20.9 


B lymphocytes PWM 


Dermal Fibroblast CCD 1070 rest 


29.1 


\B lymphocytes CD40L 
andIL-4 


jDermal fibroblast CCD 1070 TNF 
423 (alpha 


23.7 


EOL-1 dbcAMP 


40.3 


Dermal Fibroblast CCD 1070 IL-1 
beta 


4.4 


EOL-1 dbcAMP 
PMA/ionomycin 


15.5 


Dermal fibroblast IFN gamma 


7.3 


Dendritic cells none 


11.4 


Dermal Fibroblast IL-4 


20.3 


Dendritic cells LPS 


18.9 


Dermal Fibroblasts rest 


21.3 


Dendritic cells anti- 
rr>40 


4.2 


Neutrophils TNFa+LPS 


2.8 


IVlOlli/L'yiCa I Col 


6.7 


Neutrophils rest 


17.4 


Monocytes LPS 


21.9 


Colon 


3.1 


Macrophages rest 


16.2 


Lung 


6.5 


Macrophages LPS 


12.9 


Thymus 


57.0 


HUVEC none 


12.7 


Kidney 


100.0 


HUVEC starved 


33.9 






Table AND. General oncology screening panel_v_2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
268390135 


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
268390135 
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Colon cancer 1 


8.7 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


0.6 


Bladder cancer NAT 3 


0.4 


Colon cancer 2 


9.5 


Bladder cancer NAT 4 


8.1 


Colon cancer NAT 2 


4.1 


Adenocarcinoma of the prostate 1 


31.2 


Colon cancer 3 


12.9 


Adenocarcinoma of the prostate 2 


2.4 


iColon cancer NAT 3 


14.7 


Adenocarcinoma of the prostate 3 j 12.2 


jCoIon malignant cancer 
I 4 


29.7 

• 


! 

Adenocarcinoma of the prostate 4 1 1 7.0 


Colon normal adjacent 
tissue 4 


2.1 


Prostate cancer NAT 5 |2.7 


Lung cancer l 


7.9 


Adenocarcinoma of the prostate 6 


4.7 


Lung NAT I 


1.9 


Adenocarcinoma of the prostate 7 


0.0 


Lung cancer 2 


23.8 


Adenocarcinoma of the prostate 8 


0.0 


Lung NAT 2 


0.6 


Adenocarcinoma of the prostate 9 


27.4 


.Squamous cell 
jcarcinoma 3 


11.2 


Prostate cancer NAT 10 


0.5 


■Lun CT NAT 3 

j metastatic melanoma 1 


0.0 
19.5 


Kidney cancer 1 
KidneyNAT 1 


17.0 

8.3 


i 

jMelanoma 2 


1.9 


Kidney cancer 2 


47.6 


{Melanoma 3 


'•3 


Kidney NAT 2 ■ 


20.7~ 


metastatic melanoma 4 


97.9 jKidney cancer 3 ;44.1 


metastatic melanoma 5 


100.0 


Kidney NAT 3 


6.4 


Bladder cancer 1 


0.5 


Kidney cancer 4 


10.4 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


2.1 


Bladder cancer 2 


3.4 







CNS_neurodegenerationjvl.O Summary: Ag4059 Results from one experiment 
with the CG96501-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 
5 General_screening_panel_vl.4 Summary: Ag4059 Highest expression of the 

CG96501-01 gene is detected in pancreas (CT=24.6). In addition, moderate expression of 
this gene is seen among the tissues with metabolic or endocrine function such as pancreas, 
adipose, adrenal gland, pituitary gland, skeletal muscle, heart, liver and the gastrointestinal 
tract. Therefore, therapeutic modulation of the activity of this gene may prove useful in the 
10 treatment of endocrine/metabolically related diseases, such as obesity and diabetes. 

The CG96501-01 gene codes for homolog of mouse mitochondrial uncoupling 
protein 5 (UCP 5). Proteins belonging to UCP family play important role in 
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thermoregulatory responses, including acute hypothermia usually followed by temperature 
recovery (Yu et al., 2000, Am J Physiol Endocrinol Metab 279(2):E433-46). In addition, 
mutation in UCP3, another member of UCP family, causes severe obesity and type II 
diabetes (Brown et al., 1999, Hum. Mutat. 13: 508 only). Therefore, therapeutic 
5 modulation of this gene product can be useful in treatment of disorders affecting 
thermoregulatory responses, obesity and type II diabetes. 

Significant expression of this gene is also seen in cluster of cancer cell lines 
including pancreatic cancer, CNS cancer, colon and gastric cancer, renal cancer, lung 
cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell carcinoma and 

10 melanoma cell lines. Therefore, therapeutic modulation of this gene product can be 
beneficial in the treatment of these cancers. 

Also, this gene is expressed at high levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 

15 central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4059 Highest expression of the CG96501-01 gene is 
detected in kidney (CT=32). This gene is expressed at low to moderate levels in a wide 
range of cell types of significance in the immune response in health and disease. These 

20 cells include members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and 
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types 
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney. 
This ubiquitous pattern of expression suggests that this gene product may be involved in 
homeostatic processes for these and other cell types and tissues. This pattern is in 

25 agreement with the expression profile in General_screening_panel_vl.4 and also suggests 
a role for the gene product in cell survival and proliferation. Therefore, modulation of the 
gene product with a functional therapeutic may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 

30 inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 
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General oncology screening panel_v_2.4 Summary: Ag4059 Highest 
expression of the CG96501-01 gene is detected in tow metastatic melanoma samples 
(CTs=31.7). Interestingly, significant expression of this gene is also seen in cluster of 
cancer samples including kidney, prostate adenocarcinoma, metastatic melanoma, lung 
and colon cancers. Please see Panel 1 .4 for a discussion of the potential utility of this gene 
in treatment of these cancers. 



AO. CG96557-01: L-ALLO-Threoninc Aldolase-Likc Protein 

Expression of gene CG96557-01 was assessed using the primer-probe set Ag4067, 
described in Table AOA. 

Table AOA . Probe Name Ag4067 



iPrimers 



Sequences 



jForward 



Length 



5 1 -tggagcactgtgactctgtgt- 3 



21 



Probe iTBT -5'-ctttetgcctctccaaggg*cctq-3~lTaMRA! 23 



: Reverse 



5 ' -aggcttcttcaatgaagtcctt - 3 



Start 
Position 



614 



SEQ ID No | 



353 



635 



->-> 



691 



354 



355 



CNS_neurodegeneration_vl.O Summary: Ag4067 Results from one experiment 
with the CG96557-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

General_screening_panel_vl.4 Summary: Ag4067 Expression of the CG96557- 
01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). , 

Panel 2.2 Summary: Ag4067 Expression of the CG96557-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 4.1D Summary: Ag4067 Expression of the CG96557-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
Results from one experiment (run 171 808741) with the CG96557-01 gene are not 
included. The amp plot indicates that there were experimental difficulties with this run. 



AP. CG96581-01:RP42 
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Expression of gene CG96581-01 was assessed using the primer-probe set Ag4070, 
described in Table APA. Results of the RTQ-PCR runs are shown in Tables APB, APC, 
APD and APE. 



j — — • 

Primers {Sequences 

i 


Length 


Start 
Position 


SEQID 
No 


Forvvard,5 ' -ccagtagtcacaggtggaaaac-3 ' 


22 


835 


356 


Probe TET-5 ' -cagccttttctaggcagcaagttaagca-3 ' -TAMRA 


28 


858 


357 


Reverse j5 ■ -caatctccttgcaggatacaaa-3 1 


22 


906 


358 



Table APB , CNS_ncurodegencration_vl.O 



Tissue Name 


Rel Exn (%\ 
Ag4070, Run 
214294588 


Tissue Name 


Kel. bxp.(%) 
Ag4070, Run 
214294588 


AD 1 Hippo 


10.2 


[Control (Path) 3 Temporal Ctx 


0.1 


AD 2 Hippo 


14.1 


[Control (Path) 4 Temporal Ctx 


26.6 


AD 3 Hippo 




;AD 1 Occipital Ctx 


16.8 


AD 4 Hippo 


4.0 


;AD 2 Occipital Ctx (Missing) 


•0.0 j 


AD 5 hippo 


Jloo.o 


iAD 3 Occipital Ctx 


F-i ~ ! 


AD 6 Hippo 


55.1 


[AD 4 Occipital Ctx 


{ 1 5.6 


Control 2 Hippo 


15.3 


{AD 5 Occipital Ctx 


12.8 


Control 4 Hippo 


.10.1 


AD 6 Occipital Ctx 


28.9 


Control (Path) 3 Hippo 


4.5 


Control 1 Occipital Ctx 


3.5 


AD 1 Temporal Ctx 


13.5 


Control 2 Occipital Ctx 


76.8 


AD 2 Temporal Ctx 


19.8 


Control 3 Occipital Ctx 


19.3 


AD 3 Temporal Ctx 


4.1 


Control 4 Occipital Ctx 


5.2 


AD 4 Temporal Ctx 


16.8 


Control (Path) 1 Occipital Ctx 


82.9 


AD 5 Inf Temporal Ctx 


50.0 


Control (Path) 2 Occipital Ctx 


11.3 


AD 5 SupTemporal Ctx 


20.0 


Control (Path) 3 Occipital Ctx 


1.6 


AD 6 Inf Temporal Ctx 


66.4 


Control (Path) 4 Occipital Ctx 


12.6 


AD 6 Sup Temporal Ctx 


62.4 


Control 1 Parietal Ctx 


3.7 


Control 1 Temporal Ctx 


4.2 


Control 2 Parietal Ctx 


21.5 


Control 2 Temporal Ctx 


32.5 


Control 3 Parietal Ctx 


17.8 


Control 3 Temporal Ctx 


9.9 


Control (Path) 1 Parietal Ctx 


58.6 


Control 4 Temporal Ctx 


4.9 


Control (Path) 2 Parietal Ctx 


21.0 


Control (Path) 1 
Temporal Ctx 


37.1 


Control (Path) 3 Parietal Ctx 


2.4 


Control (Path) 2 32.1 


Control (Path) 4 Parietal Ctx 


56.9 
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Temporal Ctx 



Table APC . Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
1 72382436 


Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
| 7238^416 


Secondary Thl act 


42.9 


HUVECIL-Ibeta 


27.0 


Secondary Th2 act 


58.6 jHUVEC I FN gamma 


27.0 


Secondary Trl act 


47.3 


HUVECTNF alpha + IFN 
gamma 


10.8 


Secondary Thl rest 


16.2 


HUVEC TNF alpha + 1L4 


21.3 


Secondary Th2 rest 


14.6 


HUVECIL-11 


15.7 


Secondary Trl rest 


27.7 [Lung Microvascular EC none 


38.2 


Primary Thl act 


46 y 'Lung Microvascular EC 
iTNFalpha + lL-Ibeta 


31.0 


Primary Th2 act 


46.7 


Microvascular Dermal EC none |26.1 


Primary Trl act 

i 

, — 


42.9 

! 


'Microsvasular Dermal EC 
[TNFalpha + IL-lbeta 


19.8 


iPrimary Thl rest 117.9 

i i 


jBronchial epithelium TNFalpha 
l+ILIbeta 


33.7 


•Primary Th2 rest 


5.6 


Small airway epithelium none jl 1 .1 


Primary Trl rest 


19.6 


Small airway epithelium 
TNFalpha + IL-1 beta 


22.4 


CD45RA CD4 
lymphocyte act 


42.3 


Coronery artery SMC rest 


17.8 


CD45ROCD4 
lymphocyte act 


48.0 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


19.5 


CD8 lymphocyte act 


57.4 


Astrocytes rest 


14.1 


Secondary CDS 
lymphocyte rest 


26.6 


Astrocytes TNFalpha + IL-1 beta 1 


8.2 


Secondary CD8 
lymphocyte act 


14.3 


KU-812 (Basophil) rest 


49.3 


CD4 lymphocyte none 


19.5 


KU-8 12 (Basophil) 
PMA/ionomycin 


74.2 


2ryThl/Th2yTrl anti- 
CD95 CHI 1 


35.8 


CCD 1 1 06 (Keratinocytes) none i 


39.5 


LAK cells rest ; 


24.8 


CCD1106(Keratinocytes) 
TNFalpha + 1L-1 beta 


17.9 


LAK cells IL-2 


30.8 


Liver cirrhosis 


3.8 


LAK cells IL-2+IL-12 


18.6 ] 


MCI-H292none : 


28.3 


LAK cells IL-2+IFN 
gamma 


IL7 l 


MCI-H292IL-4 i 


52.5 
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LAK cells 1L-2+ 1L- 1 8 


14.2 


NCI-H292 IL-9 j62.9 


TAT/" ^IL 

LrAJv cells 
PMA/ionomycin 


13.3 


NCI-H292 IL-13 


43 <5 


in iv uejis ijl-z rest 


f A -7 

5u\7 


NCI-H292 IFN gamma 


,20.6 


i wo way MLrK j> day 


26.1 


HPAEC none 


12.9 


i wo way iviLK j day 


22.4 


HPAEC TNF alpha + IL-1 beta 


22.1 


i wo way mjlk / day 


23.5 


Lung fibroblast none 


16.8 


PBMC rest 


10.0 


Lung fibroblast TNF alpha + IL- 
I beta 


19.0 


rDivlC r W1V1 


2 j. 2 


Lung fibroblast IL-4 


13.3 


PBMC PHA-L 


21.2 


Lung fibroblast IL-9 


13.5 


Ramos (B cell) none 


54.3 


Lung fibroblast IL-13 


14 9 


Ramos (B cell) 
ionomycin 




Lung fibroblast IFN gamma 


21.0 


B lymphocytes PWM 


26.4 


Dermal fibroblast CCD 1070 rest 


36.6 


B lymphocytes CD40L 
and IL-4 


07 7 
Z /, / 


Dermal fibroblast CCD 1070 
TNF alpha 


51.8 


EOL-1 dbcAMP 


55.1 


Dermal fibroblast CCD 1070 IL-l 
beta 


23.7 


jEOL-1 dbcAMP 
j P M A/ ionomy c i n 


49.7 


Dermal fihrnhlflQt IFN oflmmn 
l/v*i uicii 1 1 ui \jKJia.oi 11 IN Tallin Itl 


1 o.j 


Dendritic cells none 


49.7 


Dermal fibroblast IL-4 


29.1 


Dendritic cells Lro 


20.9 


Dermal Fibroblasts rest 


13.8 


Dendritic cells anti- 
CD40 


41.2 


Neutrophils TNFa+LPS 


6.7 


Monocytes rest 


23.5 


Neutrophils rest 


17.1 


Monocytes LPS 


36.1 


Colon 


7.0 


Macrophages rest 


24.8 


Lung 


6.9 


Macrophages LPS 


7.0 


Thymus 


100.0 


HUVEC none 


12.7 


Kidney 


44.1 


HUVEC starved 


19.6 






Table APD. Panel CNS 1 


Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
180912025 


Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
180912025 


BA4 Control 


25.2 


BA17PSP 


21.5 


BA4 ControI2 


32.5 


BA17PSP2 


6.5 


BA4 Alzheimer ! s2 : 


2.9 


Sub Nigra Control 


33.7 


BA4 Parkinson's 


40.1 


Sub Nigra Control2 


23.2 
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BA4 Parkinson's2 


100.0 


olio iNigra Aizneimersz 


1Z.I 


BA4 Huntineton's 


29.1 


ouu iNigra rarKinson sz 




BA4 Huntington's2 


6.1 


ouu iNigra riuniingion s 


4V. / 


BA4 PSP 


17 0' 


ouu iNigra nuniingion sz 


Zo. 1 


BA4 PSP2 


26.6 


QiiH MJora PCDO 


O. 1 


BA4 Depression 


9.2 


ouu iNi^id depression 


Q 0 
O.Z 


BA4 Depression2 


10.4 


ouu iNjg.ia L/cpr cssionz 


/.z 


BA7 Control 


37.6 


vjiuu raiiauus *-<oniroi 


111 

11.1 


BA7 Control2 


32.8 


vjiuu rajutuuz> v^omroiz 


O.Z 


BA7 Alzheimer's2 


5.1 


vjiou rdiiauus /\izneimers 


^7 1 
/. I 


BA7 Parkinson's 


18.0 


vjiod rdiiduus /\izneimersz 


J.J 


BA7 Parkinson's2 


— O . 1 


vjiod i aiiaaus rarKinson s 


jj.4 


BA7 Huntineton's 


40 Q 


vjiod raiiaaus rarKinson sz 


A C 

4. j 


BA7 Huntin»ton's2 


1 

JU. 1 


VJIOD rulldflUS ror 


z.z 


BA7 PSP 


?R 7 


UIOD raiiaaus rorZ 


10.2 


!BA7 PSP2 


OA 7 

1 1 7 
11./ 


Glob Palladus Depression 


5.7 


RA7 DpnrpQ^irm 
i— / coj lull 


Temp Pole Control 


O 1 

8.1 


BA9 Control 




i enip roie controiz 


1 o o 

1 8.8 


lBA9 Control2 

r\. s Willi 


J7.U 


Temp Pole Alzheimer's 


j. 1 


)BA9 Alzheimer's 


j . y 


l emp roie Aizneimersz 


A A 


BA9 AIzheimer's2 


1 1 0 


l emp roie rarKinson s 


1 A 1 

19. 1 


BA9 Parkinson's 


16 7 


i emp roie rarKinson sz 


14.0 


BA9 Parkinson f s2 


4R 0 


i emp r oie nuniingion s 


21.0 


BA9 Huntington'*; 

A-**V >' 1 1UI till ItlUI 1 


J J .7 


1 emp rOJe ror 


z.8 


BA9 Huntington's? 




T« mn D^lo DC DO 

i emp r oie ror/ 


1 Q 

1.5 


BA9 PSP 


1 L/.Z. 


iemp roie uepressionz 


c c 
J.O 


BA9 PSP2 


7 0 


\^ing vjyr i^oniroi 


41.8 


BA9 Deoression 


6 7 
u. / 


Coiner (~l\rr nnir/^IO 

v^ing vjryr i^oniroiz 


13.2 


BA9 Deoression2 


4 6 


v^ing vjyr /\izneimer s 


1 1 A 

1 1.4 


BA 17 Control 


SO > 


v^ing uyr Aizneimer sz 


a i 
o.3 


BA17 Control2 


SR ? 


L/ing vjyr rarKinson s 


8.0 


BA17 Alzheimer f s2 


14 6 


v^ing vjyr rarKinson sz 


OA *7 
Z4. / 


BA17 Parkinson's 


38.7 


Cinff Gvr Huntineton's 


Z>Z,.J 


BA17Parkinson's2 


53.2 


Cing Gyr Huntington's2 


12.9 


BA 17 Huntington's 


35.1 


Cing Gyr PSP 


12.7 


BA17 Huntington's2 


9.5 


Cing Gyr PSP2 


1.6 


BA17 Depression 


9.3 


Cing Gyr Depression 


5.3 


BA17 Depression2 


26.8 1 


Cing Gyr Depression2 


3.8 
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Table APE . General oncology screening panel_v_2.4 



Tissue Name 


Dpi C VM /o/ \ 

K.ei, fcXp.(/7o) 

Ag4070 5 Run 
268390170 


Tissue Name 


Rel.Exp.(%) 
Ag4070, Run 
268390170 


Colon cancer 1 


14.3 


Bladder cancer NAT 2 


0.0 


inlnn ranrpr MAT 1 


j.j 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


10.1 


Bladder cancer NAT 4 


8.6 


Colon cancer NAT 2 


7.5 


Adenocarcinoma of the prostate | 
i [27.7 


Colon cancer 3 


47.3 


Adenocarcinoma of the prostate 

2 


0.9 


Colon cancer NAT 3 


14.2 


Adenocarcinoma of the prostate 

j 


16.6 


Colon malignant cancer 
4 


28.3 


Adenocarcinoma of the prostate 
4 


44.4 


Colon normal adjacent 
tissue 4 


1.3 


Prostate cancer NAT 5 


p. 


Lung cancer I 

i 


10.2 


Adenocarcinoma of the prostate L rt 
6 I 19 


Lung NAT I 




0.1 

- 


Adenocarcinoma of the prostate 
7 


3.6 


Lung cancer 2 


100.0 


Adenocarcinoma of the prostate 
8 


2.0 


Lung IN A 1 z 


3.5 


Adenocarcinoma of the prostate 
9 


20.0 


Squamous cell 
carcinoma 3 


17.9 


Prostate cancer NAT 1 0 


0.6 


Lung NAT 3 


0.3 


Kidney cancer 1 


9.1 


metastatic melanoma 1 


15.6 


KidneyNAT 1 


8.1 


Melanoma 2 




Kidney cancer 2 


45.7 


Melanoma'3 


2.2 


Kidney NAT 2 


17.6 


metastatic melanoma 4 


39.8 


Kidney cancer 3 


19.9 


metastatic melanoma 5 


52.0 


Kidney NAT 3 


3.1 


Bladder cancer 1 


2.2 


Kidney cancer 4 


7.2 


Bladder cancer NAT 1 1 


3.0 


Kidney NAT 4 


1.7 


Bladder cancer 2 [ 


2.8 







CNS_neurodegeneration_vl.O Summary: Ag4070 This panel confirms the 
expression of the CG96581-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
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Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Furthermore, low expression of this gene in brain suggests that this gene may 
play a role in central nervous system disorders such as Parkinson's disease, epilepsy, 
multiple sclerosis, schizophrenia and depression. 

The CG96581-01 gene codes for RP42 homolog. The mouse RP42 gene is 
expressed in proliferating neuroblasts, whose human orthologs map to susceptibility loci 
for autism (Mas et al., 2000, Genomics 65: 70-74, PubMed ID: 10777668). Therefore, this 
gene may also play a role in autism and therapeutic modulation of this gene may be useful 
in,the treatment of autism and other neurological disorders. 

Genera l_screening u j>anel_v 1.4 Summary: Ag4070 Results from one 
experiment with the CG96581-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Panel 4.1D Summary: Ag4070 Highest expression of the CG96581-01 gene is 
detected in thymus (CT=30.3). This gene is expressed at high to moderate levels in a wide 
range of cell types of significance in the immune response in health and disease. These 
cells include members of the T-celL B-cell, endothelial cell, macrophage/monocyte, and 
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types 
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney. 
This ubiquitous pattern of expression suggests that this gene product may be involved in 
homeostatic processes for these and other cell types and tissues. This pattern suggests a 
role for the gene product in cell survival and proliferation. Therefore, modulation of the 
gene product with a functional therapeutic may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

Panel CNS_1 Summary: Ag4070 This panel confirms the expression of the 
CG96581-01 gene at low levels in the brains. Please see Panel 
•CNS_neurodegeneration_vl.0 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General oncology screening panel_v_2.4 Summary: Ag4070 Highest 
expression of the CG96581-01 gene is detected in lung cancer sample (CT=30). 
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Interestingly, expression of this gene is higher in number of cancer samples including 
lung, prostate, colon, melanoma and kidney cancers (CTs=30-33) as compared to control 
normal samples (CTs>35). Therefore, expression of this gene can be used as diagnostic 
marker for these cancers and therapeutic modulation of this gene can be beneficial in the 
5 treatment of these cancers. 

AQ. CG96624-01 and CG96624-02: Putative Seven Pass Transmembrane Protein 

Expression of gene CG96624-01 and variant CG96624-02 was assessed using the 
primer-probe sets Agl372 and Ag4082, described in Tables AQA and AQB. Results of the 
1 0 RTQ-PCR runs are shown in Tables AQC, AQD, AQE and AQF. 

Table AQA . Probe Name Agl372 



Primers 


Sequences 


Length 


Start 
Position 


! SEQ ID 
No 


Forward 


5 1 -ctttgttccccaggtcattc-3 ' 


20 


792 


359 


Probe 


TET-5 1 -cgcctggtcagacacattgtaccagt-3 ' -TAMRA 


nr 


1 758 


" ~360 I 


Reverse 


5 ' -ggctggacaccttcgattac-3 ' 


20 


737 


361 ~j 




Table AOB, Probe Name Ag4082 






Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 1 -tgcccttctggcttctctac-3 1 


20 


320 


362 


Probe 


TET-5 ' -cctgcagttcttcaccttgacgctta-3 ' -TAMRA 


26 


354 


363 


Reverse 


5 1 -ttacctgggcaaagtagaggtt-3 ' 


22 


382 


364 



Table AQC CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4082, Run 
214294986 


Tissue Name 


Rel. Exp.(%) 
Ag4082, Run 
214294986 


AD 1 Hippo 


26.8 


Control (Path) 3 Temporal Ctx 


7.3 


AD 2 Hippo 


34.2 


Control (Path) 4 Temporal Ctx 


29.1 


AD 3 Hippo 


16.6 


AD 1 Occipital Ctx 


15.4 


AD 4 Hippo 


9.2 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


95.9 


AD 3 Occipital Ctx 


15.9 


AD 6 Hippo 


52.9 


AD 4 Occipital Ctx 


15.4 


Control.2 Hippo 


27.7 


AD 5 Occipital Ctx 


55 ; 1 


Control 4 Hippo 


15.8 


AD 6 Occipital Ctx 


22.8 


Control (Path) 3 Hippo 


8.5 


Control 1 Occipital Ctx 


4.6 
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j\Lj i temporal utx 


2.3.5 


Control 2 Occipital Ctx 


89.5 


r\u z i em p orai v^ix 


jo .4 


Control 3 Occipital Ctx 


18.2 


AD 3 Temporal Ctx 


11.6 


Control 4 Occipital Ctx 


9.4 


AD 4 Temporal Ctx 


18.2 


Control (Path) 1 Occipital Ctx 


95.9 


AD 5 Inf Temporal Ctx 


100.0 


Control (Path) 2 Occipital Ctx 


8.5 


ADS Sup Temporal Ctx 


51.4 


Control (Path) 3 Occipital Ctx 


6.2 


AD 6 Inf Temporal Ctx 


39.0 


Control (Path) 4 Occipital Ctx 


15.3 


AD 6 Sup Temporal Ctx |36.3 


Control 1 Parietal Ctx 


8.1 


Control 1 Temporal Ctx j 1 1 . 1 


Control 2 Parietal Ctx 


42.9 


Control 2 Temporal Ctx 


66.0 


Control 3 Parietal Ctx 


18.4 


Control 3 Temporal Ctx 


19.1 


Control (Path) 1 Parietal Ctx 


69.3 


Control 3 Temporal Ctx 


15.8 


Control (Path) 2 Parietal Ctx 


18.0 


Control (Path) 1 Temporal 
Ctx 


58.2 


Control (Path) 3 Parietal Ctx 


9.7 


Control (Path) 2 Temporal 
Ctx 


30.1 


Control (Path) 4 Parietal Ctx 


36.3 


Table AOD. General screening panel vl.4 


Tissue Name 


Rel. Exp.(%) 
Ag4082, Run 
218897864 


Tissue Name . 


Rel. Exp.(%) 
Ag4082. Run 
218897864 


Adipose 


1.3 


Renal ca. TK-10 


8.8 


Melanoma* Hs688(A).T 


18.8 


Bladder 


8.2 


Melanoma* Hs688(B).T 


15.7 


Gastric ca. (liver met.) NCI-N87 


27.5 


Melanoma* Ml 4 


38.4 


Gastric ca. KATO III 


21.8 


Melanoma* LOXIMVI 


27.0 


Colon ca. SW-948 


7.2 


Melanoma* SK-MEL-5 


10.3 


Colon ca. SW480 


39.8 


Squamous cell carcinoma 
SCC-4 


21.0 


Colon ca.* (SW480 met) SW620 


7.9 


Testis Pool 


40.3 


Colon ca. HT29 


9.3 


Prostate ca.* (bone met) 
PC-3 


51.4 


Colon ca. HCT-116 


22.2 


Prostate Pool 


3.4 


Colon ca. CaCo-2 


17.2 


Placenta 


32.3 


Colon cancer tissue 


7.9 


Uterus Pool 


1.2 


Colon ca. SW1116 


6.2 


Ovarian ca. OVCAR-3 


12.2 


Colon ca. Colo-205 


5.3 


Ovarian ca. SK-OV-3 


26.1 


Colon ca. SW-48 


5.9 


Ovarian ca. OVCAR-4 


14.2 


Colon Pool 


9.0 


Ovarian ca. OVCAR-5 


31.4 


Small Intestine Pool 


6.2 


Ovarian ca. IGROV-1 


17.6 


Stomach Pool 


2.8 
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Ovarian ca. OVCAR-8 


9.3 


Bone Marrow Pool 


2.4 


Ovary 


7.2 


Fetal Heart 


3.6 


Breast ca. MCF-7 


17.4 


Heart Pool 


5.1 


Breast ca. MDA-MB-23 1 


80.1 


Lymph Node Pool 


10.1 


Breast ca. BT 549 


38.2 


Fetal Skeletal Muscle 


2.0 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


2.1 


IBreast ca. MDA-N 


37.6 


Spleen Pool 


3.9 


iBreast Pool 


11.9 


Thymus Pool 


6.4 


Trachea 


6.6 


CNS cancer (glio/astro) U87-MG 


55.9 


Lung 


A A 
U.4 


CNS cancer (glio/astro) U-l 18- 
MG 


j>8.4 


jFetal Lung 

I 


12.0 


CNS cancer (neuro:met) SK-N- 
AS 


13.8 


{Lung ca. NCI-N417 


7,5 


CNS cancer (astro) SF-539 


8.8 


ILung ca. LX-I 


12.0 


CNS cancer (astro) SNB-75 


32.5 


jLung ca. NCI-H146 


17.2 


CNS cancer (glio) SNB-19 


17.1 


.Lung ca. SHP-77 


26.6 


CNS cancer (giio) SF-295 


_ . _ 


ILung ca. A549 


18.2 


Brain (Amygdala) Pool 


10.2 


Lung ca. NCI-H526 


20.6 


- ., 

Brain (cerebellum) 


38.2 


.Lung ca. NCI-H23 


20.9 


Brain (fetal) 


23.3 


:Lung ca. NCI-H460 


7.6 


Brain (Hippocampus) Pool 


10.5 


Lung ca. HOP-62 


6.4 


Cerebral Cortex Pool 


15.2 


Lung ca. NCI-H522 


47.3 


Brain (Substantia nigra) Pool 


19.8 


Liver 


2.4 


Brain (Thalamus) Pool 


16.3 


Fetal Liver 


12.9 


Brain (whole) 


17.9 


Liver ca. HepG2 


3.8 


Spinal Cord Pool 


9.7 


Kidney Pool 


9.5 


Adrenal Gland 


11.2 


Fetal Kidney 


5.3 


Pituitary gland Pool 


2.5 


Renal ca. 786-0 


15.3 


Salivary Gland 


6.7 


Renal ca. A498 


8.1 


Thyroid (female) 


8.8 


Renal ca. ACHN 


7.3 


Pancreatic ca. CAPAN2 


21.9 


Renal ca.UO-31 


17.0 


Pancreas Pool 


10.7 


Table AOE. Panel 1.2 


Tissue Name 


Rel. Exp.(%) 
Agl372,Run 
134887810 


Tissue Name 


Rel. Exp.(%) 
Agl372, Run 
134887810 


Endothelial cells 


13.1 


Renal ca. 786-0 


2.6 


Heart (Fetal) 


47.6 


Renal ca. A498 


10.0 



641 



WO 03/010327 



PCT/US02/14199 



rancrcas 




Renal ca. RXF 393 


0.7 


rancreauc ca. c/vr/YIN z 


I.J 


Renal ca. ACHN 


7.5 


/\urenai vjriaiiQ 


H A 


Renal ca. UO-31 


6.4 


i iiyiuiu 


l.J 


Renal ca.TK-10 


5.8 


oaiivary gidiiu 




Liver 


8.5 


Pitnifarv/ olonrl . 
.ruuitdiy iiiduu 


U.Z 


Liver (fetal) 


3.5 


jDl all 1 ^icmi ) 


1 .o 


Liver ca. (hepatoblast) HepG2 


6.8 


; DIaIII yVVIiUIC^ 


0 A 


Lung 


1.4 


Brain f'amvoHaljA 




Lung (ietal) 


2.1 


Brain (cerebellum) 


5.3 


Lung ca. (small cell) LX-1 


7.7 


Brain (hippocampus) 


32.3 


Lung ca. (small cell) NCI-H69 


7.6 


jBrain (thalamus) 


20.3 


Lung ca. (s.cell var.) SHP-77 


1.1 


jCerebral Cortex 


! 100.0 


Lung ca. (large cell)NCI-H460 


13.9 


jSpinal cord 


[3.7 


Lung ca. (non-sm. cell) A549 


9.5 


jglio/astro U87-MG J 15.4 


Lung ca. (non-s.cell) NC1-H23 


19.7 


jglio/astro U-l 18-MG 


10 1 


Lung ca. (non-s.cell) HOP-62 


8.7 


|astrocytoma SW1783 


3.8 


Lung ca. (non-s.cl) NCI-H522 


71.7 j 


jneuro*; met SK-N-AS 


14.8 


Lung ca. (squam.) SW 900 


?± J 


jastrocytoma SF-539 


2.8 


Lung ca. (squam.) NCI-H596 


19.6 j 


jastrocytoma SNB-75 


2.5 


Mammary gland 


3.4 


glioma SNB-19 


12.6 


Breast ca.* (pl.ef) MCF-7 


2.7 


glioma U251 


0.0 


Breast ca.* (pl.ef) MDA-MB- 
231 


6.4 


glioma SF-295 


19.5 


Breast ca.* (pi. ef)T47D 


11.6 


Heart 


15.0 


Breast ca. BT-549 


31 


Skeletal Muscle 


4.5 


Breast ca. MDA-N 


50.7 


Bone marrow 


2.5 


Ovary 


22.8 


Thymus 


1.7 


Ovarian ca. OVCAR-3 


0.0 


Spleen 


3.6 


Ovarian ca. OVCAR-4 


19.6 


Lymph node 


1.7 


Ovarian ca. OVCAR-5 


25.2 ; 


Colorectal Tissue 


2.1 


Ovarian ca. OVCAR-8 


23.7 


Stomach 


9.7 


Ovarian ca. IGROV-1 


6.9 


Small intestine 


14.5 


Ovarian ca. (ascites) SK-OV-3 


10.4 


Colon ca. SW480 


2.6 


Uterus 


3.7 


Colon ca.* SW620 (SW480 
met) 


1.5 


Placenta 


32.8 


Colon ca. HT29 


1.2 


Prostate j; 


23.0 


Colon ca. HCT-116 


1.0 


Prostate ca.* (bone met) PC-3 j. 


52.5 
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Colon ca. CaCo-2 


3.7 


Testis 


10.4 


Colon ca. Tissue 
(OD03866) 


1.0 ' 


Melanoma Hs688(A).T 


3.0 


Colon ca. HCC-2998 


7.1 


Melanoma* (met) Hs688(B).T 


2.5 


Gastric ca.* (liver met) 
NCI-N87 


7.5 


Melanoma UACC-62 


29.3 


Bladder 


9.5 


Melanoma MI4 


9.3 ■ 


Trachea 


1.3 iMelanoma LOX IMVI 


2.6 "! 


Kidney 


50.3 


jMelanoma* (met) SK-MEL-5 


_l 

4.4 ! 


Kidney (fetal) 


3.1 






Table AOF. Panel 4. ID 


Tissue Name 


Rel. Exp.(%) 
Ag4082, Run 
171809988 


Tissue Name 


Rel. Exp.(%) 
Ag4082, Run 
171809988 


Secondary Th 1 act 


44.1 


HUVEC IL-lbeta 


29.3 


Secondary Th2 act 


55.5 


HUVEC IFN gamma 


24.7 


Secondary Trl act 


45.1 


riUVtC 1 INr alpha + IrN 
gamma 


29.5 

i 


Secondary Th] rest 


9.2 


HUVEC TNF alpha + IL4 


47.0 | 


Secondary Th2 rest 


10.8 


ui r\/pr^ it 1 1 
nUV t,L. JL- 1 1 


I j.v j 


Secondary Tr 1 rest J 1 0.8 


Lung Microvascular EC none 


58.2 


Primary Thl act J29.9 


Lung Microvascular EC 
TNFaloha+ IL-lbeta 


66.4 


Primary Th2 act 


47.6 


Microvascular Dermal EC none 


20.2 


Primary Trl act 


45.7 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


37.1 


Primary Thl rest 


9.2 


Bronchial epithelium TNFalpha 
+ ILlbeta 


31.2 


Primary Th2 rest 


5.8 


Small airway epithelium none 


9.2 


Primary Trl rest 


14.7 


Small airway epithelium 
TNFalpha + IL-lbeta 


17.9 


CD45RA CD4 lymphocyte 
act 


40.6 


Coronery artery SMC rest 


16.0 


CD45RO CD4 lymphocyte 
act 


45.1 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


18.3 


CD8 lymphocyte act 


51.8 


Astrocytes rest 


25.3 


Secondary CD8 
lymphocyte rest 


60.7 


Astrocytes TNFalpha + IL-1 beta 


27.0 


Secondary CD8 
lymphocyte act 


29.5 


KU-8 12 (Basophil) rest 


13.2 


CD4 lymphocyte none 6.9 


KU-8 12 (Basophil) 


37.4 | 
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PMA/ionomycin 




2ryThl/Th2/Trl anti- 
CD95CH11 


19.3 


CCD1 106 (Keratinocytes) none 


31.2 


LAK cells rest 


11/1 

J 1.4 


CCD 1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


23.0 


LAK cells IL-2 


28.9 


Liver cirrhosis 


4.5 


!LAKceIisIL-2+IH2 


28.1 


NCI-H292 none 


40.3 


:LAK cells IL-2+IFN 
gamma 


14.7 


NCI-H292 IL-4 


0.2 


LAK cells IL-2+ IL-18 


25.9 


NCI-H292 IL-9 


— — . .. i. — 

100.0 


LAK cells PMA/ionomycin 


27.4 


NCI-H292 IL-13 


83.5 


NK Cells IL-2 rest 


55.9 


NC1-H292 1FN gamma 


99.3 


Two Way MLR 3 day 


45.1 


HPAEC none 


16.5 


Two Way MLR 5 day 


36.9 


HPAEC TNF alpha + IL-1 beta 


68.3 


Two Way MLR 7 day 


17.9 


Lung fibroblast none 


27.0 


PBMC rest 


9.6 


Lung fibroblast TNF alpha + 1L- 
1 beta 


- — i 

29.7 


:PBMC PWM 


49.3 


Lung fibroblast IL-4 


29.9 


; PBMC PHA-L )25.5 


Lung fibroblast IL-9 


28.3 ] 


iRamos (B cell) none |35.4 


Lung fibroblast IL-13 


26.4 i 


;Ramos (B cell) ionomycin |3 1 .6 


Lung fibroblast I FN gamma 


32.8 I 


B lymphocytes PWM 


34.4 


Dermal tibroblast CCD 1070 rest 


23.2 


B lymphocytes CD40L and 
IL-4 


41.2 


n«„„ni f:u M ui«r.+ rrTMAan 

Dermal tibroblast CCD 1070 
TNF alpha 


40.3 


EOL-1 dbcAMP 


12.8 


Dermal fibroblast CCD 1070 IL-1 
beta 


19.6 . 


EOL-1 dbcAMP 
PMA/ionomycin 


15.8 


Dermal fibroblast IFN gamma 


28.1 


Dendritic cells none 


24.5 


Dermal fibroblast IL-4 


30.8 


Dendritic cells LPS 


37.6 


Dermal Fibroblasts rest 


24.8 


Dendritic cells anti-CD40 


32.1 


Neutrophils TNFa+LPS 


3.5 


Monocytes rest 


11.7 


Neutrophils rest 


8.0 


Monocytes LPS 


33.7 


Colon 


9.0 


Macrophages rest 


72.2 


Lung 


14.9 


Macrophages LPS 


25.2 


Thymus 


11.3 


HUVEC none 


15.7 


Kidney 


24.0 


HUVEC starved 


23.3 







CNS_neurodegeneration_vl.O Summary: Ag4082 This panel does not show 
differential expression of the CG96624-01 gene in Alzheimer's disease. However, this 
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expression profile confirms the presence of this gene in the brain. Please see Panel 1 A for 
discussion of utility of this gene in the central nervous system. 

. General_screeningjpanel_vl.4 Summary: Ag4082 Highest expression of the 
CG96624-01 gene is seen in a breast cancer cell line (CT=25.8). Significant levels of 
5 expression are also seen in a cluster of samples derived from breast, brain, colon, lung, 
ovarian, prostate and melanoma cancer cell lines. 

In addition, this gene is expressed at much higher levels in fetal lung (CT=28.9) 
when compared to expression in the adult counterpart (CT=36.7). Thus, expression of this 
gene may be used to differentiate between the fetal and adult source of this tissue. 

10 The ubiquitous expression of this gene in this panel and the higher levels of 

expression in cancer cell lines and some fetal tissues suggest a role for tins gene product in 
cell survival and proliferation. 

Among tissues with metabolic function, this gene is expressed at moderate levels 
in pituitary, adrenal gland, pancreas, thyroid, and adult and fetal skeletal muscle, heart. 

15 and liver and at low but significant levels in adipose. This widespread expression among 
these tissues suggests that this gene product may play a role in normal neuroendocrine and 
metabolic function and that disregulated expression of this gene may contribute to 
neuroendocrine disorders or metabolic diseases, such as obesity and diabetes. 

This gene is also expressed at high to moderate levels in the CNS, including the 

20 hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 

Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Panel 1.2 Summary: Ag4082 Highest expression of the CG96624-01 gene is seen 

25 in the cerebral cortex (CT=23.8). Expression of this gene is ubiquitous in this panel, with 
prominent levels of expression in lung and breast cancer cell lines and normal kidney 
(CT=24.8). Thus, expression of this gene could be used as a marker of lung and breast 
cancer and to differentiate kidney from fetal kidney (CT=28.8). Please see Panel 1.4 for 
further discussion of utility of this gene. 

30 Panel 4.1D Summary: Ag4082 Highest expression of the CG96624-01 gene is 

seen in IL-9 treated NCI-H292 cells (CT=29.6). In addition, this gene is expressed at 
moderate levels in a wide range of cell types of significance in the immune response in 
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health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_vl .4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 



AR. CG96747-01: Voltage-Dependent Calcium Channel Gamma-3 Subunit - Like 
Protein 

Expression of gene CG96747-01 was assessed using the primer-probe set Ag4076, 
described in Table ARA. Results of the RTQ-PCR runs are shown in Tables ARB and 
ARC. 

TabJeARA. Probe Name Ag4076 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 • -agcgtgtctctgctgcttt-3 • 


19 


421 


365 


Probe 


TET-5 ' -tcaccggctgctacttcctgctg-3 ' -TAMRA 


23 


440 


366 


Reverse 


5 ■ -aggtgcgagtagctgatgtaga-3 ' 


22 


494 


367 



Table ARB, CNS_neurodegeneration_vl.O 



1 

Tissue Name 


Rel. Exp.(%) 
Ag4076, Run 
214294983 


Tissue Name 


Rel. 

Exp.(%) 
Ag4076, 
Run 

214294983 


AD 1 Hippo 


15.1 


Control (Path) 3 Temporal Ctx 


0.0 


AD 2 Hippo 


71.2 


Control (Path) 4 Temporal Ctx 


13.3 


AD 3 Hippo 


6.4 


AD 1 Occipital Ctx 


9.1 


AD 4 Hippo 


7.1 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


53.2 ~^ 


AD 3 Occipital Ctx 


2.4 
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AD 6 Hippo 


15.2 


AD 4 Occipital Ctx 


23.2 


Control 2 Hippo 


34.2 


AD 5 Occipital Ctx 


14.0 


Control 4 Hippo 


29.3 


AD 6 Occipital Ctx 


49.0 


Control (Path) 3 Hippo 


3.5 


Control 1 Occipital Ctx 


11.7 


AD 1 Temporal Ctx 


17.0 


Control 2 Occipital Ctx 


74.7 


AD 2 Temporal Ctx 


43.8 


Control 3 Occipital Ctx (9.0 ~^ 


AD 3 Temporal Ctx 


0.0 


Control 4 Occipital Ctx - 1 1.3 


AD 4 Temporal Ctx 


13.6 


Control (Path) 1 Occipital Ctx ;97.9 


AD 5 Inf Temporal Ctx 


100.0 


Control (Path) 2 Occipital Ctx 


8.8 


AD 5 SupTemporal Ctx 


48.0 [Control (Path) 3 Occipital Ctx 


0.9 


AD 6 Inf Temporal Ctx 


27.9 jControl (Path) 4 Occipital Ctx 


4.2 


AD 6 Sup Temporal Ctx 


1 7.9 jControl 1 Parietal Ctx 


32.8 


Control 1 Temporal Ctx 


16.7 


Control 2 Parietal Ctx 


48.3 


Control 2 Temporal Ctx 


50.3 jControl 3 Parietal Ctx j 1 4.0 


|Control 3 Temporal Ctx 


8.8 (Control (Path) 1 Parietal Ctx [ 


42.9 


jControl 4 Temporal Ctx 


25.9 

27.0 : 


Control (Path) 2 Parietal Ctx 1 


17.4 


Control (Path) 1 Temporal 
Ctx 1 


Control (Path) 3 Parietal Ctx b.O ! 

* i 


Control (Path) 2 Temporal 
Ctx 


7.0 ! 

1 


" "i 

Control (Path) 4 Parietal Ctx - 


1 

5.8 j 

i 



Table ARC . General screcning panel vl. 4 



Tissue Name 


Rel. Exp.(%) 
Ag4076, Run 
218903895 


Tissue Name 


Rel. 

Exp.(%) 
Ag4076, 
Run 

218903895 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* Hs688(B).T 


0.0 


Gastric ca. (liver met.) NC1-N87 


0.0 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK-MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell carcinoma SCC- 
4 


0.0 


Colon ca * (SW480 met) SW620 


0.0 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


o.o 1 
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ovarian ca. UVLAKo 


A A 

0.0 


Colon ca. Colo-205 


0.0 


wvanan ca. biv-ov-i 


0.0 


Colon ca. SW-48 


0.0 


Ovarian r»o CWIC* AD /I 

v^vdnan ca. uvlak-h 


v.y) 


/""» I r» i 

Colon Pool 


0.0 


vyvanan ca. vjvcako 


0.0 


Small Intestine Pool 


0.0 


Ovarian r»o 1 

vjvanan ca. ivjKvj v - 1 


0.0 


Stomach Pool 


0.0 


Ovarian CW1C A O Q 

vjvarian ca. uvlak-o 


0.0 


Bone Marrow Pool 


0.0 




0.0 


Fetal Heart 


0.0 


Dicasi ca. ivit^r-/ 


0.0 


[Heart Pool 


foT 


Dreast ca. ivilja-iVId-zj I 


0.0 [Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


0.0 


CNS cancer (glio/astro) U87-MG 


0.0 


Lung 




u.u 


-■ '■ -...i... ... ., — — — _ 

CNS cancer (glio/astro) U-l 1 8- 
MG 


0.0 


Fetal Lung 

{ — 




CNS cancer (neuro;met) SK-N-AS 


0.0 


Lung ca. NLI-N4 1 7 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX- 1 


0.0 


CNS cancer (astro) SNB-75 


0.0 


Lung ca. Nd-Hl4o 


0.0 


CNS cancer (glio) SNB-19 


0.0 


Lung ca. orir-/ / 


0.0 


CNS cancer (glio) SF-295 


0.0 


LUng ca. 


0.0 


Brain (Amygdala) Pool 


54.7 


Lung ca. JNC1-H526 


0.0 


Brain (cerebellum) 


37.1 


Lung ca. ML.1-H23 


0.0 


Brain (fetal) 


0.0 


Lung ca. JnC1-H4oU 


0.0 


Brain (Hippocampus) Pool 


41.2 


Lung ca. rlvjr-oz 


0.0 


Cerebral Cortex Pool 


38.2 


i>ung ca. iNt^i-rijzz 


0.0 


Brain (Substantia nigra) Pool 


100.0 


Liver 


0.0 


Brain (Thalamus) Pool 


45.7 


tTo'f'i 1 T i\/Ar* 

reiai l^iver 


0.0 


Brain (whole) 


18.2 


Liver ca. riepoz 


0.0 


Spinal Cord Pool 


67.8 


Kidney Pool 


0.0 


Adrenal Gland 


A A 

0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



CNS_neurodegeneration_vl.O Summary: Ag4076 This panel confirms the 
expression of the CG96747-01 gene at low levels in the brains of an independent group of 

648 



WO 03/010327 



PCT/US02/14199 



individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1 A for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 
5 General_screening_panel_vL4 Summary: Ag4076 Highest expression of the 

CG96747-01 gene is detected in substantia nigra of brain (CT=30.l). Interestingly 
expression of this gene is exclusive to brain regions examined. Therefore, expression of 
this gene can be used to distinguish brain from other other samples used in this panel. 
Furthermore, therapeutic modulation of this gene through use of small molecule target 

10 may be useful in the treatment of neurological disorders such as Alzheimer's disease. 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

The CG96747-01 gene codes for a protein belonging to PMP- 
22/EMP/MP20/Claudin family. Proteins belonging to this family are small integral 
membrane glycoproteins which arc evolutionarily related including eye lens specific 

15 membrane protein 20 (MP20 or MP19); epithelial membrane protein- 1 (EMP-1/TMP), 
epithelial membrane protein-2 (EMP-2). and peripheral myelin protein 22 (PMP-22). 
PMP-22 plays a role both in myelinization and in cell proliferation. Mutations affecting 
PMP-22 are associated with hereditary motor and sensory neuropathies such as Charcot- 
Marie-Tooth disease type 1 A (CMT-1 A) in human or the trembler phenotype in mice 

20 (Jetten AM, Suter U, 2000, Prog Nucleic Acid Res Mol Biol 64:97-129, PMID: 10697408; 
PFAM: IPR004031). Thus, the CG96747-01 gene product may also play a role in 
hereditary motor and sensory neuropathies such as CMT-1 A and therapeutic modulation 
of this gene product may be useful in the treatment of this disease. 

Panel 4.1D Summary: Ag4076 Expression of the CG96747-01 gene is 

25 low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4076 Expression of the 
CG96747-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

30 AS- CG97462-01: Prohibitin 

Expression of gene CG97462-01 was assessed using the primer-probe set Ag41 11, 
described in Table ASA. 
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Table ASA. Probe Name Ag4111 



Primers Jsequences 


Length 


Start 
Position 


SEQID No 


Forwardjs • -ggtctccagggagcaattac-3 • 


20 


432 


368 


Probe |TET-5 • -ttacagagcgagcagccacctttg-3 • -tamra 


24 


452 


369 


Reverse j5 ' -atgtgtcaaggacacgtcatc-3 ' 


21 


487 


370 


CNb_neurodcgeneration_vl.O Summary: Ag4111 Expi 


ession of t 


he CG97462 



10 



shown). 

General_screening_paiicI_vL4 Summary: Ag4111 Expression of the CG97462- 
01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

Panel 4.1 D Summary: Ag411I Expression of the CG97462-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag41 1 1 Expression of the 
CG97462-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 



15 



AT. CG97472-01: Glucose Transporter 

Expression of gene CG97472-01 was assessed using the primer-probe set Ag41 13, 
described in Table ATA. Results of the RTQ-PCR runs are shown in Tables ATB, ATC, ' 



ATD and ATE. 



Table ATA. Probe Name Ag4113 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 • -agctccttttctgttggactgt-3 • 


22 


235 


371 


Probe 


TET-5 • -tcaacagctttgacaggcgtaattca-3 • -TAMRA 


26 


260 


372 


Reverse 


5 • -cagccaacaggttgacagtaag-3 ■ 


22 


289 


373 



Tissue Name 


Rel. Exp.(%) 
Ag4I13, Run 
219543072 


Tissue Name 


Rel. 

Exp.(%) 
Ag4113, 
Run 

219543072 


Adipose 


3.4 


Renal ca.TK-10 


16.5 


Melanoma* Hs688(A).T 


2.4 


Bladder 


11.4 
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Melanoma* Hs688(B).T 


1.0 


Gastric ca. (liver met.) NCI-N87 


23.3 


Melanoma* M14 


2.1 


Gastric ca. KATO III 


2.0 


Melanoma* LOXIMV1 


0^8 


Colon ca. SW-948 


0.2 


Melanoma* SK-MEL-5 


2.3 


Colon ca. SW480 


5.0 


Squamous cell carcinoma 
SCC-4 


A A 
U.O 


Colon ca.* (SW480 met) SW620 


3.9 


Testis Pool 


5.5 
0.0 


Colon ca. HT29 


1.6 


Prostate ca.* (bone met) 
PC-3 


Colon ca. HCT-116 


1.3 


Prostate Pool 


7.4 


Colon ca. CaCo-2 


2.1 


Placenta 


1.0 


Colon cancer tissue 


2.8 


Uterus Pool 


6.2 


Colon ca. SW1 1 16 


0.0 


Ovarian ea. OVCAR-3 


4.3 


Colon ca. Colo-205 


0.6 


Ovarian ca. SK-OV-3 


3.D 


Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


0.0 


Colon Pool 


.37.1 


Ovarian ca. OVCAR-5 


28.7 


Small Intestine Pool |7L2 


Ovarian ca. IGROV-1 


2.0 


Stomach Pool 


& 


Ovarian ca. OVCAR-8 


1.3 


Bone Marrow Pool 


17.S 


Ovary 


14.3 


Fetal Heart !3.6 


Breast ca. MCF-7 


2.0 


Heart Pool 


11.9 


Breast ca. MDA-MB-23 1 


6.0 


Lymph Node Pool 


36.1 


Breast ca. BT 549 


6.3 


Fetal Skeletal Muscle 


3,9 


Breast ca. T47D 


25.7 


Skeletal Muscle Pool 


2.8 


Breast ca. MDA-N 


3.1 


Spleen Pool 


5.5 


Breast Pool 


35.1 


Thymus Pool 


15.6 


Trachea 


8.7 


CNS cancer (glio/astro) U87-MG 


8.8 


Lung 


24.5 


CNS cancer (glio/astro) U-l 18- 
MG 


1.7 


Fetal Lung 


16.5 


CNS cancer (neuro;met) SK-N- 
AS 


5.1 


Lung ca. NCI-N4 1 7 


0.9 


CNS cancer (astro) SF-539 


2.0 


Lung ca. LX-1 


8.9 


CNS cancer (astro) SNB-75 


4.0 


Lung ca.NCI-H 146 


0.2 


CNS cancer (glio) SNB-19 


4.4 


1 ■■•*#> n« COO *7~7 

Lung ca. aHP-77 


1.7 


CNS cancer (glio) SF-295 


7.6 


Lung ca. A549 


4.4 


Brain (Amygdala) Pool 


4.1 


Lungca. NCI-H526 


1.9 


Brain (cerebellum) 


3.6 


Lung ca. NCI-H23 


9.9 


Brain (fetal) 


8.7 


Lung ca. NCI-H460 


5.9 


Brain (Hippocampus) Pool 


7.7 


Lung ca. HOP-62 


5.3 


Cerebral Cortex Pool 


8.7 
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Lune ca NPT-HS?? 


1 o 
J.o 


Brain (Substantia nigra) Pool 


7.9 


Liver 


A A 

0.4 


Brain (Thalamus) Pool 


12.7 


I vial 1—rlVwI 




Brain (whole) 


6.0 


Liver ca Hpnfi9 




Spinal Cord Pool 


7.0 


Kidnev Pool 


i fin f\ 


Adrenal Gland 


11.0 


Fetal Kidney 


12.9 


Pituitary gland Pool 


O.J 


Renal ca. 786-0 


3.6 


Salivary Gland 


2.4 


Renal ca. A498 


0.0 


Thyroid (female) 


2.6 


Renal ca. ACHN 


2.7 


Pancreatic ca. CAPAN2 


9.8 


Renal ca. UO-3 1 


0.5 


Pancreas Pool 


30.4 


Table ATC. Panel 4 JD 




• 

Tissue Name 


Rei. Exp.(%) 
Ag41 13, Run 
172792545 


Tissue Name 


Rel. 

Exp.(%) 
As4l 13 
Run 

172792545 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


2.8 


Secondary Th2 act 


3.8 


HUVEC I FN gamma 


41.2 


Secondary Trl act 


1.8 


HUVEC TNF alpha + 1FN gamma 


9.7 


Secondary Thl rest 


12.0 


HUVEC TNF alpha + IL4 


1.4 


Secondary Th2 rest 


21.8 


HUVEC 1L-1 1 


6.1 


Secondary Trl rest 


24.7 


Lung Microvascular EC none 


21.8 


Primary Thl act 


5.3 


Lung Microvascular EC TNFalpha 
+ ]L-lbeta 


31.6 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


4.0 


Primary Trl act 


0.3 


Micros vasular Dermal EC 
TNFalpha + IL-1 beta 


14.2 


Primary Thl rest 


8.7 


Bronchial epithelium TNFalpha + 
ILlbeta 


5.7 


Primary Th2 rest 


6.4 


Small airway epithelium none 


0.0 


Primary Trl rest 


9.6 


Small airway epithelium 
TNFalpha +IL-1 beta 


0.0 


CD45RA CD4 lymphocyte 
act 


10.3 


Coronery artery SMC rest 


0.0 


CD45RO CD4 lymphocyte , 
act 


u. 


Coronery artery SMC TNFalpha + 
IL-lbeta 


3.3 


CD8 lymphocyte act i 


1.4 


Astrocytes rest 


1.3 


Secondary CD8 
lymphocyte rest 


).6 


Astrocytes TNFalpha + IL- 1 beta 1 


3.0 


Secondary CD8 
lymphocyte act 


).0 , 


OJ-812 (Basophil) rest ; 


>3.8 
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CD4 lymphocyte none 


5.3 


KU-812CBasoDhin 
PMA/ionomycin 


23.0 


2ryThI/Th2yTrl anti- 
CD95 CHI 1 


27.7 


CCD\ 106 (Keratinocytes) none 


0.0 


LAIC cells rest 


12.9 


CCD1 106 (Keratinocytes) 
TNFaIpha + IL-1 beta 




LAK cells IL-2 


k7 


jLiver cirrhosis 


1 

3.6 | 


LAK cells IL-2+IL-12 


;13.2 


;NCI-H292 none 


12.9 


LAK cells IL-2+IFN 

gall ll lid 


1 1 — ' 
!5.5 'NCI-H292 IL-4 

! 1 




I AK rplk If II 1 R 


10.4 


|NCI-H292 IL-9 


17.3 


LAK cells PMA/iononiycin 


jo.o 


Incuh?9? II -11 


12.3 


NK Cells IL-2 rest 


17.6 


NCI-H292 I FN gamma 


21.9 


Two Way MLR 3 day 


20.3 


HPAEC none 


14.8 


Two Way MLR 5 day 


10.9 


jHPAEC TNF alpha + IL-I beta 


17.2 


Two Way MLR 7 day 


0.9 


Lung fibroblast none 


i:i ' ~ 


PBMC rest 


.2.5 


Lung fibroblast TNF alpha + IL-1 
beta 


5.4 ! 


PBMC PWM 

. — 


7.3 


r* ■ 1 * 

I nno fihrnhlast II -4 


4.0 1 


jPBMC PHA-L 


10 7 

IV. / 


I nn<T fihrnhlast II -0 




Ramos (B cell) none 


17 ~ 


iLun" fibroblast 11-11 


10.2 j 


Ramos (B cell) ionomycin 


1.5 


Luns fibroblast I FN era mm a 


4.8 I 


B lymphocytes PWM 


1.5 


Dermal fibroblast CCDI 070 rpst 


1 ~ -Z 


B lymphocytes CD40L and 
IL-4 


0.9 


Dermal fibroblast CCD1070 TNF 
alpha 


12.8 


EOL-1 dbcAMP 


6.0 


Dermal fibroblast CCD 1070 IL-1 
beta 




EOL-1 dbcAMP 

PK4 A /innnmvp in 

1 lrU~U IVHIUI Jljrkslll 


3.1 


Dermal fibroblast I FN gamma 


2.3 


Dpnrlritip r*f»llc nnnp 
A~/wIIUl 1UL, tCUb nunc 


9.6 


Dermal fibroblast IL-4 


2.5 




21.5 


Dermal Fibroblasts rest 


1 1.5 


Dendritic cells anti-PD40 

j_/ciiui jliw Wvllo all ll-V^L/HU 


5.3 


Neutrophils TNFa+LPS 


11.1 


IVlUIUJwjr ICo I Col 


14.0 


Neutrophils rest 


92.7 


Monocytes LPS 


88.9 


Colon 


24.7 


Macrophages rest 


8.7 


Lung 


29.7 


Macrophages LPS 


6.5 


Thymus 


56.6 


HUVEC none 


3.7 


Kidney 


100.0 


HUVEC starved ] 


3.1 






Table ATD. Panel 5D 


Tissue Name 


Rel. Exp.(%) 


Tissue Name 


Rel. 
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Ag4113, Run 
172784076 




Exp.(%) 
Ag4113, 
Run 

172784076 


07457 Patient- 

02go_adipose 


71.2 


94709_Donor 2 AM - A_adipose 


0.0 


97476 Patipnt- 

s i ~ i \j i all-in 

07sk_skeletal muscle 


13.2 


94710_Donor 2 AM - B_adipose 


0.0 


97477JPatient-07ut_uterus 


35.4 J9471 l_Donor 2 AM - C_adipose 


0.0 


Q747X Patipnt- 
07pl_placenta 


0.0 


1 

94712_Donor 2 AD - A_adipose 


36.3 


074K 1 Pntipnt- 

OSskskeletal muscle 


40.6 


94713_Donor 2 AD - B_adipose 


12.4 


97482 JPatient-08ut_uterus 


9.4 


94714_Donor 2 AD - C_adipose 


29.3 


0748*} Patient- 

08pl_placenta 


22.5 


94742_Donor 3 U - 
AMesenchymal Stem Cells 


0.0 


i97486_Patient- 
j09sk_skeletal muscle 


0.0 


94743_Donor 3 U - 
B_Mesenchymal Stem Cells 


6.8 


97487_Patient-09ut_uterus 


30.8 


94730_Donor 3 AM - A_adipose 


0.0 


074525? Patient 
i — 

j09pl_placenta 


0.0 


9473 l_Donor 3 AM - B_adipose 


0.0 


! O7J.00 Patient 1Ant utonie 

.7 / 47Z r auiciii- 1 uul iiierus 


32.5 |94732_Donor 3 AM - C_adipose 


0.0 


0740^ Patient 

1 OpI ^placenta 


29.3 


94733_Donor 3 AD - A_adipose 


10.6 


0740S P^tiVnt 
" / *T7J rallCill- 

1 lgo_adipose 


100.0. 


94734_Donor 3 AD - B_adipose 


11.6 


0740^ Patient 
y i Hy%j jraiieni.- 

1 lsk_skeletal muscle 


23.2 


94735_Donor 3 AD - C_adipose 


0.0 


y iHy § ratieni-i iui uterus 


26.4 


77 1 38_Liver_HepG2untreated 


0.0 


0740R Pntient- 
7/t70 r alien l~ 

1 Ipljplacenta 


12.9 


73556_Heart_Cardiac stromal 
cells (primary) 


15.7 


I2go_adipose 


97.9 


8 1735_Small Intestine 


44.1 


9750I_Patient- 

1 7<2lf ckpiptal mncrla 


82.9 


72409_Kidney_Proximal 
Convoluted Tubule 


0.0 


97502_Patient- 1 2ut_uterus 


15.7 


82685_Small intestineDuodenum 


21.6 


97503_Patient- 
12pl_placenta 


25.2 


90650_Adrenal_AdrenocorticaI 
adenoma 


ft ft 

v.U 


94721_Donor2U- 
A Mesenchymal Stem 
Cells 


0.0 


72410JCidney_HRCE 


37.1 


94722_Donor2U- 
B Mesenchymal Stem 
Cells 


0.0 


72411_K.idney_HRE 


6.9 
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94723 J)onor2U- 
C Mesenchymal Stem 
Cells 


14.7 


173 139_UterusJUterine smooth 
muscle cells 


0.0 


Table ATE. General oncology screening pan^l v 7 d 


jTissue Name 

I 
1 

j. . 


Rel. Exp.(%) 
Ag4l 13, Run 
268389993 


Tissue Name 


Rel. 

Exd (%} 
Ag4113. 
Run 

268389993 


jColon cancer 1 


7.7 -Bladder cancer NAT 2 


7.0 


Colon cancer NAT 1 


1.8 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


2.6 


Bladder cancer NAT 4 


3.1 


Colon cancer NAT 2 


2.0 


Adenocarcinoma of the prostate 1 


70.2 


Colon cancer 3 


7.6 


Adenocarcinoma of the prostate 2 


3.7 


Colon cancer NAT 3 


15.4 


Adenocarcinoma of the prostate 3 


11.0 


jColon malignant cancer 4 


7.7 


Adenocarcinoma of the prostate 4 


16.8 


[Colon normal adjacent 
•tissue 4 


2, 


Prostate cancer NAT 5 


1.6 


jLung cancer 1 j6. 1 


Adenocarcinoma of the prostate 6 


3.1 


ILung NAT 1 


1.5 


Adenocarcinoma of the prostate 7 7.3 


|Lung cancer 2 


9.1 


Adenocarcinoma of the prostate 8 


2.9 


|Lung NAT 2 


4 9 (Adenocarcinoma of the prostate 9 


37.4 


Squamous cell carcinoma 3 


1 0.2 j Prostate cancer NAT 1 0 


3.3 


Lung NAT 3 


0.6 


Kidney cancer 1 


35.6 


metastatic melanoma 1 


46.3 


KidneyNAT 1 


8.9 


Melanoma 2 


0.5 


Kidney cancer 2 


46.3 


Melanoma 3 


0.9 


Kidney NAT 2 


9.2 


metastatic melanoma 4 


90.8 


Kidney cancer 3 


29.1 


metastatic melanoma 5 


100.0 


Kidney NAT 3 


5.6 


Bladder cancer 1 


2.2 


Kidney cancer 4 


5.8 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


5.3 


Bladder cancer 2 


6.0 







CNS_neurodegeneration_vLO Summary: Ag4 1 13 Results from one experiment 
with the CG97472-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 



General__screening_panel_vl.4 Summary: Ag41 13 Highest expression of the 
CG97472-01 gene is detected in kidney (CT=29.5). Therefore, therapeutic modulation of 
this gene may be useful in the treatment of kidney related disease including lupus and 
glomerulonephritis. 
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Among tissues with metabolic or endocrine function, this gene is expressed at low 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
5 diseases, such as obesity and diabetes. 

In addition, this gene is expressed at moderate to low levels in all regions of the 
central nervous system examined, including amygdala, hippocampus, substantia nigra, 
thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a 
role in central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 

1 0 epilepsy, multiple sclerosis, schizophrenia and depression. 

Low level expression of this gene is also seen in cluster of ovarian cancer, breast, 
lung, renal, colon and CNS cancer. Therefore, therapeutic modulation of this gene may be 
useful in the treatment of these cancers. 

Panel 4.1D Summary: Ag41 13 Highest expression of the CG97472-01 gene is 

1 5 detected in kidney (CT=3 1 ). This gene is expressed at low to moderate levels in a wide 
range of cell types of significance in the immune response in health and disease. These 
cells include members of the T-cell, B-celL endothelial cell, macrophage/monocyte, and 
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types 
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney. 

20 This ubiquitous pattern of expression suggests that this gene product may be involved in 
homeostatic processes for these and other cell types and tissues. This pattern is in 
agreement with the expression profile in General_screeningjpanel_vl .4 and also suggests 
a role for the gene product in cell survival and proliferation. Therefore, modulation of the 
gene product with a functional therapeutic may lead to the alteration of functions 

25 associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

Panel 5D Summary: Ag41 13 Highest expression of the CG97472-01 gene is 
30 detected in adipose (CT=34.4). Low level of expression of this gene is seen exclusively in 
adipose and skeletal muscle samples. Therefore, expression of this gene can be used to 
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distinguish these samples from other samples used in this panel. Please see Panel 1 A for a 
discussion of the potential utility of this gene 

General oncology screening panel_v_2.4 Summary: Ag41 13 Highest 
expression of the CG97472-01 gene is detected in metastatic melanoma sample (CT=31). 
5 Interestingly, significant expression of this gene is seen in number of cancer samples 
including kidney, metastatic melanoma, prostate adenocarcinoma and lung cancer. 
Therefore, expression of this gene may be used as diagnostic marker for these cancers and 
therapeutic modulation of this gene can be useful in the treatment of these cancers. 

1 0 AU. CG97528-01 : Guanylate Binding Protein 

Expression of gene CG97528-01 was assessed using the primer-probe set Ag4107, 
described in Table AUA. Results of the RTQ-PCR runs are shown in Tables AUB and 
AUC. 

Tabic AUA . Probe Name Ag4107 



Primers Jsequences 


Length 

*" 


Start 
Position 


SEQ ID i 
No 


[Forward 


5 ' -aacaggcccgagtactaaagg-3 * 


1814 


374 . 


Probe 


TET-5 1 -tgccaaggtgaaagtacccaacttca-3 ' -TAMRA 


26 


1840 


375 


Reverse 


5 1 -tcagggtcttctgtagcttttg-3 ■ 


22 


1876 


376 ! 



15 Table AUB . General screening panel vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4107, Run 
2f9447017 


Tissue Name 


Rel. 

Exp.(%) 
Ag4107, 
Run 

219447017 


Adipose 


5.5 


Renal ca. TK-10 


3.3 


Melanoma* Hs688(A).T 


12.5 


Bladder 


19.5 


Melanoma* Hs688(B).T 


11.0 


Gastric ca. (liver met.) NCI-N87 


100.0 


Melanoma* M 14 


10.8 


Gastric ca. KATO III 


24.0 


Melanoma* LOXIMVI 


3.4 


Colon ca. SW-948 


4.0 


Melanoma* SK-MEL-5 


0.0 


Colon ca. SW480 


6.1 


Squamous cell carcinoma 
SCC-4 


4.0 


Colon ca .* (SW480 met) SW620 


0.0 


Testis Pool 


1.3 


Colon ca. HT29 


4.0 


Prostate ca.* (bone met) 
PC-3 


11.6 


Colon ca. HCT-116 


0.6 


Prostate Pool 


4.9 


Colon ca. CaCo-2 


0.4 
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fi 7 


Colon cancer tissue 


11.8 


T It^niQ Pnnl 




Colon ca. bwl 116 


0.5 


Ovarian ra OVPAR-^ 




Colon ca. Colo-205 


0.6 


Ovarian ca 9K-OV-.7 

Wai Iclll va. ul\"VJ V J 




Colon ca. b W-48 


0.7 


Ovarian ra OVPAR-4 

Veil Id II Ud. W V / v I V i 




colon Fool 


7.2 


Ovarian ra OVfAR-^ 




Small Intestine Pool 


5.3 


Ovarian ra IftROV-l 

v/vai lull i^a. ivjfxW V I 


1 Q 


Stomach Pool 


2.8 


Ovarian ra OVPAR.R 


1 A 


Bone Marrow Pool 


3.3 


Ovarv 
\_/ v ai y 


A 1 
H.J 


retal Heart 


1.1 


Rrea^t ra MPF-7 


1 0 


Heart Pool 


2.5 


Breast ra MDA-MR 7^1 


AC\ 0 


Lymph Node Pool 


6.8 


Breast ca. BT 549 


5.8 


Fetal Skeletal Muscle 


0.7 


Breast ca. T47D 


29.3 


Skeletal Muscle Pool 


2.7 


Breast ca. MDA-N 


1.1 


Spleen Pool 


12.2 


Breast Pool 


5.6 


Thymus Pool 


5.2 


Trachea 


5.2 


CNS'cancer (glio/astro) U87-MG 


0.7 1 


Lung J0.8 


CNS cancer (glio/astro) U-l 1 8- 
MG 


40.6 j 


Fetal Lung 


14.4 


CNS cancer (neuro;met) SK-N-AS 


2.6 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539 


8.2 


Lung ca. LX- 1 


0.9 


CNS cancer (astro) SNB-75 


17.9 ; 


Lung ca. NCI-Hi 46 


0.0 


CNS cancer (glio) SNB-19 


1.5 


|Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF-295 


40.1 


Lung ca. A549 


0.5 


Bram (Amygdala) Pool 


0.4 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.1 


Lung ca. NCI-H23 


0.2 


Brain (fetal) 


0.3 


Lung ca. NCI-H460 


0.2 


Brain (Hippocampus) Pool 


0.7 


Lung ca. HOP-62 


4.0 


Cerebral Cortex Pool 


0.8 


Lungca.NCI-H522 


0.2 


Brain (Substantia nigra) Pool 


0.3 


Liver 


0.2 


Brain (Thalamus) Pool 


1.1 


Fetal Liver 


0.7 


Brain (whole) 


0.3 


Liver ca. HepG2 


0.1 


Spinal Cord Pool 


2.1 


Kidney Pool 


9.6 


Adrenal Gland 


2.1 


Fetal Kidney 


4.7 


Pituitary gland Pool 


0.8 


Renal ca. 786-0 


13.0 


Salivary Gland jo.3 


Renal ca. A498 


3.8 


Thyroid (female) J3.2 


Renal ca. ACHN 


2.8 


Pancreatic ca. CAPAN2 j 


1.9 


Renal ca. UO-31 


6.0 


Pancreas Pool js.7 
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Table AUC . General oncology screening panel_v_2.4 



Tissue Name 


Rel. bxp.(yo) 
Ag4107, Run 
268623659 


Tissue Name 

- . 


Rel. 

c,Xp\/o) 

Ag4107, 
Run 

268623659 


Colon cancer 1 


22.4 


Bladder cancer NAT 2 


0.8 


Colon NAT 1 


13.9 


Bladder cancer NAT 3 


1.3 


Colon cancer 2 


36.1 


Bladder cancer NAT 4 


2.4 


Colon cancer NAT 2 


19.5 


Adenocarcinoma of the prostate 1 


69.7 


Colon cancer 3 


76.3 


Adenocarcinoma of the prostate 2 


6.5 


Colon cancer NAT 3 


66.4 


Adenocarcinoma of the prostate 3 


19.8 


Colon malignant cancer 4 


64.6 


Adenocarcinoma of the prostate 4 


51.1 


Colon normal adjacent 
tissue 4 


28.3 


Prostate cancer NAT 5 


7.1 


Lung cancer 1 


23.3 


Adenocarcinoma of the prostate 6 


4.8 


Lung NAT 1 


5.8 


Adenocarcinoma of the prostate 7 


6.4 


Lung cancer 2 


11.3 


Adenocarcinoma of the prostate 8 


1.6 


Lung NAT 2 


5.4 


Adenocarcinoma of the prostate 9 


28.5 


Squamous cell carcinoma 
j 


21.9 


r 

Prostate cancer NAT 10 


1.7 


Lung NAT 3 


2.7 


Kidney cancer 1 


50.0 


metastatic melanoma 1 


27.5 


KidneyNAT 1 


18.7 


Melanoma 2 


0.8 


Kidney cancer 2 


100.0 


Melanoma 3 


2.4 


Kidney NAT 2 


43.2 


metastatic melanoma 4 


50.7 


Kidney cancer 3 


94.0 


metastatic melanoma 5 


47.6 


Kidney NAT 3 


11.9 


Bladder cancer 1 


3.3 


Kidney cancer 4 


16.5 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


3.1 


Bladder cancer 2 


9.2 







CNSjaeurodegenerationjvl.O Summary: Ag4107 Results from one experiment 
with the CG97528-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 



General screening panel vl.4 Summary: Ag4107 Highest expression of the 
CG97528-01 gene is detected in gastric cancer cell line (CT=25). High expression of this 
gene is also seen in cluster of cancer cell lines including CNS, colon, renal, breast, 
ovarian, prostate, squamous cell carcinoma and melanoma. Therefore, therapeutic 
modulation of this gene may be useful in the treatment of these cancers. 
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Significant expression is also detected in fetal and adult lung. Interestingly, this 
gene is expressed at much higher levels in fetal (CT = 27.8) when compared to adult lung 
(CT=32). This observation suggests that expression of this gene can be used to distinguish 
fetal from adult lung. In addition, the relative overexpression of this gene in fetal lung 
suggests that the protein product may enhance lung growth or development in the fetus 
and thus may also act in a regenerative capacity in the adult. Therefore, therapeutic 
modulation of the protein encoded by this gene could be useful in treatment of lung related 
diseases. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

In addition, this gene is expressed at moderate to low levels in all regions of the 
central nervous system examined, including amygdala, hippocampus, substantia nigra, 
thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a 
role in central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4107 Results from one experiment with the CG97528- 
01 gene are not included. The amp plot indicates that there were experimental difficulties 
with this run. 

General oncology screening panel_v_2.4 Summary: Ag4107 Highest 
expression of the CG97528-01 gene is detected in kidney cancer (CT=27). Interestingly, 
expression of this gene is higher in number of cancer samples including kidney, 
adenocarcinoma of the prostate, melanoma, lung, and colon cancers. Therefore, 
therapeutic modulation of this gene may be useful in the treatment of these cancers. 

AV. CG97629-01: Cell Division Protein Kinase 7 

Expression of gene CG97629-01 was assessed using the primer-probe set Ag4122, 
described in Table AVA. 

Table AVA , Probe Name Ag4122 



Primers 


Sequences 


Length 


Start 
Position 


SEQ 
ID No 
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Forward 


5 ' -gggacattttgctacagcctat-3 1 


22 | 142 


377 


Probe 


TET-5 1 -agaacaccaaccaaatcgtcaccatt-3 ' -TAMRA 


26 | 177 


378 


Reverse 


5 1 -agcttctgacctgtgtccaa-3 ' 


20 ] 216 


379 



CNSjneurodegeneration_vl.O Summary: Ag4122 Expression of the CG97629- 
01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 



General screening panel vl. 4 Summary: Ag4122 Expression of the CG97629- 
5 01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

Panel 4.1D Summary: Ag4122 Expression of the CG97629-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel CNS_1 Summary: Ag4122 Expression of the CG97629-01 gene is 
10 low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4122 Expression of the 
CG97629-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 



15 AW. CG97648-01 : G Protein-Coupled Receptor Kinase GRK7 

Expression of gene CG97648-01 was assessed using the primer-probe sets Ag3046 
and Ag4125, described in Tables AWA and AWB. Results of the RTQ-PCR runs are 
shown in Tables AWC, AWD ; AWE and AWF. 

Table AWA . Probe Name Ag3046 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -gaagcaaagaactctgcaagac-3 ' 


22 


1281 


380 


Probe 


TET-5 ' - ttccagcatgataacttcacagagga-3 * -TAMRA 


26 


1312 


381 


Reverse 


5 ' -gagcctgcaaatatcttttgct-3 ' 


22 


1338 


382 . 



20 Table AWB . Probe Name Ag4125 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -gaagcaaagaactctgcaagac-3 ' 


22 


.1281 


383 


Probe 


TET-5 ' - ttccagcatgataacttcacagagga-3 1 -TAMRA 


26 


1312 


384 


Reverse 


5 ' -gagcctgcaaatatcttttgct-3 ' 


22 


1338 


385 



Table AWC. Panel 1.3D 
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Tissue Name 


Rel. 

Exp.(%) 
Ag3046, 
Run 

165533236 


Rel. 

Exp.(%) 
Ag3046, 
Run 

165724481 


Tissue Name 


Rel. 

Exp.(%) 
Ag3046, 
Run 

165533236 


Rel. Exp.(%) 
Ag3046, Run 

1657944R1 


Liver 

^adenocarcinoma 

! 


27.5 


0.0 


K iHni»v f fptnh 
fvlUIICy ^ICldl f 


u.u 


ft ft 
U.U 


[Pancreas 


|0.0 


72.2 


Renal ca. 786-0 


0.0 


0.0 


! Pancreatic ca. 
CAPAN 2 


!o.o 


0.0 


Renal ca. A498 


43.2 


0.0 


Adrenal gland iO.O 


u.u 


tvenai ca. tvAr jvj 


ft ft 
U.U 


0.0 


Thyroid 


jo.o 


0.0 


Renal ca. ACHN 


0.0 


0.0 


Salivary gland 


0.0 


25.5 


Renal ca. UO-3 1 


0.0 


0.0 


Pituitary gland 


|0.0 


0.0 


Renal ca. TK-10 


0.0 


0.0 


Brain (fetal) 


138.4 


0.0 


Liver 


30.6 


0.0 


Brain (whole) 


31.4 


jO.O 


liver (fetal) 


0.0 


0.0 


^Brain (amv^dala) 


i 

! 

41.8 


0.0 


U>| vci td, 

(hepatoblast) 
HepG2 


0.0 


0.0 


[Brain (cerebellum) 


'o.o 


'o.o 


Lung 


0.0 


0.0 


(Brain (hippocampus) 

r • " - — — — 


i37.I 


0.0 


. . — 

f wtMT ( fpia W ■ 

Llll fiddly 


U.u 


U.U 


orain ^suDSiantia 
nigra) 


0.0 

1 


22.8 


Lung ca. (small 


0.0 


0.0 


Brain (thalamus) 


73.2 

t 


12.8 


Lungca. (small 
ceinNCI-H69 


OA 


0.0 


V_/CICUId.i L/OnCA 


n n 
u.u 


0.0 


Lung ca. (s.cell 
var ^ SHP-77 


0.0 


0.0 


Spinal cord 


0.0 


0.0 


Lung ca. (large 
cellYNCI-H460 


0.0 


0.0 


glio/astro U87-MG 


0.0 


0.0 


Lung ca. (non-sm. 
celH A549 


0.0 


0.0 


glio/astro U-118-MG 


0.0 


0.0 


Lung ca. (non- 
s ceinNCI-H23 


33.7 

» - 


0.0 


astrocytoma SW1783 


0.0 


0.0 


Lung ca. (non- 


0.0 


0.0 

* 


neuro*; met SK-N- 
AS 


0.0 


0.0 


I lino - r*a /nfMi-cfft 
luting w. y . 

NCI-H522 


0.0 


0.0 


astrocytoma SF-539 


0.0 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 


35.4 


astrocytoma SNB-75 


0.0 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


0.0 


glioma SNB-19 


0.0 


0.0 


Mammary gland 


0.0 


0.0 


glioma U251 


0.0 


0.0 


Breast ca.* (pl.ef) 


0.0 


100.0 
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glioma SF-295 



0.0 



0.0 



MCF-7 

Breast ca.* (pl.ef) 
MDA-MB-231 



0.0 



0.0 



Heart (fetal) 



0.0 



24.7 



Breast ca.* (pl.ef) 
T47D 



0.0 



Heart 



0.0 



10.0 



Skeletal muscle 
(fetal) 



:o.o 



:o.o 



Breast ca. BT-549 0.0 



Breast ca. MDA-N 



0.0 



Skeletal muscle 



iO.O 



Bone marrow 



0.0 



! 32.3 



129.5 



'.Ovary 

iOvarian ca. 
OVCAR-3 



0.0 



37.9 



0.0 



64.6 



0.0 
0.0 
0.0 



Thymus 



0.0 



Spleen 



0.0 



52.5 



0.0 



Ovarian ca. 
OVCAR-4 



0.0 



Ovarian ca. 
OVCAR-5 



0.0 



Lymph node 



Colorectal 



00.0 



'94.6 



^Ovarian ca. 
SoVCAR-8 



0.0 



i46.7 



82.9 



iOvarian ca. 
llGROV-l 



0.0 



Stomach 



.0.0 



! iOvarian ca.* I Q Q 

• 0 0 (ascites) SK-OV-3 



Small intestine 



;o.o 



;o.o 



iUterus 



0.0 



Colon ca. SW480 



0.0 



0.0 



Placenta 



41.2 



0.0 



0.0 



0.0 



0.0 



39.8 



0.0 



12.6 



Colon ca.* 
SW620(SW480 met) 



0.0 



0.0 



Prostate 



0.0 



5.2 



Colon ca. HT29 



0.0 



0.0 



Prostate ca.* (bone 
met)PC-3 



0.0 



0.0 



Colon ca. HCT-116 



0.0 



0.0 



Testis 



0.0 



32.1 



Colon ca. CaCo-2 



0.0 



0.0 



Melanoma 
Hs688(A).T 



0.0 



0.0 



Colon ca. 
tissue(OD03866) 



0.0 



0.0 



Melanoma* (met) 
Hs688(B).T 



0.0 



0.0 



Colon ca. HCC-2998 0.0 



0.0 



Melanoma UACC- 
62 



0.0 



0,0 



Gastric ca * (liver 
mef)NCI-N87 



0.0 



0.0 



Melanoma M14 



0.0 



0.0 



Bladder 



45.1 



0.0 



Melanoma LOX 
IMV1 



0.0 



0.0 



Trachea 



64.2 



0.0 



Melanoma* (met) 
SK-MEL-5 



0.0 



0.0 



Kidney 



40.9 



0.0 



Adipose 



0.0 



28.7 
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Tissue Name 


Rel. Exp.(%) 
Ag3046, Run 
162559104 


Tissue Name 


Rel ExdC%^ 
Ag3046,Run 
162559104 


Normal Colon 


0.0 


Kidney Margin 8120608 


0.0 


CC Well to Mod Diff 
(ODQ3866) 


0.0 


Kidney Cancer 8120613 


0 0 

v/.v/ 


CC Margin (OD03866) ;0.0 


Kidney Margin 8120614 


0.0 


CC Gr.2 rectosigmoid • 
(OD03868) i 00 


Kidney Cancer 9010320 


0 0 


CC Margin (OD03868) 


0.0 (Kidney Margin 90 1 032 1 


0.0 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


!o.o 

1 


Normal Thyroid 


0.0 


CC Margin (OD03921) jo.0 


Thyroid Cancer 064010 


0.0 


CC from Partial Hepatectomy ' 
(ODO4309) Mets j 0 " 0 


Thyroid Cancer A302 1 52 


o n 


Liver Margin (ODO4309) :o.O iThyroid Marsjin A302 1 53 


0.0 


Colon mets to lung (OD0445 1-! 

oi) ~ 

!■ ' i 


Normal Breast 


0.0 


Lung Margin (OD0445 1 -02) ;o.O 


Breast Cancer (OD04566) 


100.0 


Normal Prostate 6546-1 


0.1 


Breast Cancer (OD04590-01) 


0.0 


Prostate Cancer (OD04410) 


0.0 


Breast Cancer Mets (OD04590- 
03) 


0.0 


Prostate Margin (OD04410) 


0.0 


Breast Cancer Metastasis 
(OD04655-05) 


0.0 


Prostate Cancer (OD04720-0 1 ) 


0.1 


Breast Cancer 064006 


0.0 


Prostate Margin (OD04720- 
02) 


0.0 


Breast Cancer 1024 


0.0 


Normal Lung 061010 


0.0 


Breast Cancer 9 1 00266 1 


0.0 


Lung Met to Muscle 
(OD04286) 


0.0 


Breast Margin 9100265 


0 0 


Muscle Margin (OD04286) 


0.0 


Breast Cancer A209073 


0.0 


Lung Malignant Cancer 
(OD03126) 


0.0 


Breast Margin A209073 


0.0 


Lung Margin (OD03 1 26) 


0.0 


Normal Liver j 


o.o 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 j 


0.0 


Lung Margin (OD04404) ( 


D.O ] 


Liver Cancer 1025 j 


o.o 


Lung Cancer (OD04565) j< 


0.0 1 


Liver Cancer 1026 \i 


3.0 


Lung Margin (OD04565) j( 


3.0 ] 


Liver Cancer 6004-T I 


3.0 


Lung Cancer (OD04237-0 1 ) ( 


3.0 1 


Liver Tissue 6004-N I 


).0 



664 



WO 03/010327 



PCT7US02/14199 



Lung Margin (OD04237-02) 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver Margin (OD043 10) 


0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 

1 
1 


Bladder Cancer 1023 


0.0 
0.0 


Lung Margin (OD04321 ) 


0.0 


Bladder Cancer A302I73 


Normal Kidney 


0.0 


Bladder Cancer (OD047 18-01) jo.O 


Kidney Ca, Nuclear grade 2 
(OD04338) 


0.0 


Bladder Normal Adjacent 
(OD047 18-03) 


0.0 


Kidney Margin (OD04338) 


0.0 


Normal Ovary 


0.0 


Kidney Ca Nuclear grade 1/2 
(OD04339) 


0.0 


Ovarian Cancer 064008 


0.0 


Kidney Margin (OD04339) 


0.0 


Ovarian Cancer (OD04768-07) 


0.0 


Kidney Ca, Clear cell type 
(OD04340) 


0.0 


Ovary Margin (OD04768-08) 


0.0 


Kidney Margin (OD04340) 


0.0 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 3 j A A L . ^ i 
(OD04348) ~ j 0 0 Gastric Cancer 9060358 |0.0 


Kidney Margin (OD04348) jO.O 


Stomach Margin 9060359 jo.O 


Kidney Cancer (OD04622-01) jo.O jGastric Cancer 9060395 jo.O 


Kidney Margin (OD04622-03) 


0.0 


Stomach Margin 9060394 


0.0 


Kidney Cancer (OD04450-01) 


0.0 


Gastric Cancer 9060397 


0.0 


Kidney Margin (OD04450-03) 


0.0 


Stomach Margin 9060396 


0.0 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 064005 


0.0 



Table AWE. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4125, Run 
172859315 


Tissue Name 


Rel. Exp.(%) 
Ag4125, Run 
172859315 


Secondary Thl act 


3.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


4.2 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVECTNF alpha +1FN 
gamma 


0.0 


Secondary Thl rest 


0.0 


HUVECTNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC 1L- 11 


0.0 


Secondary Trl rest 


2.7 


Lung Microvascular EC none 


0.0 


Primary Th 1 act 


0.0 


Lung Microvascular EC • 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


3.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


11.9 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


7.4 
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Primary Thl rest 


7.2 


Bronchia! epithelium TNFalpha 
+ ILlbeta 


7.5 


Primary Th2 rest 


3.1 


Small airway epithelium none 


0.0 


Primary Tr I rest 


0.0 


Small airway epithelium 
TNFalpha + 1L-1 beta 


0.0 


CD45RA CD4 lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


^l/hjkaj l,lm- lymphocyte act 


0.0 


Coronerv arterv SMC TNFalnha 
+ IL-lbeta 


0.0 


CDS lymphocyte act 


0.0 


Astrocytes rest 


T ! 

n 0 


Secondary CD8 lymphocyte 
rest 


0.0 


Astrocytes TNFalpha + IL-I beta 


0.0 


Secondary CD8 lymphocyte 
act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti-CD95 
CH11 


0.0 


CCD1 106 (Keratinocytes) none 


0.0 


i 

LAK cells rest h 3 

! 


CCD1106(Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 

i 


LAK cells IL-2 


0.0 


Liver cirrhosis 


0.0 i 


LAK cells IL-2+IL-I2 


5.3 |NC1-H292 none (0.0 ] 


LAK cells IL-2+IFN gamma 


1.8 


NCI-H292 1L-4 I5.8 | 


LAK cells IL-2+ IL-18 


3.6 


NCI-H292 IL-9 


3.5 


LAK cells PMA/ionomycin 


0.0 


NCI-H292 IL-13 


1.3 


NK Cells IL-2 rest 


5.9 


NCI-H292 I FN gamma 


0.8 


Two Way MLR 3 day 


3.3 


HPAEC none : 


0.0 


Two Way MLR 5 day 


3.2 


HPAECTNF alpha + IL-1 beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL- 
I beta 


2.3 


PBMC PWM 


3.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


1.6 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL- 13 


0.0 


Ramos (B cell) ionomycin i 


D.O 


Lung fibroblast IFN gamma ( 


3.0 


B lymphocytes PWM l 


3.0 ] 


Dermal fibroblast CCD1070 rest 


1.0 


B lymphocytes CD40L and IL- 
4 1 


3.0 J 


Dermal fibroblast CCD 1 070 
rNF alpha * 


3.0 


EOL-1 dbcAMP : 


U 1 


Dermal fibroblast CCD 1070 IL- 
1 beta ( 


3.0 


EOL-1 dbcAMP 
PMA/ionomycin 


J.3 ] 


Dermal fibroblast IFN gamma ( 


).0 
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LJCIIUI 1111/ l>Cllo I1UI1C 


u.u 


JL/ermai iiDroDiasx 


n ft 

U.U 


DpnHritir rpllc T 


U.U 


uermai riurouiasts rest 


ft ft 
u.u 


OpnHritir' ppIIc antt -C^T^ACi 

J-swlVJI I LIU WCIlo CliJll V^L/^rU 


U.U 


"Mah itrr\r»Vi i 1c TTMPo-J-T PQ 

iNcUuOpnUb 1 INraTLro 


ft ft 
u.u 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


13.5 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


5.4 


Macrophages LPS 


0.0 


Thymus 


18.4 


HUVEC none 


0.0 


Kidney 


100.0 1 


HUVEC starved 


0.0 




i 



Table AWF . General oncology screening panel_v_2.4 



Tissue Name 


Rel. 

Exp.(%) 
Ag3046, 
Run 

267908989 


Rel. 

Exp.(%) 
Ag4125, 
Run 

26839002 
5 


Tissue Name 


Rel. 

Exp.(%) 
Ag3046, 
Run 

267908989 


| 

Rel. Exp.(%) 
Ag4125, Run 
268390025 ! 

1 

1 
j 


Colon cancer 1 


0.0 


0.0 


Bladder cancer NAT 2 


0.0 


0.0 ! 


Colon NAT 1 


0.0 


0.0 


Bladder cancer NAT 3 


0.0 


3.6 ! 


Colon cancer 2 


0.0 


0.0 


Bladder cancer NAT 4 


0.0 


7.6 I 


Colon cancer NAT 
2 


0.0 


0.0 


Adenocarcinoma of the 
prostate 1 


26.8 


27.5 


Colon cancer i 


12.1 


14.1 


Adenocarcinoma of the 
prostate 2 


0.0 


0.0 


Colon cancer NAT 
3 


ft ft 
u.u 


Q 0 
O.O 


Adenocarcinoma of the 
prostate 3 


U.U 


18.4 


Colon malignant 
cancer 4 


11.7 


23.8 


Adenocarcinoma of the 
prostate 4 


0.0 


0.0 


Colon normal 
adjacent tissue 4 


3.5 


0.0 


Prostate cancer NAT 5 


0.0 


0.0 


Lung cancer 1 


0.0 


14.2 


Adenocarcinoma of the 
prostate 6 


13.0 


8.4 


Lung NAT 1 


7.3 


0.0 


Adenocarcinoma of the 
prostate 7 


9.6 


0.0 


Lung cancer 2 


8.7 


0.0 


Adenocarcinoma of the 
prostate 8 


10.7 


0.0 


Lung NAT 2 


0.0 


0.0 


Adenocarcinoma of the 
prostate 9 


0.0 


11.7 


Squamous cell 
carcinoma 3 


16.7 


0.0 


Prostate cancer NAT 1 0 


0.0 


0.0 


Lung NAT 3 


0.0 


0.0 


Kidney cancer 1 


10.2 


7.7 


metastatic 
melanoma 1 


8.8 


14.8 


KidneyNAT 1 


0.0 


0.0 
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Melanoma 2 


9.1 


0.0 


Kidney cancer 2 


100.0 


100.0 


Melanoma 3 


16.2 


8.2 


Kidney NAT 2 


0.0 


7.7 


metastatic 
melanoma 4 


0.0 


19.8 


Kidney cancer 3 


18.6 


13.3 


metastatic 
melanoma 5 


58.2 


33.7 


Kidney NAT 3 


0.0 


8.1 


Bladder cancer I 


0.0 


0.0 


Kidney cancer 4 


17.8 


0.0 


Bladder cancer 
NAT 1 


0.0 


0.0 


Kidney NAT 4 


0.0 

i 


21.0 


Bladder cancer 2 


0.0 


o.o ! 







low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Ag4125 Results from one experiment with this gene are not included. The amp 
plot indicates that there were experimental difficulties with this run (data not shown). 

General_screening_panel_vl.4 Summary: Ag4125 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag3046 Expression of this gene is low/undetectable (CTs 
> 35) across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag3046 Significant expression of this gene is seen 
exclusively in a breast cancer sample (CT = 25.2). Therefore, expression of this gene may 
be used to distinguish breast cancers from the other samples on this panel. Furthermore, 
therapeutic modulation of the activity of the GPCR encoded by this gene may be 
beneficial in the treatment of breast cancer. 

Panel 3D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

Panel 4.1D Summary: Ag4125 This gene is only expressed at detectable levels in 
the kidney (CT = 32.6). The putative GPCR encoded for by this gene could allow cells 
within the kidney to respond to specific microenvironmental signals (For example, ref. 1). 
Therefore, antibody or small molecule therapies designed with the protein encoded for by 
this gene could modulate kidney function and be important in the treatment of 
inflammatory or autoimmune diseases that affect the kidney, including lupus and 
glomerulonephritis. 
References: 
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1. Mark M.D., Wittemann S., Herlitze S. (2000) G protein modulation of 
recombinant P/Q-type calcium channels by regulators of G protein signalling proteins. J. 
Physiol. 528 Pt 1:65-77. 

1 . Fast synaptic transmission is triggered by the activation of presynaptic Ca2+ 
5 channels which can be inhibited by Gbetagamma subunits via G protein-coupled receptors 
(GPCR). Regulators of G protein signalling (RGS) proteins are GTPase-accelerating 
proteins (GAPs), which are responsible for > 100-fold increases in the GTPase activity of 
G proteins and might be involved in the regulation of presynaptic Ca2+ channels. In this 
study we investigated the effects of RGS2 on G protein modulation of recombinant P/Q- 

1 0 type channels expressed in a human embryonic kidney (HEK293) cell line using whole- 
cell recordings. 2. RGS2 markedly accelerates transmitter-mediated inhibition and 
recovery from inhibition of Ba2+ currents (IBa) through P/Q-type channels heterologously 
expressed with the muscarinic acetylcholine receptor M2 (mAChR M2). 3. Both RGS2 
and RGS4 modulate the prepulse facilitation properties of P/Q-type Ca2+ channels. G 

15 protein reinhibition is accelerated, while release from inhibition is slowed. These kinetics 
depend on the availability of G protein alpha and betagamma subunits which is altered by 
RGS proteins. 4. RGS proteins unmask the Ca2+ channel beta subunit modulation of Ca2+ 
channel G protein inhibition. In the presence of RGS2, P/Q-type channels containing the 
beta2a and beta3 subunits reveal significantly altered kinetics of G protein modulation and 

20 increased facilitation compared to Ca2+ channels coexpressed with the beta lb or beta4 
subunit. 

PMID: 11018106 

Panel 4D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

25 General oncology screening panel_v_2.4 Summary: Ag3046/Ag4125 Two 

experiments with same probe and primer set are in excellent agreement. Significant 
expression of this gene is seen exclusively in a kidney cancer sample (CT=34.6). 
Therefore, expression of this gene may be used to distinguish kidney cancers from the 
other samples on this panel. Furthermore, therapeutic modulation of the activity of the 

30 GPCR encoded by this gene may be beneficial in the treatment of kidney cancer. 

AX. CG97658-01: Human Protein Tyrosine Phosphotase 
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Expression of gene CG97658-01 was assessed using the primer-probe set Ag4128, 
described in Table AXA. Results of the RTQ-PCR runs are shown in Tables AXB, AXC,' 



AXD and AXE. 



Primers 



Sequences 



Forward 



Probe 



Reverse 



Table AXA. Probe Name Ag4128 



- aagcagaagagct t ca tgaaaa - 3 ' 



! Length 



22 



TET-5 ' -ccttbcctgctgcaggcg gaatt-3 ' -TAMRAi 23 



- aaagt tea tgggga tttcaaag-3 ' 



j 22 



Start Position 



784 



816 



839 



Table AXB. CNS_neurodegeneration_vl.O 



SEQ ID No 



386 



387 



388 



■ issue iNarne 


Rel. Exp.(%) 
Ag4128, Run 
214956370 


Tissue Name 


Rel. Exp.(%) 
Ag4128, Run 
214956370 


AD 1 Hippo 




Control (Path) 3 Temporal Ctx 


2.9 


AD 2 Hippo 


24.1 


Control (Path) 4 Temporal Ctx 


44.4 


AD 3 Hippo 


3.2 


AD 1 Occipital Ctx 


[T2.5 


AD 4 Hippo 


7.7 


AD 2 Occipital Ctx (Missing) 10.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


3.4 


AD 6 Hippo 


32 5 |AD 4 Occipital Ctx 


39.5 


Control 2 Hippo 


310 |AD 5 Occipital Ctx 


54.0 


v-'Oiiiroi h rflppo 


3.2 


AD 6 Occipital Ctx 


20.0 


Control (Path) 3 Hippo 


ii ~~ 


Control 1 Occipital Ctx 


— 




6.0 


Control 2 Occipital Ctx 


83.5 


AD 2 TemporaJ Ctx 


25.3 


Control 3 Occipital Ctx 


26.1 


AD 3 Temporal Ctx 


5.1 


Control 4 Occipital Ctx 


3.7 


AD 4 Temporal Ctx 


24 - 5 Control (Path) 1 Occipital Ctx ! 


78.5 


AD 5 Inf Temporal Ctx 


92 0 jControl (Path) 2 Occipital Ctx ; 


23.8 


AD 5 Sup Temporal Ctx 


25 2 (Control (Path) 3 Occipital Ctx 


1.4 


AD 6 Inf Temporal Ctx 


33.7 


Control (Path) 4 Occipital Ctx 


23.0 


AD 6 Sup Temporal Ctx 


41.8 


Control 1 Parietal Ctx 


3.9 


Control 1 Temporal Ctx 


4.2 


Control 2 Parietal Ctx \ 


32.5 


Control 2 Temporal Ctx ! ; 


52.5 


Control 3 Parietal Ctx 19.9 


Control 3 Temporal Ctx 


15.3 < 


Control (Path) 1 Parietal Ctx 


76.8 


Control 3 Temporal Ctx ] 


13.3 < 


Control (Path) 2 Parietal Ctx ; 


12.7 


Control (Path) 1 Temporal 
Ctx ' 


n.2 ( 


:ontrol (Path) 3 Parietal Ctx : 


>.8 


Control (Path) 2 Temporal j 

Ctx .4 


H.2 ( 


Control (Path) 4 Parietal Ctx A 


15.1 
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Tissue Name 


P*>l Pvn /0A\ 

Ag4128,Run 
220364381 


Tissue Name 


i\ei. n,xp\/o) 
Ag4128,Run 
220364381 


Adipose 


0.7 


Renal ca. TK-10 


0.0 


Melanoma* Hs688(A).T 


0.0 


Bladder 


0.9 


ivieianoma rtsooo^tsj. 1 




Gastric ca. (liver met.) NCI- 
N87 


0.0 


Melanoma* MI4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVi 


o.o- 


Colon ca. SW-948 


0.0 


Melanoma* SK-MEL-5 


0.3 


Colon ca. SW480 


o.o 


Squamous cell carcinoma 
SCC-4 


A. A 

u.u 


Colon ca.*(SW480 met) 
SW620 


A A 
0.0 


Testis Pool 


2.2 


Colon ca. HT29 


0.0 


Prostate ca.* (bone met) 
PC-j 


0.7 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.3 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.5 


Uterus Pool 


0.2 


Colon ca. SW1 116 


0.0 


Ovarian ca. OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV-3 


0.4 


Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


0.0 


Colon Pool 


1.6 


Ovarian ca. OVCAR-5 


0.0 


Small Intestine Pool 


0.3 


Ovarian ca. IGROV-1 


0.0 


Stomach Pool 


0.4 


Ovarian ca. OVCAR-8 


0.0 


Bone Marrow Pool 


0.1 


Ovary 


6.0 


Fetal Heart 


2.2 


Breast ca. MCF-7 


0.0 


Heart Pool 


1.2 


Breast ca. MDA-MB-23 1 


0.0 


Lymph Node Pool 


1.6 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.2 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.2 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.2 . 


Rrpa^t Pnnl 


ft fx 


I nymub ruui 




Trachea 


0.9 


v^iNo Cancer ^gi lo/aSLTCjj uo/- 
MG 


0.0 


I nncr 


U.J 


CNS cancer (glio/astro) U-l 1 8- 
MG 


n a 


Fetal Lung 


1.3 


CNS cancer (neuro;met) SK.-N- 
AS 


0.1 


Lungca.NCI-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) SNB-75 


0.0 


Lungca.NCI-H146 


0.0 


CNS cancer (glio)SNB-l 9 


0.0 
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Lung ca SHP-77 

O ****** » / 


In n 

M 


ILiNo cancer (guo) SF-295 


0.0 


Lun2 ca A 549 


in n 


jrirain (Amygdala) Pool 


63.7 


Luneca NCI-H526 


In n 


jBrain (cerebellum) 


22.4 


Luneca NCI-H23 


jo I 


joram (tetai) 


O 1 O 

81.8 


Lunsca NCI-H460 


In n 

Ju.v 


Brain (Hippocampus) Pool 


49.7 


Luneca HOP-62 


!o l 

jU.I 


|L,ereDrai cortex rool 


"7*7 t\ 

77.9 


Luneca NCI-H522 


ll K 
| 1 .o 


jBrain (Substantia nigra) Pool 


65.1 


Liver 


lo 0 


Brain (Thalamus) Pool 


100.0 


Fetal Liver 


!o i 

V. 1 


Brain (whole) 


95.9 


Liver ca HenG7 


In n 


bpinal Lord Pool 


11.3 


Kidnev Pnnl 


In o 


Adrenal uland 


0.6 


Fetal Kidney 


|o.5 


[Pituitary gland Pool 


0.0 


Renal ca. 786-0 


|o.o 


(Salivary Gland 


0.0 


Renal ca. A498 


jo.o 


jThyroid (female) 


0.3 


Renal ca. ACHN 


0.0 


jPancreatic ca. CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


jPancreas Pool 

i 


0.8 


Table AXD. Panel 4.1 D 


Tissue Name 


Rel. E.\p.(%) 
Ag4128, Run 
172859562 


Tissue Name 


Rel. Exp.(%) j 
Ag4128, Run j 
172859562 


Secondary Thl act 


0.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVECTNF alpha + IFN 
gamma 


0.0 . 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha-ML-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium TNFalpha 
+ ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 lymphocyte 
act 


0.0 


Coronery artery SMC rest 


35.1 


CD45RO CD4 lymphocyte 


0.0 


Coronery artery SMC 


100.0 
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act 




TNFalpha + IL-1 beta 




tuo lymphocyte act 


n n 
u.u 


/\sirocyxes rebi 


0 0 


Secondary CD8 
lympnocyie resi 


0.0 


Astrocytes TNFalpha + IL- 

1 hf»tn 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


K\ l_St 1 "7 /Raqnnhih 

l\U "O li> ^LXOVJJJIII \) 

PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD 1 106 (Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


U.U 


LAK cells IL-2 


0.0 


Liver cirrhosis 


0.0 


LAK cells IL-2+IL-12 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


0.0 


LAK ceils IL-2+ IL-18 


0.0 


NC1-H292 IL-9 


0.0 


LAK cells PMA/ionomycin 


0.0 


NC1-H292 IL-13 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 3 day jO.O 


HPAEC none 


o.o i 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 beta 


|oT" 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + L Q 
IL-1 beta | " 


PBMC PWM 


0.0 


Lung fibroblast IL-4 jO.O 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


jo.o 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-1 3 jO.O 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma jO.O 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD 1070 L Q 
rest | 


B lymphocytes CD40L and 
IL-4 


9.9 


Dermal fibroblast CCD 1 070 L Q 
TNFalpha | ' 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1070 IL-L Q 
1 beta 1 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


|l0.4 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


jo.o 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS JO.O 


Monocytes rest 


0.0 


Neutrophils rest jo.O 


Monocytes LPS 


0.0 


Colon 


I 10 - 4 
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Macrophages rest 


0.0 |Lung 


56.6 


Macrophages LPS 


0.0* [Thymus 


12.9 


HUVEC none 


0.0 jKidney 


0.0 


HUVEC starved 


0.0 | 




Table AXE. General onrnlnay serening p* n pl v 




Tissue Name 


Rel. Exp.(%) 
A<*4128 Run 
268390026 


i ibbuc iNaiTie 


Rel. Exp.(%) 
Ag4128, Run 
268390026 


Colon cancer 1 


4.1 


oiauuer cancer in a I ! 


3.4 


Colon NAT 1 


4.3 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


3.8 


Bladder cancer NAT 4 


1 A 1 
IU. / 


Colon cancer NAT 2 


4.2 


Adenocarcinoma of the prostate 
1 


4.2 


Colon cancer 3 


14.6 


Adenocarcinoma of the prostate 
2 


0.0 


Colon cancer NAT 3 


14 7 


Adenocarcinoma of the prostate 

3 


2.1 


Colon malignant cancer 4 


9.3 


Adenocarcinoma of the prostate 


15.4 


Colon normal adiacent 
tissue 4 


1.7 


Prostate cancer NAT 5 


1.3 




n n 


Adenocarcinoma of the prostate 
6 


3.3 


Lung NAT 1 


0.0 


Adenocarcinoma of the prostate 
7 


3.3 


Lung cancer 2 


43.8 


Adenocarcinoma of the prostate 
8 


0.0 


Lung NAT 2 


1.6 


Adenocarcinoma of the prostate 
9 


2.6 


Squamous cell carcinoma 3 


3.8 


Prostate cancer NAT 1 0 


1.7 


Lung NAT 3 


0.0 


Kidney cancer 1 


6.4 


metastatic melanoma 1 


42.9 


KidneyNAT 1 


9.7 


Melanoma 2 


3.7 


Kidney cancer 2 


29.1 


Melanoma 3 


2.1 


Kidney NAT 2 


9.7 


metastatic melanoma 4 


90.8 


Kidney cancer 3 


5.2 


metastatic melanoma 5 


100.0 


Kidney NAT 3 


2.0 


Bladder cancer 1 


0.0 


Kidney cancer 4 


18.3 


Bladder cancer NAT 1 jO.O 


Kidney NAT 4 


6.1 


Bladder cancer 2 jo.8 


i 



CNS_neurodegeneration_vl.O Summary: Ag4 128 This panel confirms the 
expression of the CG97658-01 gene at low levels in the brains of an independent group of 
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individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 
5 Generalscreeningj)anel_vl.4 Summary: Ag4128 Highest expression of the 

CG97658-01 gene is seen in thalamus (CT=25.7). High expression of this gene is detected 
in all regions of the central nervous system (CNS) examined, including amygdala, 
hippocampus, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord. 
Therefore, expression of this gene can be used to distinguish CNS samples from other 

10 samples in this panel. Furthermore, therapeutic modulation of this gene may be useful in 
the treatment of central nervous system disorders such as Alzheimer's disease, Parkinson's 
disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Among tissues with metabolic or endocrine function, this gene is expressed at low 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 

15 muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

Panel 4.1D Summary: Ag4128 Highest expression of the CG97658-01 gene is 
detected in TNFalpha + IL-lbeta treated coronery artery SMC cells. Therefore, exression 

20 of this gene can be used to distinguish this sample from other samples in this panel. 

In addition, low expression of this gene is seen in lung. Therefore, therapeutic 
modulation of this gene product may be beneficial in the treatment of lung related 
disorders such as chronic obstructive pulmonary disease, asthma, allergy and emphysema. 
General oncology screening panel_v_2.4 Summary: Ag4128 Highest 

25 expression of the CG97658-01 gene is detected in metastic melanoma (CT=3 1.5). 

Significant expression of this gene is associated with kidney cancer, melanoma and lung 
cancer. Therefore, therapeutic modulation of this gene may be useful in the treatment of 
these cancers. 

30 AY. CG97842-01 : Protein Kinase-Formin-Like Protein 
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Expression of gene CG97842-01 was assessed using the primer-probe set Ag4130, 
described in Table AYA. Results of the RTQ-PCR runs are shown in Tables AYB, AYC 
and A YD. 

Table AYA . Probe Name Ag4130 



Primers 


Sequences 


Length 


Start 
Position 


SEQ 
ID No 


Forward 


5 ' -attcaacatgcccgttaataca-3 1 


22 


2409 


389 


Probe 


TET-5 ' -ccagaacttctacagtagtccaagcaca-3 ' -TAMRA 


28 


2433 


390 


Reverse 


5 • -agagtcatcttggtcactccaa-3 1 


22 


2462 


391 



Table AYB , CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4130, Run 
214959899 


Tissue Name 


Rei. Exp.(%) 
Ag4130, Run 


AD 1 Hippo 


16.4 


Pnntrnl fPfltM ^ Tpmnnral Ptv 


7 7 
1 .z 


AD 2 Hippo |34.6 


v^UIUIUl frailly H I Cllipurdl 




AD 3 Hippo [9.6 


AD 1 Orrinital Ptv 


— 1 . J 


AD 4 Hippo j 1 6.3 


A V} *J PlrT 1 in it Q 1 f 1 tv f\A ice i ihy^ 
r\VJ z. V_/l_i_ipildl V>L\ ^iVlJbblllS^J 


oo 1 


AD 5 Hippo j90.8 


Af) i Orrinitfl! Ptv 

J V_/V-dplLal V^LA 


R 7 

20.2 


AD 6 Hippo 


80.1 


AD 4 Occipital Ctx 


Control 2 Hippo 


33.0 


AD 5 Occipital Ctx 


37.1 


Control 4 Hippo 


13.8 


AD 6 Occipital Ctx 


55.9 


Control (Path) 3 Hippo 


17.0 


Control 1 Occipital Ctx 


4.4 


AD 1 Temporal Ctx 


20.7 


Control 2 Occipital Ctx 


36.3 


AD 2 Temporal Ctx 


42.3 


Control 3 Occipital Ctx 


25.0 


AD 3 Temporal Ctx 


10.2 


Control 4 Occipital Ctx 


10.7 


AD 4 Temporal Ctx 


40.1 


Control (Path) 1 Occipital Ctx~ 


85.3 


ADS Inf Temporal Ctx 


100.0 


Control (Path) 2 Occipital Ctx 


24.8 


AD 5 Sup Temporal Ctx 


58.6 


Control (Path) 3 Occipital Ctx 


3.5 


AD 6 Inf Temporal Ctx 


69.3 


Control (Path) 4 Occipital Ctx 


29.9 


AD 6 Sup Temporal Ctx 


0.1 


Control 1 Parietal Ctx 


5.7 


Control 1 Temporal Ctx 


8.6 


Control 2 Parietal Ctx 


43.5 


Control 2 Temporal Ctx 


29.7 


Control 3 Parietal Ctx 


24.7 


Control 3 Temporal Ctx 


19.8 


Control (Path) 1 Parietal Ctx 


67.4 


Control 3 Temporal Ctx 


12.9 


Control (Path) 2 Parietal Ctx . 


31.0 


Control (Path) 1 Temporal 
Ctx 


50.7 


Control (Path) 3 Parietal Ctx 


5.3 


Control (Path) 2 Temporal 
Ctx 


35.1 


Control (Path) 4 Parietal Ctx 


57.8 
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Table AYC. General jscreeningjane^vl. 4 



Tissue Name 


Rel. Exp.(%) 
Ag4130, Run 
220424645 


Tissue Name 


Rel. 

Exp.(%) 
Ag4130, 
Run 

220424645 


Adipose 


12.0 


Renal ca. TK-10 


34.2 


Melanoma* Hs688(A)T 


27.0 


Bladder 


28.5 


Melanoma* Hs688(B).T 


24.5 


Gastric ca. (liver met.) NCI-N87 


71.7 


Melanoma* M14 


41.5 


Gastric ca. K.ATO III 


47.6 


Melanoma* LOXIMVI 


25.2 


Colon ca. SW-948 . 


10.9 


Melanoma* SK-MEL-5 


33.9 


Colon ca. SW480 


65.5 


Squamous cell carcinoma SCC- 
4 


17.4 


Colon ca.* (SW480 met) SW620 


JO.L 


Testis Pool 


12.9 


Colon ca. HT29 


27.7 


Prostate ca.* (bone met) PC-3 


95.9 


Colon ca. HCT-1 16 


43.8 


Prostate Pool 


8.7 


Colon ca. CaCo-2 


30.8 | 


Placenta 


3.5 jColon cancer tissue 


37.4 


Uterus Pool 


4.2 


Colon ca. SW1I16 


12.5 


Ovarian ca. OVCAR-3 


35.4 


Colon ca. Colo-205 


8.0 


Ovarian ca. SK-OV-3 


61.6 


Colon ca. SW-48 


7.6 


Ovarian ca. OVCAR-4 


27.0 


Colon Pool 


18.6 


Ovarian ca. OVCAR-5 


37.1 


Small Intestine Pool 


15.6 


Ovarian ca IGROV-1 


18.4 


Stomach Pool 


10.7 


Ovarian ca. OVCAR-8 


14.9 


Bone Marrow Pool 


4.2 


Ovary 


11.3 


Fetal Heart 


18.7 


Breast ca. MCF-7 


59.9 


Heart Pool 


9.3 


Breast ca. MDA-MB-231 


46.7 


Lymph Node Pool 


17.0 


Breast ca. BT 549 


54.7 


Fetal Skeletal Muscle 


6.6 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


13.8 


Breast ca. MDA-N 


18.3 


Spleen Pool 


12.2 


Breast Pool 


16.8 


Thymus Pool 


15.3 


Trachea 


9.5 


CNS cancer (glio/astro) U87-MG 


18.2 


Lung 


3.5 


CNS cancer (glio/astro) U-l 1 8- 
MG 


90.1 


Fetal Lung 


50.0 


CNS cancer (neuro;met) SK-N- 
AS 


57.8 


Lungca. NCI-N417 


6.7 


CNS cancer (astro) SF-539 


21.2 


Lung ca. LX-1 


50.0 


CNS cancer (astro) SNB-75 


51.8 
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I lino r*a >jr*T 141/1/; 


1 1 i 


CNS cancer (gho) SNB-19 


16.4 


T lino m ^vT-TP-77 


in a 


CNS cancer (gho) SF-295 


73.7 


I una ra A ^A.Q 


37.4 


Brain (Amygdala) Pool 


6.1 




7.5 


Brain (cerebellum) 


8.2 


I nno cs\ NCI- 


38.2 


Brain (fetal) 


23.5 


I una ca TsICl-H4fiO 


24.0 


Brain (Hippocampus) Pool 


9.1 


Lung ca. HOP-62 


13.0 


Cerebral Cortex Pool 


14.2 


Lung ca. NCI-H522 


46.3 jBrain (Substantia nigra) Pool 


6.2 


Liver 


1 • 1 ! Brain (Thalamus) Pool 


15.6 


Fetal Liver 


28. 1 {Brain (whole) 


5.3 


Liver ca. HepG2 


23.2 


Spinal Cord Pool 


8.1 


Kidney Pool |21.9 'Adrenal Gland 


7 A 


Fetal Kidney 


28.3 jPituitary gland Pool 


8.0 


Renal ca. 786-0 


43.8 jSalivary Gland 


1.9 


Renal ca. A498 


9.2 jThyroid (female) 


4.0 


Renal ca. ACHN j22. 1 jPancreatic ca. CAPAN2 


16.8 


Renal ca. UO-3 1 132.3 jPancreas Pool 


20.0 

_. 



Table A YD. Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag4130. Run 
172859566 


Tissue Name 


Rel. 

Exp.(%) 
Ag4130, 
Run 

172859566 


Secondary Thl act 


39.8 


HUVECIL-lbeta 


43.8 


Secondary Th2 act 


46.3 


HUVEC I FN gamma 


48.3 


Secondary Trl act 


36.3 


HUVECTNF alpha + IFN 
gamma 


34.2 


Secondary Thl rest 


17.3 


HUVECTNF alpha +IL4 


32.8 


Secondary Th2 rest 


15.8 


HUVEC 1L- 11 


30.8 


Secondary Trl rest 


19.1 


Lung Microvascular EC none 


57.0 


Primary Thl act 


41.2 


Lung Microvascular EC 
TNFalpha+IL-lbeta 


55.9 


Primary Th2 act 


34.2 


Microvascular Dermal EC none 


37.1 


Primary Trl act 


33.7 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


37.1 


Primary Thl rest 


28.5 


Bronchial epithelium TNFalpha + 
ILlbeta 


100.0 


Primary Th2 rest 


14.5 


Small airway epithelium none 


12.4 


Primary Trl rest 


22.8 


Small airway epithelium 
TNFalpha+IL-lbeta 


40.9 
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CD45RA CD4 lymphocyte act 


35.8 


Coronery artery SMC rest 


24.3 


CD45RO CD4 lymphocyte act 


37.6 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


23.8 


CD8 lymphocyte act 


37.1 


Astrocytes rest 


34.2 


Secondary CD8 lymphocyte 
rest 


26.4 


Astrocytes TNFalpha + IL-1 beta 


22.8 


Secondary CD8 lymphocyte act 


16.2 


KU-812 (Basophil) rest 


36.6 


i 

CD4 lymphocyte none j 1 2.2 


KU-8 12 (Basophil) 
PMA/ionomycin 


73.7 


2ryThl/Th2/Trl anti-CD95 
CH11 


31.9 


PPDI Iftfi f Keratinorvff><:^ nmip 

VvL/ 1 luu ^l\tlatlllUL \ ICiy nunc 




LAK cells rest 


36.9 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-1 beta 


26.1 


LAK cells 1L-2 


30.1 


Liver cirrhosis 


1 o.z 


LAK cells IL-2+IL-12 


21.8 


NCI-H292 none 


Zh.j 


LAK cells IL-2+IFN gamma |33.9 


NCI-H292 IL-4 


4o.<> 


LAK cells IL-2+IL- 18 |29.5 


NCI-H292 IL-9 


48. U 


LAK cells PMA/ionomycin ! 81.2 jNCl-H292 IL-13 


41.8 


;NK Cells IL-2 rest 136.1 


NCI-H292 I FN gamma 


... , , 

39.2 


Two Way MLR 3 day !35.4 


HP A EC none 


30.4 


Two Way MLR 5 day |31.9 


HPAEC TNF alpha + IL-1 beta 


51.4 


Two Way MLR 7 day [21.8 


Lung Fibroblast none 


39.8 


PBMC rest 


14.6 


Lung fibroblast TNF alpha + IL-1 
beta 


27.0 


PBMC PWM 


27.9 


Lung fibroblast IL-4 


27.5 


PBMC PHA-L 


23.7 


Lung fibroblast IL-9 


36.6 


Ramos (B cell) none 


30.4 


Lung fibroblast IL-13 


31.4 


Ramos (B cell) ionomycin 


35.4 


Lung fibroblast IFN gamma 


46.0 


B lymphocytes PWM 


23.3 


Dermal fibroblast CCD 1070 rest 


45.7 


B lymphocytes CD40L and IL- 
4 


23.2 


Dermal fibroblast CCD 1070 TNF 
alpha 


61.6 


EOL-1 dbcAMP 


29.5 


Dermal fibroblast CCD 1070 
beta 


30.4 


EOL-1 dbcAMP 
PMA/ionomycin 


61.1 


Dermal fibroblast IFN gamma 


24.8 


Dendritic cells none 


57.0 


Dermal fibroblast IL-4 


36.9 


Dendritic cells LPS 


38.2 


Dermal Fibroblasts rest 


17.1 


Dendritic cells anti-CD40 


39.8 


Neutrophils TNFa+LPS 


11.3 


Monocytes rest 


38.4 


Neutrophils rest 


30.1 


Monocytes LPS 


72.2 


Colon 


10.0 



679 



WO 03/010327 



PCT/US02/14199 



Macrophages rest 


36.9 


Lung 


17.9 


Macrophages LPS 


36.1 


Thymus 


47.3 


[HUVEC none 


31.0 


Kidney 


44.8 


HUVEC starved 


38.4 







CNS_neurodegenerahon_v1.0 Summary: Ag4130 This panel confirms the 
expression of the CG97842-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag4130 Highest expression of the 
CG97842-01 gene is detected in a breast cancer cell line (CT=25.6). High expression of 
this gene is seen a cluster of cancer cell lines including CNS, colon, gastric, renal, lung, 
breast, ovarian, prostate, melanoma, squamous cell carcinoma and pancreatic cancer cell 
lines. Therefore, therapeutic modulation of this gene product may be beneficial in the 
treatment of these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

Interestingly, this gene is expressed at much higher levels in fetal (CT=27) when 
compared to adult liver and lung (CT=30-32). This observation suggests that expression of 
this gene can be used to distinguish fetal from adult liver and lung, respectively. In 
addition, the relative overexpression of this gene in fetal tissue suggests that the protein 
product may enhance growth or development of lung and liver in the fetus and thus may 
also act in a regenerative capacity in the adult. Therefore, therapeutic modulation of the 
protein encoded by this gene could be useful in treatment of liver and lung related 
diseases. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
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central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4130 Highest expression of the CG97842-01 gene is - 
detected in TNFalpha + ILlbeta treated bronchial epithelium (CT=28.7). This gene is 
expressed at high to moderate levels in a wide range of cell types of significance in the 
immune response in health and disease. These cells include members of the T-cell, B-cell, 
endothelial cell, macrophage/monocyte, and peripheral blood mononuclear cell family, as 
well as epithelial and fibroblast cell types from lung and skin, and normal tissues 
represented by colon, lung, thymus and kidney. This ubiquitous pattern of expression 
suggests that this gene product may be involved in homeostatic processes for these and 
other cell types and tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_vl.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 



AZ. CG98021-01: Synaptotagmin III 

20 Expression of gene CG9802 1 -0 1 was assessed using the primer-probe set Ag4 1 38, 

described in Table AZA. Results of the RTQ-PCR runs are shown in Tables AZB, AZC, 
AZD and AZE. 

Table AZA. Probe Name Ag4138 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 1 -tcattcattcaacaaacatcca-3 ' 


22 


117 


392 


Probe 


TET-5 1 -agcaccaactacgccctacgtgct-3 • -TAMRA 


24 


90 


393 


Reverse 


5 1 -gaactccaggaccttgttctct-3 1 


22 


45 


394 



Table AZB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4138, Run 
214961618 


Tissue Name 


Rel. 

Exp.(%) 
Ag4138, 
Run 

214961618 


AD 1 Hippo 


8.0 


Control (Path) 3 Temporal Ctx 


1.7 
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AD 2 Hippo 


17 K 


Control (Path) 4 Temporal Ctx 


19.5 


AD 3 Hippo 


1.9 


AD 1 Occipital Ctx 


6.4 


AD 4 Hippo 


2.9 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital Ctx 


1.3 


AD 6 Hippo 


29.3 


AD 4 Occipital Ctx 


14:7 


Control 2 Hippo 


18.7 


AD 5 Occipital Ctx 


18.8 


Control 4 Hippo 


2-0 AD 6 Occipital Ctx 


60.7 


Control (Path) 3 Hippo 


1.4 


Control 1 Occipital Ctx 


0.5 


AD 1 Temporal Ctx 


4.6 


Control 2 Occipital Ctx |76.8 


AD 2 Temporal Ctx 


17.2 


Control 3 Occipital Ctx 


5.3 


AD 3 Temporal Ctx 


2.4 


Control 4 Occipital Ctx 


0.9 


AD 4 Temporal Ctx 


15.3 


Control (Path) 1 Occipital Ctx 


79.6 


AD 5 Inf Temooral Ctx 


81.8 


Control (Path) 2 Occipital Ctx 


8.6 


ADS SupTemporal Ctx 


24.1 


Control (Path) 3 Occipital Ctx 


0.4 


IAD 6 Inf Temooral Ctx 


25.0 jControl (Path) 4 Occipital Ctx 


10.8 


jAD 6 Sup Temporal Ctx 
•Control 1 Temporal Ctx 


28 9 jControl 1 Parietal Ctx 

0 3 {Control 2 Parietal Ctx j 


1.1 

33.7~~ 


jControl 2 Temporal Ctx 


38.7 


Control 3 Parietal Ctx ]s.2 


jControl 3 Temporal Ctx 


9.2 


Control (Path) 1 Parietal Ctx |78.5 


Control 4 Temporal Ctx 


1.7 


Control (Path) 2 Parietal Ctx 129.7 


Control (Path) 1 Temporal 
Ctx 


57.8 


Control (Path) 3 Parietal Ctx 


0.7 


Control (Path) 2 Temporal \ 
Ctx 


36.1 


Control (Path) 4 Parietal Ctx 


38.4 


r 


fable AZC. General sww>ninjr_pan.»| Y 1 i 




Tissue Name 


Rel. Exp.(%) 
Ag4138, Run 
220967147 


iRel. 
Exp.(%) 

Tissue Name Ag4I38, 

Run 

J220967147 


Adipose < 


O.O 


Renal ca. TK- 10 jo.8 


Melanoma* Hs688(A).T 1 


3.0 


Bladder \qq 


Melanoma* Hs688(B).T j( 


3.1 


Gastric ca. (liver met) NCI-N87 jo. 1 


Melanoma* Ml 4 < 


).l i 


Gastric ca. KATO III jo 0 


Melanoma* LOXIMVI ( 


).0 l 


Colon ca. SW-948 jo.O 


Melanoma* SK-MEL-5 ; 


U ( 


Colon ca. SW480 0.6 


Squamous cell carcinoma 
SCC-4 ( 


).0 < 


Colon ca.* (S W480 met) SW620 0. 1 


Testis Pool ( 


).8 ( 


Colon ca. HT29 jQ.Q j 
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Prostate ca.* (bone met) 
PC-3 


0.5 


Colon ca. HCT-116 


6.1 


Prostate Pool 


0.4 


! Colon ca. CaCo-2 


0.7 


Placenta 


0.7 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SWI1 16 


0.0 


jOvarian ca. OVCAR-3 

i — - — — — ~ 


0.1 


Colon ca. Colo-205 


0.0 


,'Ovarian ca. SK-OV-3 [5.4 


Colon ca. SW-48 


0.0 


jOvarian ca. OVCAR-4 |o.5 


Colon Pool jo.2 


iOvarian ca. 0 VCAR-5 1 1 .2 

1 f 


Small Intestine Pool 


0.0 


jwvdnan ca. ivjKvJV-j 


0.8 


Stomach Pool 


0.0 


|Uvdrian ca. UvLAK-o 

j ' 


0.0 


Bone Marrow Pool 


0.0 


iOvary 


0.3 


Fetal Heart jo. l 


Breast ca. MCF-7 


0.3 


Heart Pool 


0.0 


Breast ca. MDA-MB-231 


0.0 


Lymph Node Pool 


0.6 


jBreast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


jBreast ca. T47D 


1.1 


[Skeletal Muscle Pool 
Spleen Pool 


0.0 


Breast ca. MDA-N 


6.7 


0.1 


jBreast Pool 


o.2 S Th y mus p ° o1 


0.0 


'Trachea 


0 0 iCNS cancer (glio/astro) U87-MG jO.O 


l_. 

.Lung 


a o |CNS cancer (slio/astro) U-l 18- 
;MG 


0.1 


Fetal Lung 


0-9 |CNS cancer (neurojmet) SK-N-AS 


1.1 


Lungca.NCI-N417 


0-2 ]CNS cancer (astro) SF-539 


0.0 


Lungca. LX-1 j 


0.0 j 


CNS cancer (astro) SNB-75 


0.0 


Lungca.NCI-H146 j 


0.0 j 


CNS cancer (glio) SNB-19 


0.3 


Lung ca. SHP-77 jO.O j 


CNS cancer (glio) SF-295 


0.7 


Lung ca. A549 |0.7 (Brain (Amygdala) Pool 


11.7 


Lung ca. NCI-H526 jo.O (Brain (cerebel lum) 


98.6 


Lungca.NCI-H23 1 


3.7 


Brain (fetal) 


100.0 


Lungca.NCI-H460 Ji 


D.5 


Brain (Hippocampus) Pool 


12.3 


Lung ca. HOP-62 jO.l i 


Cerebral Cortex Pool : 


24.5 


Lungca.NCI-H522 : 


J.2 ] 


Brain (Substantia nigra) Pool 


17.7 


Liver < 


).0 ] 


Brain (Thalamus) Pool : 


29.3 


Fetal Liver ( 


).0 1 


3rain (whole) 


18.3 


Liver ca. HepG2 ( 


).0 5 


Spinal Cord Pool 


5.0 


Kidney Pool ( 


).l y 


\drenal Gland 


.0 


Fetal Kidney ] 


.2 I 


Mtuitary gland Pool ] 


.9 


Renal ca. 786-0 C 


>.0 j 


Salivary Gland |o.l 
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Renal ca. A498 


1.1 


jThyroid (female) 


0.7 


Renal ca. ACHN 


0.2 


jPancreatic ca. CAPAN2 


0.0 


Renal ca. UO-3 1 


0.3 


jPancreas Pool 


0.2 




Table AZD. Panel 4 in ~~ 


! 

I 
t 

i„. 

jTissue Name 


iRel. Exp.(%) 
Ag4l38. Run 

I / J I 1 oo /O 


Tissue Name 


Rel. 

Exp.(%) 
Ag4138, 
Run 

173118876 


Secondary Thl act 


0 3 


(HUVEC IL-lbeta 


0.5 


Secondary Th2 act 


0.0 


HUVEC I FN gamma 


1-4 


oc^wiiudry in act 


1 A 

jo.o 


|HUVEC TNF alpha + 1FN gamma jo.O 


ocLonaary i n 1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


]huvec IL-1 1 


0.7 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0 0 


Primarv Th 1 art 


0.0 


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


0.0 


{Primarv Th2 act 


0.0 

i 


[Microvascular Dermal EC none 


0.0 


f ' 

Primarv Trl art 


- - 

0.0 


uviiLrosvasuiar Dermal bC 
TNFalpha + IL-1 beta 


0.0 


! Primarv Th 1 re<;t 

j 1 • J 1111 I Col 


0.0 


oruncniai epitneiium l Nralpha + 
ILlbeta 


0.0 


[Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primarv Trl r<»ct 


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 lymphocyte 
act 


0.8 


Coronery artery SMC rest 


0.0 


CD45RO CD4 lymphocyte 
act j 


0.3 


Coronery artery SMC TNFalpha + 
IL-lbeta 


0.7 


CD8 lymphocyte act < 


3.0 


Astrocytes rest 


0.4 


Secondary CD8 lymphocyte 
rest ( 


).5 


Astrocytes TNFalpha + IL- 1 beta 1 


3.0 


Secondary CD8 lymphocyte 
act ( 


).0 ] 


KU-812 (Basophil) rest < 


).8 


CD4 lvmnhrif*\/f<» nnno f 
IJrllipjltJUyiC nunc ^ 


: 


<U-8 12 (Basophil) 
'MA/ionomycin 


1.5 


9rv Th1/Th9/Tri 

**\ y jni/inz/in anti- 

CD95CH11 ~ C 


l.O ( 


XD 1 1 06 (Keratinocytes) none ( 


).0 


LAK cells rest 0 


.0 ( 

1 


"CD1 106 (Keratinocytes) 
rNFalpha + IL-lbeta C 


1.0 


LAK cells IL-2 o 


•7 I 


Aver cirrhosis o 


.1 


LAK cells IL-2+IL-12 jo 


.0 > 


lCI-H292none o 


.0 
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LAK cells IL-2+IFN 

era mm 51 
gai in I la 


0.0 


NCI-H292 IL-4 


0.0 


I AlC cells IL-2+ IL-18 


0 0 


MPT U909 TI Q 




LAK cells PMA/ionomycin 


0.0 


NCI-H292 IL-13 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 3 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-l beta 


0.0 


Two Way MLR 7 day 


1.0 


Lung fibroblast none 


0.9 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-l 
beta 


0.0 


PBMC PWM 


0.5 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


2.0 


Lung fibroblast IL-9 


0.9 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


0.0 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD 1070 rest 


0.0 


B lymphocytes CD40L and 
IL-4 


0.4 


Dermal fibroblast CCD 1070 TNF 
alpha 


0.4 


EOL-1 dbcAMP 


2.7 


Dermal fibroblast CCD 1070 IL-l L 
beta j UU 


\ 

EOL-l dbcAMP 

P\/f A /innninvpiii 
r \y\r\j 


0.5 


Dermal fibroblast IFN gamma jl.O 

i 


V^t-llo I1VJ11C 


0 0 

\J.\J 


Dermal fibroblast IL-4 


0.5 


De»nHritir relU I 


1 0 


Dermal Fibroblasts rest 


2.0 




n o 


Neutrophils TNFa+LPS 


0.0 


iVIUIIULfjrtCd iCbL 


0 0 


Neutrophils rest 


0.8 


Monocytes LPS 


0.0 


Colon 


0.8 


Macrophages rest 


0.0 


Lung 


0.9 


Macrophages LPS 


0.0 


Thymus 


13.4 


HUVEC none 


0.2 


Kidney 


100.0 


HUVEC starved 


0.5 






Table AZE. General oncology screening panel v 2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4138,Run 
268390088 


Tissue Name 


Rel. 

Exp.(%) 
Ag4138, 
Run 

268390088 


Colon cancer 1 


3.1 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


0.0 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


11.8 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 2 


7.6 


Adenocarcinoma of the prostate 1 


0.0 
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Colon cancer 3 


0.0 


Adenocarcinoma of the prostate 2 


0.0 


Colon cancer NAT 3 


0.0 


Adenocarcinoma of the prostate 3 


14.5 


Colon malignant cancer 4 


32.1 


Adenocarcinoma of the prostate 4 


5.4 


Colon normal adjacent 
tissue 4 


0.0 


Prostate cancer NAT 5 


10.4 


Lung cancer 1 


6.0 


•Adenocarcinoma of the prostate 6 


29.7 


Lung NAT 1 


0.0 


; Adenocarcinoma of the prostate 7 ;0.0 


Lung cancer 2 


74.7 


Adenocarcinoma of the prostate 8 iO.O 


Lung NAT 2 


3.7 




Adenocarcinoma of the prostate 9 |6.5 


Squamous cell carcinoma 3 


21.8 




Prostate cancer NAT 1 0 


0.0 


Lung NAT 3 


0.0 


; Kidney cancer 1 


0.0 


metastatic melanoma 1 


6.2 


'KidneyNAT 1 


4.2 


Melanoma 2 


0.0 




Kidney cancer 2 


24.3 


Melanoma 3 


0.0 


;KidneyNAT2 


100.0 


metastatic melanoma 4 


8.3 




Kidney cancer 3 


26.4 


metastatic melanoma 5 


11.0 


;KidneyNAT3 


7.5 


Bladder cancer I 


0.0 




Kidney cancer 4 ill.O 


Bladder cancer NAT I 


0.0 


iKidneyNAT4 |8.5 


Bladder cancer 2 14.3 






1 

i 



: 1 . , ■ 1 | 

CNS_neurodegeneration_vLO Summary: Ag4138 This panel confirms the 
expression of the CG98021-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 



5 experiment Please see Panel 1 A for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag4138 Highest expression of the 
CG98021-01 gene is detected in cerebellum and fetal brain (CT=28). In addition high 
expression of this gene is detected exclusively in all the region of central nervous system 
1 0 examined. Therefore, expression of this gene can be used to distinguish CNS samples 
from other samples used in this panel. In addition, therapeutic modulation of this gene 
may be useful in the treatment of CNS disorders such as Alzheimer's disease, Parkinson's 
disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Low expression of this gene is also seen in a few of colon cancer, renal 'cancer, 
1 5 lung cancer, breast, ovarian and melanoma cell lines. Therefore, therapeutic modulation of 
this gene may be beneficial in the treatement of these cancers. 
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Low expression of this gene is also detected in fetal lung. Interestingly, this gene is 
expressed at much higher levels in fetal (CT=34.8) when compared to adult lung (CT=40). 
This observation suggests that expression of this gene can be used to distinguish fetal from 
adult lung. In addition, the relative overexpression of this gene in fetal lung suggests that 
the protein product may enhance lung growth or development in the fetus and thus may 
also act in a regenerative capacity in the adult. Therefore, therapeutic modulation of the 
protein encoded by this gene could be useful in treatment of lung related diseases. 

Panel 4. ID Summary: Ag4138 Highest expression of the CG98021-01 gene is 
detected in kidney (CT=29.6). In addition low expression of this gene is also detected in 
thymus, eosinophils and PMA/ionomycin treated basophils. Therefore, expression of this 
gene can be used to distinguish these samples from other samples used in this panel. In 
addition, therapeutic modulation of this gene may be useful in the treatment of 
inflammation and autoimmune disease that affect kidney including lupus and 
glomerulonephritis. 

General oncology screening panel_v_2.4 Summary: Ag4138 Highest 
expression of the CG98021-01 gene is detected in kidney (CT=34.5). In addition, low 
expression of this gene is also seen in lung cancer sample. Please see Panel 1 .4 and 4. 1 D 
for a discussion of the potential utility of tiiis gene. 

BA. CG98030-01: Tyrosine Phosphatase 

Expression of gene CG98030-01 was assessed using the primer-probe set Ag4139, 
described in Table BAA. Results of the RTQ-PCR runs are shown in Tables BAB and 
BAC. 



Table BAA . Probe Name Ag4139 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 1 -ccaattccatacaaaccaagag-3 1 


22 


548 


395 


Probe 


TET-5 1 -actcagtttctaactcctctccgcaa-3 ■ -TAMRA 


26 


595 


396 


Reverse 


5 ' -tgggatcacagcaagagaatat-3 ' 


22 


622 


397 



Table BAB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4139, Run 
214962531 


Tissue Name 


Rel. 

Exp.(%) 
Ag4139, • 
Run 

214962531. 
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AD 1 Hinno 




Control (Path) 3 Temporal Ctx 


0.0 


AD 9 Winnn 


12.4 


Control (Path) 4 Temporal Ctx 


12.8 


AD 3 Hi'ddo 


o.z 


AD 1 Occipital Ctx 


13.4 


AD 4 Hinnn 


1 0 n 
1 0.0 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hinno 


DZ.4 


AD 3 Occipital Ctx 


10.8 


AD 6 Hippo 


73.7 


AD 4 Occipital Ctx 


47.5" 


Control 2 Hippo 


21.0 


AD 5 Occipital Ctx 


38.2 


Control 4 Hippo 


14.8 


AD 6 Occipital Ctx 


|30.4 


Control (Path) 3 Hippo 




Control 1 Occipital Ctx 


6.3 


AD 1 Temporal Ctx 


{21.9 


Control 2 Occipital Ctx 


28.1 


AD 2 Temporal Ctx (26.6 


Control 3 Occipital Ctx 


19.5 


AD 3 Temporal Ctx |5.3 


Control 4 Occipital Ctx 


1.5 


AD 4 Temporal Ctx 


6.6 


Control (Path) 1 Occipital Ctx 


100.0 


AD 5 Inf Temporal Ctx {56.3 


Control (Path) 2 Occipital Ctx 


|6.6 


AD 5 Sup Temporal Ctx (84.7 


Control (Path) 3 Occipital Ctx 




AD 6 Inf Temporal Ctx 19 1 .4 |contro! (Path) 4 Occipital Ctx 


'4.8 


AD 6 Sup Temporal Ctx |44.8 {Control 1 Parietal Ctx if 7 " " 


Control 1 Temporal Ctx 


1.1 
55.9 


Control 2 Parietal Ctx -37.4 


Control 2 Temporal Ctx 


Control 3 Parietal Ctx !4.5 


iControI 3 Temporal Ctx 


27.5 


Control (Path) 1 Parietal Ctx 162.9 


Control 3 Temporal Ctx 


3.1 


Control (Path) 2 Parietal Ctx 


24.8 


Control (Path) 1 Temporal 
Ctx 


31.9 


Control (Path) 3 Parietal Ctx 


0.0 


Control (Path) 2 Temporal 
Ctx 


28.7 


Control (Path) 4 Parietal Ctx 


75.8 


Table BAC. General nnmlooy <^r?f nin« panel v 21 




Tissue Name 


Rel. Exp.(%) 
Ag4139,Run 
268392257 


Tissue Name 

1 

* 


Rel. 

Exp.(%) 
*g4139, 
Run 

268392257 


Colon cancer 1 \\ 


24.7 ] 


Bladder cancer NAT 2 ( 


).0 


Colon cancer NAT 1 ( 


>.7 ] 


Bladder cancer NAT 3 ; 


1.1 


{Colon cancer 2 ] 


9.2 J 


3ladder cancer NAT 4 ( 


).0 


Colon cancer NAT 2 ( 


).6 


\denocarcinoma of the prostate 1 |l 


i.O 


Colon cancer 3 4 


13.8 \ t 


Adenocarcinoma of the prostate 2 1 


.1 


Colon cancer NAT 3 1 


.0 \t 


\denocarcinoma of the prostate 3 2 


!.8 


Colon malignant cancer 4 1 


00.0 / 


Adenocarcinoma of the prostate 4 9 


M 


Colon normal adjacent C 


10 jProstate cancer NAT 5 0 


.4 
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tissue 4 








Lung cancer 1 


1L0 


Adenocarcinoma of the prostate 6 


0.8 


Lung NAT 1 


0.8 


Adenocarcinoma of the prostate 7 


0.8 


Lung cancer 2 


25.7 


Adenocarcinoma of the prostate 8 


0.6 


Lung NAT 2 


1.6 


Adenocarcinoma of the prostate 9 


2.2 


Squamous cell carcinoma 


18.2 


Prostate cancer NAT 1 0 jo.3 


1 una NAT 1 

L/ 11 1 1 ii, IN/A t ~J 




Kidney cancer 1 j3.4 


1 1 j v laoiciii v 1 1 iv< 1 at iv/i i ici i 


4 1 


KidneyNAT 1 


1.0 


\A r la no ma 7 
jviciainjnia z. 




Kidney cancer 2 


9.5 


Melanoma 3 


0.8 


Kidney NAT 2 


2.3 


metastatic melanoma 4 


4.5 


Kidney cancer 3 


1.7 


metastatic melanoma 5 


3.1 


Kidney NAT 3 


0.4 


Bladder cancer 1 


0.1 


Kidney cancer 4 |6.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 jl.4 


Bladder cancer 2 


0.8 





CNS_ncurodegeneration_vl.O Summary: Ag4139 This panel confirms the 
expression of the CG98030-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 



5 experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

Panel 4.1D Summary: Ag4139 Expression of the CG98030-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
General oncology screening panel_y_2.4 Summary: Ag4139 Highest 
10 expression of the CG98030-01 gene is detected in malignant colon cancer sample 

(CT=29). Interestingly, higher expression of this gene is associated with number of cancer 
samples examined including colon, lung, melanoma, metastic melanoma, adenocarcinoma 
of the prostate, and kidney cancers. Therefore, expression of this gene may be used as 
diagnostic markers for these cancers. In addition, therapeutic modulation of this gene may 
15 be be useful in the treatments of these cancers. 

BB. CG98030-02: Tyrosine Phosphatase 
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Expression of foil length physical clone CG98030-02 was assessed using the 
primer-probe set Ag6401, described in Table BBA. Results of the RTQ-PCR runs are 
shown in Table BBB. 

Table BBA . Probe Name Ag6401 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 1 -atgaatctgtagaaaaggaggaactaa-3 1 


27 


1676 


398 


Probe 


TET-5 • -tctttctgccacatttctaccttcct-3 • -TAMRA 


26 


1705 


399 


Reverse 


5 1 -ctccatctcgggaattaagc-3 ■ 


20 


1731 


400 



Tissue Name 


Rel. Exp.(%) 
Ag6401,Run 
269239989 


Tissue Name 


Rel Exn (%\ 
Ag6401,Run 
269239989 


Secondary Th 1 act 


17.2 


HUVEC IL-lbeta 


37.6 


Secondary Th2 act 


17.2 


HUVEC I FN gamma 


33.9 


Secondary Trl act 


5.0 


HUVEC TNF alpha + 1FN gamma 


16.5 


Secondary Th 1 rest 


0.0 jHUVEC TNF alpha + IL4 


23.8 


Secondary Th2 rest 


4.9 
3.7" 


jHUVECIL-11 


13.4 
IOOjO 


Secondary Trl rest 


|Lung Microvascular EC none 


Primary Thl act 


58 2 | Lun S Microvascular EC TNFalpha 
j+ IL-lbeta 


32.8 


Primary Th2 act 


28 3 |Microvascular Dermal EC none 


14.1 


Primary Trl act 


35.4 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


9.3 


Primary Thl rest 


1.8 


Bronchial epithelium TNFalpha + 
ILlbeta 


21.0 


Primary Th2 rest 


2.5 


Small airway epithelium none 


12.2 


Primary Trl rest 


3.2 


Small airway epithelium TNFalpha 
+ IL-lbeta 


11.9 


CD45RA CD4 lymphocyte 1 
act j 


30.6 


Coronery artery SMC rest 


37.9 


CD45RO CD4 lymphocyte 

act j 54 * 3 


Coronery artery SMC TNFalpha + 
IL-lbeta 


49.0 


CD8 lymphocyte act 


37.1 


Astrocytes rest 


19.9 


Secondary CDS lymphocyte 
rest 


5.8 


Astrocytes TNFalpha + IL-1 beta 


12.9 


Secondary CD8 lymphocyte 
act 


5.4 


KU-812 (Basophil) rest 


13.4 


CD4 lymphocyte none 


5.6 


KU-8 12 (Basophil) 
PMA/ionomycin 


5.3 
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2ryThl/Th2/Trl anti- 
CD95 CH11 


8.0 


CCD1 106 (Keratinocytes) none 


97.9 


LAK cells rest 


21.8 


CCD1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


41 5 


LAK cells IL-2 


5.6 


Liver cirrhosis 


17.3 


LAK cells IL-2+IL-12 


1.6 


NCI-H292 none 


13.3 


LAK cells IL-2+IFN 
tram ma 

gfll III 1 ICl 


3.4 


NCI-H292 IL-4 


33.4 


LAK cells IL-">-f- II -IS 




iN^i-rijiVZ il-7 


Oo.J 


LAK cells PMA/ionomycin 


6.9 


NCI-H292IL-13 


24.7 


NK Cells IL-2 rest 


16.3 


NC1-H292 I FN gamma 


25.0 


Two Way MLR 3 day 


5.1 


HPAEC none 


20.2 


Two Way MLR 5 day 


2.5 


HPAEC TNF alpha + IL-1 beta 


64.6 


Two Way MLR 7 day 


2.3 


Lung fibroblast none 


61.1 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-1 
oeia 


16.5 


PBMC PWM 


1.9 


Lung fibroblast IL-4 


17.0 


PBMC PHA-L 


17.7 


Lung fibroblast IL-9 


14.1 


Ramos (B cell) none 


6.8 


Lung fibroblast IL-13 


9.9 j 


Ramos (B cell) ionomycin 


9.5 


Lung fibroblast IFN gamma 


15.2 


B lymphocytes PWM 


8.0 


Dermal fibroblast CCD 1070 rest 


78.5 ! 


B lymphocytes CD40L and 
IL-4 


15.2 • 


Dermal fibroblast CCD 1 070 TNF 
alpha 


63.3 


EOL-1 dbcAMP 


6.2 


Dermal fibroblast CCD 1070 IL-1 
beta 


47 0 


EOL-1 dbcAMP 

PM A/iAnrtinvpin 


0.0 


Dermal fibroblast IFN gamma 


9.3 


OpnHritif* ppN^ nonp 




UCl Midi IlOiUUlabl 


Al A 


HenHritir <*p11q I 


ft 0 


uei iiiai riDrooiasis resi 


J7.0 


DpnHritir rplls anti-f^DdO 




TsJpntrnnhilc TTSlFn+I 
iNCUirupililb llNrd~Lrj 


1 5? 






iNCuuopiiiib re 5 1 


*f . 1 


Monocytes LPS 


5.9 


Colon 


3.3 


Macrophages rest 


2.5 


Lung 


6.7 


Macrophages LPS 


0.0 


Thymus 


4.0 


HUVEC none 


16.8 


Kidney 


38.7 


HUVEC starved 


54.3 






Panel 4.1D Summary: Ag6401 I 


lighest expression of the CG9803O-O2 is 



detected in lung microvascular endothelial cells and keratinocytes (CT=32.8). In addition, 
low levels of expression of this gene is also seen in dermal fibroblast, lung fibroblast, TNF 
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alpha + IL-1 beta treated HPAEC, HUVEC, resting LAK cells, activated primary Thl 
Th2, and Trl cells, activated CD4 lymphocytes and kidney. Therefore, therapeutic 
modulation of this tyrosine phosphotase encoded by this gene may lead to the alteration of 
functions associated with these cell types and lead to improvement of the symptoms of 
patients suffering from autoimmune and inflammatory diseases such as asthma, allergies 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

BC. CG98061-01 and CG98061-02: Novel Protein containing Histidine acid 
phosphatase domain 

Expression of gene CG98061-01 and variant CG98061-02 was assessed using the 
pnmer-probe set Ag4141, described in Table BCA. Results of the RTQ-PCR runs are 
shown in Tables BCB, BCC and BCD. 

Table BCA. Probe Name A g 4141 




Probe jTET - 5 ' - tgacacactg gagtctaatctccaaca- 3 ■ -T.AMRA: 07 
Reverse k • -rrr-=.,-,~=„,...-_.-_JL ' ~. " r 



Reverse jS ' -gcaccagag tctgttaat'tcca-V' 



2265 



2303 



j 402 
r~403~ 



Tissue Name 


Rel. Exp.(%) 
Ag4141,Run 
215281611 


Tissue Name 


Rel. 

Exp.(%) 
Ag4I41, 
Run 

21528161 1 


AD 1 Hippo 


15.9 


Control (Path) 3 Temporal Ctx 


4.4 


AD 2 Hippo 


20.7 


Control (Path) 4 Temporal Ctx 


m 


AD 3 Hippo 


6.6 


AD 1 Occipital Ctx 


13.1 


AD 4 Hippo 


2.8 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


58.6 


AD 3 Occipital Ctx 


6.7 


AD 6 Hippo ~^ 


100.0 


AD 4 Occipital Ctx 


12.4 


Control 2 Hippo 


17.7 ~~[ 


AD 5 Occipital Ctx 


18.0 


Control 4 Hippo 


13.9 | 


AD 6 Occipital Ctx 


21.9 


Control (Path) 3 Hippo 


7.1 jControl 1 Occipital Ctx 


6.0 


AD 1 Temporal Ctx ~f 


13-5 (Control 2 Occipital Ctx 


37.4 


AD 2 Temporal Ctx 


23.3 jControl 3 Occipital Ctx 


11.8 


AD 3 Temporal Ctx 


1.6 < 


"ontrol 4 Occipital Ctx 


S.O 


AD 4 Temporal Ctx 


12.9 ( 


-ontrol (Path) 1 Occipital Ctx < 


*8.6 
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AD 5 Inf Temporal Ctx 


62.4 


v^vuuui \x aulj jl UCGipilal v^LX. 


O.J 


AD 5 Sud TemDoral Ctx 

i mMS */ UUp A vHIUvl ill VIA 


48.3 


CVintml CP^\\C\ ^ OrYMnital ffv 
v^uiiuuj \l<xwi) j> uccipiiai i^ix 




AD 6 Inf Temporal Ctx 


83.5 


Pnntrnl (Path} 4 HrvMnital Prv 


7 ^ 


AD 6 Sup Temporal Ctx 


70.7 


Control 1 Parietal Ptv 


7 0 


Control 1 Temporal Ctx 


4.5 


Control ^ Parietal Ctx 




Control 2 Temporal Ctx 


18.7 


Control 3 Parietal Ctx 


11.5 


Control 3 Temporal Ctx 


10.5 


Control (Path) 1 Parietal Ctx 


37.9 


(Control 3 Temporal Ctx 


7.5 


Control (Path) 2 Parietal Ctx 


18.3 


jControI (Path) 1 Temporal Ctx 


30.8 


Control (Path) 3 Parietal Ctx 


5.4 


Control (Path) 2 Temporal Ctx 


22.1 


Control (Path) 4 Parietal Ctx 


29.3 


Table BCC. Panel 4.1 D 


Tissue Name 

i 

I 


Rel. Exp.(%) 
Ag4141, Run 
173118944 


Tissue Name 


Rel. 

Exp.(%) 
Ag4141, 
Run 

173118944 


| Secondary Thl act 


82.4 


HUVEC IL-lbeta 


19.8 


Secondary Th2 act 


100.0 


HUVEC IFN gamma |l7.6 


Secondary Trl act 


62.0 


HUVEC TNF alpha + IFN gamma j!5.4 


Secondary Th 1 rest ,=6.9 


HUVEC TNF alpha + 1L4 


25.3 


Secondary Th2 rest 


11.8 


HUVEC IL- 11 j8.3 


Secondary Trl rest 


11.2 


Lung Microvascular EC none 


24.5 


Primary Th 1 act 


61.1 


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


17.2 


Primary Th2 act 


88.3 


Microvascular Dermal EC none 


14.1 


Primary Trl act 


88.3 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


11.8 


Primary Thl rest 


11.2 


Bronchial epithelium TNFalpha + 
ILlbeta 


12.6 . 


Primary Th2 rest 


5.8 


Small airway epithelium none 


5.8 


Primary Trl rest 


15.4 


Small airway epithelium TNFalpha 
+ IL-lbeta 


10.9 


CD45RA CD4 lymphocyte act 


55.9 


Coronery artery SMC rest 


9.4 


CD45RO CD4 lymphocyte act 


91.4 


Coronery artery SMC TNFalpha + 
IL-lbeta 


8.4 


CD8 lymphocyte act 


60.7 


Astrocytes rest 


10.2 


Secondary CD8 lymphocyte 
rest 


56.6 


Astrocytes TNFalpha + IL- 1 beta 


8.2 


Secondary CD8 lymphocyte 
act 


21.8 


KU-812 (Basophil) rest 


35.4 


CD4 lymphocyte none 


4.3 


KU-8 12 (Basophil) 


66.0 
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PMA/ionomycin 




2ryThl/Th2/Trl anti-CD95 
CH1 1 


14.6 


CCD1 106 (Keratinocytes) none 


m 4 


LAK cells rest 


19.2 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


J31.2 


LAK cells IL-2 


28.9 


Liver cirrhosis 


12.7 


|LAK cells IL-2+IL-12 


1 1 


NCI-H292 none 


|2o.d 


ILAK cells IL-2+IFN gamma 


1 1 8.3 . 


|NC1-H292 IL-4 


|63.7 


LAKcellsIL-2+IL-I8 |I6.6 ;NCI-H292 IL-9 


J69.7 


LAK cells PMA/ionomycin 


20.4 


NCI-H292 IL-13 


169.3 


NK Cells IL-2 rest 


40.1 


NCI-H292 I FN gamma 


67.8 


Two Way MLR 3 day 


24.3 


HPAEC none 


1 1.7 


Two Way MLR 5 day 


29.7 


HPAEC TNF alpha + IL-1 beta \24. 1 


Two Way MLR 7 day 


23.8 


Lung fibroblast none J 1 0.9 


jPBMC rest 

i 


3.4 

* 


Lung fibroblast TNF alpha + IL-1 !,„ 
beta 12 - 3 


! PBMC PWM J30.4 


— — 5 

Lung fibroblast IL-4 j 24. | 


PBMCPHA-L J31.6 jLung fibroblast IL-9 


iRamos (B cell) none |26.1 iLung fibroblast IL-13 ^4 3 


Ramos (B cell) lonomycin i30.1 iLung fibroblast 1FN gamma 39 2 


B lymphocytes PWM j43.2 permal fibroblast CCD] 070 ™t i 


B lymphocytes CD40L and IL- 
4 


25.0 


Dermal fibroblast CCD 1 070 TNF 
alpha 


54.3 


EOL-1 dbcAMP 


51.8 

i 


Dermal fibroblast CCD 1 070 IL- 1 
beta 


26.1 


EOL-1 dbcAMP 
PMA/ionomycin 


20.2 


Dermal fibroblast I FN gamma 


29.1 


Dendritic cells none 


14.4 


Dermal fibroblast IL-4 


36.9 


Dendritic cells LPS 


12.1 


Dermal Fibroblasts rest 


15.0 


Dendritic cells anti-CD40 


15.9 


Neutrophils TNFa+LPS 


2.0 


Monocytes rest 


7.1 


Neutrophils rest 


5.6 


Monocytes LPS i; 


28.9 


Colon 


5.8 


Macrophages rest 


10.9 


Lung 


11.8 


Macrophages LPS < 


U 


Thymus ; 


24.5 


HUVEC none 


4.6 ] 


fCidney 14.5 


HUVEC starved 


!1.9 






Table B( 


CD. General oncology screening panel v 2.4 


I 

Tissue Name / 

2 


lei. Exp.(%) 
Vg4141,Run ' 
.68392261 


I 

rissue Name \ 


lei. 

£xp.(%) 
\g4141, 
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Run 

zoojyzzol 


Colon runrpr 1 




Rlorlrior- oon^pr XT AT 1 

oiaauer cancer in a i z 


ft ft 
U.U 


Colon ranrpr MAX 1 


0 0 


oiduuer cancer in a i j 


ft £ 
U.O 


Colon ranrpr 9 


0 0 


R 1 carl H ^on^or XI A 1 A 

Diduuer cancer ina i h 


1 .O 


Colon ranrpr MAX *) 




/AucNuccirciiiunid oi uie probidic 1 




Colon fflnr^r ^ 

1 


1 1 


/AuciiucdrcinoiTia oi me prostate z 


ft ft 
U.U 


Colon cancer NAT ^ 


1 3 


Auenocarcinoma or ine piostate j 


1 ft A 
1 U.4 


Colon mfllionAnt canrpr 4 


8 7 


Auenocarcinoma or ine prosiaie h 




Colon normal aHiaf*f*nt ti^Qiif* A 


n o 


rrosiaie cancer in a i j 


^ A 


L-UIlii CdllCCI | 


8 4 

O.H 


Adenocarcinoma of the prostate 6 


5.2 


T uno NAT 1 


1. A 


Aaenocarcinoma oi ine prostate / 


1 1 
1 . 1 




Zj. / 


Aaenocarcinoma oi tne prostate o 


I A 
I .4 


Lung NAT 2 


1.8 


Aaenocarcinoma oi tne prostate v 


1 /.4 


Squamous cell carcinoma 3 


2.3 


riostate cancer in a i iu 


A /t 

U.4 


Lung NAT 3 


3.2 


Kidney cancer 1 


zO.o 
4.6 


metastatic melanoma 1 


29.5 


Money in a i i 


Melanoma 2 


0.5 


Kianey cancer z 


1 AA A 


Melanoma 3 fo.O 


KidnevNAT 0 


7 8 


metastatic melanoma 4 [6.4 


Kidney cancer 3 


31.6 


metastatic melanoma 5 


6.4 


Kidney NAT 3 


4.5 


Bladder cancer 1 


0.9 


Kidney cancer 4 


3.1 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


1.5 


Bladder cancer 2 


0.9 







CNS_neurodegeneration_vLO Summary: Ag4141 This panel confirms the 
expression of the CG98061-01 gene at low levels in the brain in an independent group of 
individuals. This gene is found to be slightly upregulated in the temporal cortex of 
Alzheimer's disease patients. Therefore, therapeutic modulation of the expression or 



5 function of this gene may decrease neuronal death and be of use in the treatment of this 
disease. 

General_screening_panel_vl.4 Summary: Ag4141 Results from one 
experiment with the CG98061-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 
10 Panel 4.1D Summary: Ag4141 Highest expression of the CG98061-01 gene is 

detected in activated secondary Th2 cells (CT=29.6). This gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 
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health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern suggests a role for the gene product in cell survival and proliferation. 
Therefore, modulation of the gene product with a functional therapeutic may lead to the 
alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4141 Highest 
expression of the CG98061-01 gene is detected in kidney cancer sample (CT=31.8). In 
addition, significant expression of this gene is also seen in lung cancer, metastatic 
melanoma, kidney cancer and adenocarcinoma of the prostate. Therefore, therapeutic 
modulation of this gene may be beneficial in the treatment of these cancers. 



20 



BD. CG98131-01: MDJ6 

Expression of gene CG98131-01 was assessed using the primer-probe set Ag4144, 
described in Table BDA. Results of the RTQ-PCR runs are shown in Tables BDB, BDC ' 
and BDD. 

Table BDA. Probe Name Ag4144 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5 ' -cggaggacatcaagaaagc-3 • 


19 


137 


404 


Probe 


TET-5 ' -ctaccgcaagctggcccttcgtt-3 • -TAMRA 


23 


156 


405 


Reverse 


5 ' -tcctccttattgtcagggttct-3 ' 


22 


191 


406 



Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

215294700 


Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

215294700 


AD 1 Hippo 


0.0 


Control (Path) 3 Temporal Ctx 


0.0 


AD 2 Hippo 


0.0 


Control (Path) 4 Temporal Ctx 


0.0 
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AD 3 Hippo 


0 0 


ajj l Uccipital Ctx 


0.0 


AD 4 Hippo 


VI. u 


au z uccipital Ctx (Missing) 


0.0 


AD 5 hippo 


0 0 

V.v 


aij j occipital ctx 


0.0 


AD 6 Hippo 


In 0 


>\u 4 uccipital ctx 


0.0 


Control 2 Hippo 


0 0 


t\u j uccipital ctx 


0.0 


Control 4 Hippo 


lo 0 


o uccipital ctx 


0.0 


Control (Path) 3 Hippo 


124 8 


v^oniroi i uccipital Ctx 


0.0 


AD I Temporal Ctx 


126 6 


^onrroi z uccipital ctx 


13.2 


AD 2 Temporal Ctx 


io 0 


control j uccipital Ctx 


0.0 


AD 3 Temporal Ctx 


lo 0 

jv.v 


control 4 uccipital Ctx 


0.0 


AD 4 Temporal Ctx 


1? 1 £ 

jZ 1 .0 


Control (Path) 1 Occipital Ctx 


19.5 


AD 5 Inf Temporal Ctx 


In 0 


control (rath) 2 Occipital Ctx 


0.0 


AD 5 SupTempora! Ctx 


ft 0 


control (rath) j Occipital Ctx 


0.0 


AD 6 Inf Temooral Ctx 


u.u 


Control (rath) 4 Occipital Ctx 


o.o 


AD 6 Sup Temporal Ctx 


100.0 


Control 1 Parietal Ctx 


0.0 


^Control 1 Temooral Ptv 


0.0 


Control 2 Parietal Ctx 


b.o 

i 


[Control 2 Temporal Ctx 


47.3 


Control 3 Parietal Ctx 


0.0 


•Control 3 Temporal Ctx 


0.0 


Control (Path) 1 Parietal Ctx 


47.0 


Control 4 Temporal Ctx 


0.0 


Control (Path) 2 Parietal Ctx 


o.o 


Control (Path) 1 Temporal Ctx 


0.0 {Control (Path) 3 Parietal Ctx 


0.0 


Control (Path) 2 Temporal Ctx 


0.0 


Control (Path) 4 Parietal Ctx 


u.u 


Table BDC. General screeningjpanpl vi d 




] 
] 

Tissue Name A 

] 


Rel. 

Exp.(%) 
^g4144, 
R.un 

12 1000453 


Tissue Name 


Rel. 

cxp.{ /0 ) 

Ag4144, 
Run 

221000453 . 


Adipose ( 


).0 


Renal ca. TK-10 


ft A 

u.o 


Melanoma* Hs688(A).T ( 


).0 


Bladder 


ft ft 

0.0 


Melanoma* Hs688(B).T ( 


).0 ( 


3astric ca. (liver met.) NCI-N87 


1.1 


Melanoma* Ml 4 ( 


).0 ( 


3astric ca. KATO III 1 


3.6 


Melanoma* LOXIMVI jo.O ( 


"olon ca. SW-948 \{ 


"\ A 

3.0 


Melanoma* SK-MEL-5 |o.O jColon ca. S W480 ( 


).0 


Squamous cell carcinoma SCC-4 Jo.O |Colon ca.* (S W480 met) SW620 ( 




Test,s P ° o1 |l 00.0 |Colon ca. HT29 ( 


).0 


Prostate ca.* (bone met) PC-3 0.0 |Colon ca. HCT- 116 ( 


).9 


Prostate Pool j 0 . 0 jColon ca. CaCo-2 ( 


).0 


; Placenta |0.0 jColon cancer tissue t 


1.0 


Uterus Pool Jo.O |( 


:olonca.SW1116 ( 


1.0 
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ovarian ca. uvtAKo 


2.7 


Colon ca. CoIo-205 


0.5 


vjvanan ca. oiv-uv-j 


0.6 


Colon ca. SW-48 


0.0 


v^vanan ca. uvlak-4 


0.0 


Colon Pool 


0.6 


vyvanan ca. uv\»,ako 


1.0 


Small Intestine Pool 


1.3 


uvanan ca. ivjjKw v - 1 


0.0 


Stomach Pool 


0.0 


C\\tnr\+% n r»o C\\I ADO 

vyvanan ca. uvlak-o 


0.6 


Bone Marrow Pool 


0.5 


wvary 


0.0 


Fetal Heart 


0.0 


Dreasi ca. iviL,r-/ 


0.0 


Heart Pool 


0.0 


Breast ca. MDA-MB-231 


0.2 


Lymph Node Pool 


0.5 


Breast ca. BT 549 


0.8 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.2 


Thymus Pool 


2.0 


Trachea Jo.O 


CNS cancer (glio/astro) U87-MG 


0.0 


Lung 


0 0 jCNS cancer (glio/astro) U-l 1 8- 
jMG 

0 Q ICNS cancer (neuro;met) SK-N- 
jAS 


0.0 
0 

__. 


jFetal Lung 


jLung ca. NCI-N417 jO.O ICNS cancer (astro) SF-539 


Lungca. LX-I 0.0 


CNS cancer (astro) SNB-75 


u 


[Lung ca.NCI-H 146 |o.6 


CNS cancer (glio) SNB-19 


0.0 


Lung ca. SHP-77 


0.6 


CNS cancer (glio) SF-295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) |o.5 


Lung ca. NCI-H23 


0.0 


Brain (fetal) J 1.5 


Lungca. NCI-H460 


0.0 


Brain (Hippocampus) Pool jo.O 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool jl.5 


Lung ca. NCI-H522 


0.0 


Brain (Substantia nigra) Pool jo.5 


Liver 


0.0 


Brain (Thalamus) Pool ]o.O 


Fetal Liver 


0.0 


Brain (whole) jo.4 


Liver ca. HepG2 


0.0 


Spinal Cord Pool jo.O 


Kidney Pool 


2.6 


Adrenal Gland jo.O 


Fetal Kidney 


0.0 


Pituitary gland Pool jo.O 


Renal ca. 786-0 


0.0 


Salivary Gland jo.O 


Renal ca. A498 


0.0 


Thyroid (female) jo.O 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 ]2.5 


Renal ca. UO-3 1 


0.0 


Pancreas Pool 0.7 



Table BDD . Panel 4.1D 
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Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

173120070 


{Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

1 /J 120070 


Secondary Th 1 act 


0.0 


IhI IVFP If -1 h^ta 


a a 
U.U 


Secondary Th2 act 


jo.o 


|HUVFP IFN oamma 

nu v gl- l r in iiamma 


A A 

u.o 


Secondary Trl act 


3.4 


jHUVEC TNF alpha + I FN 

(Til m m c\ 


0.0 


Secondary Th 1 rest 


0.0 


IHI IVFP TMF olrJ-ia 4- 11 /I 

jnu v ck* i iNr aipna -r il,4 


A A 

0.0 


Secondary Th2 rest 


0.0 


|huvecil-ii 


0.0 


Secondary Trl rest 


2.2 


I — — — 

(Lung Microvascular EC none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


n n 

U.U 


rnmary 1 nz act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


i, 7 

f 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


■Primary Th 1 rest 


0.0 

1 


Bronchial epithelium TNFalpha + 
ILIbeta 


0.0 


•rnmary i Ii2 rest 


(— — . 
0.0 


Small airway epithelium none 




Primary Trl rest 


0.0 


Small airway epithelium 
i jNrajpna i i l- i oeta 


0.0 


CD45RA CD4 lymphocyte act 


0.0 


^oronery arrery oJVic rest 




0.0 


CD45RO CD4 lymphocyte act 


0.0 


Coronery artery SMC TNFalpha 


0.0 


CD8 lymphocyte act 


3.1 


Astrocytes rest 


1.6 


Secondary CD8 lymphocyte rest 


3.8 


Astrocytes TNFalpha + IL-1 beta 


0.0 


Secondary CD8 lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


2.7 


KU-8 12 (Basophil) 
PMA/ionomycin 


in i 

1 V. 1 


zry ini/inz/lrl antj-CD95 
CHI 1 


0.0 ( 


CCD 1 1 06 (Keratinocytes) none 


0.0 


LAK cells rest { 


0.0 ! 


:CD1106(Keratinocytes) 
rNFalpha + IL-1 beta ' 


D.O 


LAK cells IL-2 < 


5-0 ' Liver cirrhosis i 


3.0 


LAK cells IL-2+IL- 12 ( 


)0 (NCI-H292 none 


7.8 


LAK cells IL-2-HFN gamma ( 


) 0 JNCI-H292 IL-4 j( 


).0 


LAK cells IL-2+ IL-1 8 ( 


).0 |NCI-H292 IL-9 ( 


).0 


LAK cells PMA/ionomycin ( 


).9 JNCI-H292IL-13 ( 


).0 


NK Cells IL-2 rest ] 


2.2 jNCI-H292 1 FN gamma ( 


).0 


Two Way MLR 3 day ( 


) 0 jHPAEC none ( 


>.0 
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Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + 1L-1 beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


3.6 


PBMC rest 


9.1 


Lung fibroblast TNF alpha + IL-1 
beta 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


jPBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


IRamos (B cell) none 


0.0 


Lung fibroblast IL-1 3 


0.0 


Ramos (B cell) ionomycin 


0.0 [Lung fibroblast I FN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD 1070 rest 


0.0 


B lymphocytes CD40L and IL-4 


0.0 


Dermal fibroblast CCD1070 TNF 
alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast I FN gamma 


0.0 


Dendritic cells none 
r — 


0.0 


Dermal fibroblast IL-4 ;0.0 


jDendritic cells LPS 

r ■■■ — 


0.0 


Dermal Fibroblasts rest jo.O 


jDendritic cells anti~CD40 • JO.O 


Neutrophils TNFa+LPS 0.0 


Monocytes rest 7.2 [Neutrophils rest !o n ! 


Monocytes LPS je.4 jColon IQ.O j 


Macropliages rest 


0.0 jLung jo.O i 


Macrophages LPS 


0.0 


Thymus 110.7 


HUVEC none 


0.0 Kidney jlOO.O 


HUVEC starved 


0.0 


i 



CNS_neurodegeneration_vl.O Summary: Ag4144 Highest expression of the 



CG98 131-01 gene is detected in superior temporal cortex of an Alzheimer's disease patient 
(CT=34.1). Therefore, therapeutic modulation of the expression or function of this gene 
may decrease neuronal death and be of use in the treatment of this disease. 

General_screeningjpanel_vl.4 Summary: Ag4144 Highest expression of the 
CG98131-01 gene is detected in testis (CT=29). Therefore, expression of this gene may be 
used to distinguish testis from other samples used in this panel and therapeutic modulation 
of this gene product may be useful in the treatment of disorders associated with testis such 
as fertility and hypogonadism. 

In addition, low expression of this gene is also seen in a CNS cancer, pancreatic 
cancer and an ovarian cancer cell lines. Therefore, therapeutic modulation of this gene 
product may be beneficial in the treatment of these cancers. 
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Panel 4.1D Summary: Ag4144 Highest expression of the CG9813 1-01 gene is 
detected in exclusively in kidney (CT=32). Therefore, expression of this gene may be used 
to distinguish kidney from other samples used in this panel. In addition, therapeutic 
modulation of this gene product may be beneficial in the treatment of autoimmune and 
5 inflammatory diseases that affect kidney including lupus and glomerulonephritis. 

BE, CG98164-01 and CG98164-02: LRR and Kinase Domain Protein 

Expression of gene CG98 164-01 and full length physical clone CG98 164-02 was 
assessed using the primer-probe sets Ag4145 and Ag4145, described in Tables BEA and 
1 0 BEB. Results of the RTQ-PCR runs are shown in Tables BEC, BED, BEE and BEF. 

Please note that CG98 164-02 represents a full-length physical clone of the CG98 164-01 
gene, validating the prediction of the gene sequence. 



Table BEA . Probe Name Ag4145 



iPrimers 

i 


Sequences 


Length 


Stan ; SEQ1D 
Position j No 


[Forward 


5 1 -agtgacagtgagaccgaagaga-3 1 


22 


2340 407~~ 
< 


[Probe jTET-5 1 -cccggaaagcactacctatacaatca-3 » -TAMRA 


26 


2362 


408 
409 


* 

[Reverse 


5 ' -agtgttgtctgtgttggtgaga-3 » 


22 


2406 | 


Table BEB. Probe Name A24145 


Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 1 -agtgacagtgagaccgaagaga-3 1 


22 


2340 


410 


Probe 


TET-5 1 -cccggaaagcactacctatacaatca-3 • -TAMRA 


26 


2362 


411 


Reverse 


5 ' -agtgttgtctgtgttggtgaga-3 ' 


22 


2406 


412 



15 Table BEC . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. 

Exp.(%) 
Ag4145, 
Run 

215300524 


Tissue Name 


Rel. 

Exp.(%) 
Ag4l45, 
Run 

215300524 


AD 1 Hippo 


25.2 


Control (Path) 3 Temporal Ctx 


19.6 


AD 2 Hippo 


33.9 


Control (Path) 4 Temporal Ctx 


72.7 


AD 3 Hippo 


26.4 


AD 1 Occipital Ctx 


23.7 


AD 4 Hippo 


62.4 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


28.1 


AD 3 Occipital Ctx 


6.8 


AD 6 Hippo 


65.1 


AD 4 Occipital Ctx 


55.1 


Control 2 Hippo 


52.5 


AD 5 Occipital Ctx 


17.3 
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Control 4 Hippo 


1 

yj.j 


AL) 0 Occipital Ctx 


8.9 


Control (Path) 3 Hippo 


9R ^ 


Control 1 Occipital Ctx 


7.7 


AD 1 Temporal Ctx 


R 9 


control 2 Occipital Ctx 


29.5 


AD 2 Temporal Ctx 


16 9 


control 3 Occipital Ctx 


19.1 


AD 3 Temporal Ctx 


94 R 


Control 4 Occipital Ctx 


54.7 


AD 4 Temporal Ctx 


4 


control (rath) I Occipital Ctx 


100.0 


AD 5 Inf Temporal Ctx 




control (Path) 2 Occipital Ctx 


27.4 


AD 5 Sup Temporal Ctx 


UU.7 


control (Path) j Occipital Ctx 


io.o 


AD 6 Inf Temporal Ctx 


Id 9 


Control (Path) 4 Occipital Ctx 


15.6 


AD 6 Sup Temporal Ctx 


\)D. J 


Control 1 Parietal Ctx 


16.6 


Control 1 Temporal Ctx 




control 2 Parietal Ctx 


69.3 


Control 2 Temporal Ctx 


30.6 


Control 3 Parietal Ctx 


10.0 


Control 3 Temporal Ctx 


17.0 


Control (Path) 1 Parietal Ctx 


83.5 


Control 3 Temporal Ctx 


i 
1 


19.3 


Control (Path) 2 Parietal Ctx 


28.9 


Control (Path) 1 Temporal Ctx 


90.8 


Control (Path) 3 Parietal Ctx 


29.5 


jControl (Path) 2 Temporal Ctx 




60.3 


Control (Path) 4 Parietal Ctx 


31.9 


Table BED. General scr^ningjpQnoi v | 4 


Tissue Name 


s 

jRel. Exp.(%) 
|Ag4145, Run 
221000454 


Tissue Name 


Rel. 

Exp.(%) 
Ag4145, 
Run 










221000454 


Adipose 


0.5 


Renal ca.TK-10 


0.4 


Melanoma* Hs688(A).T 


0.1 


Bladder 


8.1 


Melanoma* Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI-N87 


3.4 


Melanoma* Ml 4 


6.2 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


4.0 


Colon ca. SW-948 


0.2 


Melanoma* SK-MEL-5 


100.0 


Colon ca. SW480 


0.2 


Squamous cell carcinoma SCC- 
4 


0.0 


Colon ca.* (SW480 met) SW620 


0.3 


Testis Pool 


23.2 


Colon ca. HT29 


0.3 


Prostate ca.* (bone met) PC-3 


0.8 1 


Colon ca. HCT-116 


5.6 


Prostate Pool 


1.5 ( 


Colon ca. CaCo-2 


D.O 


Placenta 


0-0 Colon cancer tissue | 


0.4 


Uterus Pool 


0.7 |Colonca. SW1116 ; 


1.7 


Ovarian ca. OVCAR-3 


1.6 < 


~olon ca. Colo-205 ( 


10 


Ovarian ca. SK-OV-3 


1.4 < 


:olon ca. SW-48 ( 


).0 


Ovarian ca. OVCAR-4 


0.0 ( 


"olon Pool ( 


).9 


Ovarian ca. OVCAR-5 


13 < 


Small Intestine Pool ; 


13 
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Ovarian ca IGROV-1 


1 6 


Stoma ch Pool 


1 ? 


Ovarian ca OVCAR-8 


4.7 


Rone Marrow Pool 


\J.\J 


Ovary 


0.5 


Fetal Heart 


1 fi 

I .u 


Breast ca MCF-7 


2.6 


Heart Pool 


0 4 


Breast ca. MDA-MB-231 


2.8 


Lvmnh Node Ponl 


2 1 


jBreast ca. BT 549 


5.8 


Fetal Skeletal Muscle 


1 2 


iBreast ca. T47D 


0.9 


Skeletal Muscle Pool 


0 7 


i Breast ca. MDA-N 


5.5 


Snleen Pnol 




jBreast Pool 


1.7 


ThviTiiiQ Pool 

I 11 J IJILI J I \J\J I 




Trachea 


1 1.3 


ClSlS cancer ( alio/asrrol I 187-MG 


0 7 

v". / 


iLun° 


0.0 


CNS cancer folio/astro"* U-l 1 R-MG 

\^ail\_<tl \\JI doll \J J \J~ 1 1 O IV1VJ 


I . J 


Fetal Lun<* 


17 6 


CRISIS cancer /neiirn'mef^ SK-M-A^ 


1 ? 

1 .Z 


Lung ca. NCI-N417 


0.0 


CNS cancer f astro 1 SF-S^Q 


0 0 


1 una ca L,X- 1 


0 4 


Pr\IS cancer Castro ^ SNR-7S 

V^mO walled ^aillu/ jliO'i J 




Lima ca NCI-H146 


0 1 


PXIC r a ncer folio \ *n1sJR-10 
\-»l>o Cancel Vcj'**-'/ OiND-iy 


i n 

i -U 


^unn ca SHP-77 


0 8 

v. O 


PlSlS cancer folio'l 'nF-'^OS 
cno cancel iu ) or*-7J 


0 7 


I uno ca 


1 s 


Rrain ( A mvorls* 1 o ^ Pr\r\l 
Diaiii ^/Allijr ^Ualuj rUUI 


l .\j 


j Lun° ca NCI-H5*>6 


0 0 


Rrain fcerehelli im) 


1 ^ 




Lun^ ca NCI-H23 


7 0 


Rrain ffptn1\ 
Ljiaiu \iciaiy 


1 7 


Lun° ca NCI-H460 


0 0 


Rrain ( HinnncamniiQ^ Pool 


1 0 


;Lun<* ca HOP-62 


0.0 


Cerehral Cortev Pool 




Lun^ca NCI-H522 


0.5 


Rrain ^Substantia niara^ Pool 


0 5 


Liver 


0.0 


Rrain fThalamiiQ^ Pool 

i-J 1 UUI i I llalallluj 1 I UUJ 




Fetal Liver 


1.0 


Rrain fwhole^ 

LJl £4.1 1 1 WIlUlwl 


1 4 


Liver ca HenG2 


0.3 


Sninal Cord Pool 

opium \w/Ui vj i 


1 7 


Kidney Pool 


2.3 


Adrenal Gland 


0 6 


Fetal Kidney 


7.0 


Pituitary gland Pool 


3.5 


Renal ca. 786-0 


0.3 


Salivary Gland 


0.8 


Renal ca. A498 


3.4 


Thyroid (female) 


1.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


2.3 


Renal ca. UO-31 


0.9 


Pancreas Pool 


1.7 


Table BEE. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4145, Run 
197470572 


Tissue Name 


Rel. 

Exp.(%) 
Ag4145, 
Run 

197470572 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


7.5 


Secondary Th2 act 


7.3 


HUVEC I FN gamma 


0.0 
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Secondary Trl act 


I23 0 


HU VbC TNF alpha + IFN gamma 


0.0 


Secondary Th 1 rest 


loo 

IV/. Vf 


HU VfcC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


jo.o 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


jo.o 


Lung Microvascular EC none 


11.3 


Primary Thl act 


10.0 


Lung Microvascular EC TNFalpha 
+ IL-1 beta 


0.0 


■X ninary i nz act 


0.0 


Microvascular Dermal EC none 


8.2 


Primary Trl act 


7.9 

1 


Micros vasular Dermal EC 
TNFalpha + IL-1 beta 


1 1 5.0 

! j 


Primary Thl rest 


11.7 


Bronchial epithelium TNFalpha + 
ILlbeta 


30.6 


Primary Th2 rest 


0.0 


Small airway epithelium none 


6.3 


Primary Trl rest 


7.7 


Small airway epithelium TNFalpha 
+ IL-1 beta 


15.5 


CD45RA CD4 lymphocyte act 


31.4 


Coronery artery SMC rest 


14.6 


CD45RO CD4 lymphocyte act 


|40.1 


Coronery artery SMC TNFalpha + 
IL-lbeta 


7 0 

1 




|CD8 lymphocyte act J 1 4,6 


Astrocytes rest 


0.0 


'Secondary CD8 lymphocyte I 
rest 4 " j 16 - 5 


Astrocytes TNFalpha + IL-1 beta 


0.0 


Secondary CDS lymphocyte act lO.O 


KU-8I2 (Basophil) rest 


25.7 
40.6 "~ 




CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) PMA/ionomycin j 


2ryThl/Th2/Trl anti-CD95 
CH11 


8.6 


CCD1 106 (Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD 1 1 06 (Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


LAK cells IL-2 


10.4 


Liver cirrhosis t 


11 9 


LAK cells IL-2+IL-12 


11.6 


NCI-H292 none 


j . j 


LAK cells IL-2+IFN gamma 


13.8 1 


MCI-H292 IL-4 ( 


1 0 

J.\j 




LAK cells IL-2+ IL-1 8 


16.0 1 


MCI-H292IL-9 < 




] 


LAK cells PMA/ionomycin ( 


5.2 ] 


^Cl-H292 IL-13 ( 


).0 


1 


MK Cells IL-2 rest ( 


).0 1 


NC1-H292 IFN gamma f 


i.5 




rwo Way MLR 3 day ( 


).0 I 


IPAECnone ( 


1.0 




rwo Way MLR 5 day ( 


).0 1 


1PAEC TNF alpha + IL-1 beta ( 


..7 


1 


rwo Way MLR 7 day ( 


).0 I 


^ung fibroblast none jc 


.0 


I 


>BMC rest c 


I 


.ung fibroblast TNF alpha + IL-1 j 
eta P 


.1 


f 


'BMC PWM C 


.0 jLung fibroblast IL-4 |o 


.0 


{ 


'BMC PHA-L g 


■4 JLung fibroblast IL-9 0.0 


F 


Lamos (B cell) none 9 


4.6 jLung fibroblast IL-13 jo.O 
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Ramos (B cell) ionomycin 


100.0 


jLung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


jDermal fibroblast CCDI070 rest 


13.4 


B lymphocytes CD40L and IL- 
4 


14.4 


jDermal fibroblast CCD1070 TNF 
jalpha 


0.0 


EOL-1 dbcAMP 


0.0 


bermal fibroblast CCD 1070 IL-I 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


28.3 


Joermal fibroblast IFN gamma 


0.0 

J 


Dendritic cells none 


45.7 


jDermal fibroblast IL-4 


8.3 


Dendritic cells LPS 


16.0 


JDermal Fibroblasts rest 


0.0 


Dendritic cells anti-CD40 


23.5 


(Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


20.2 


Neutrophils rest 


0.0 


Monocytes LPS 


15.7 


Colon 


0.0 


Macrophages rest 


19.1 


Lung 


15.7 


Macrophages LPS 


0.0 


Thymus 


|8.2 


HUVEC none 


0.0 


Kidney 


15.6 


HUVEC starved 


0.0 






Table BEF. General onrolngy «rr^PnmcT p,m^ v -> j 


Tissue Name 

i 


Rel. Exp.(%) 
Ag4145, Run 
268861697 


Tissue Name 


iRel. I 
Exp.(%) 
Ag4I45, 
Run 

268861697 


Colon cancer 1 


18.8 


Bladder cancer NAT 2 


13.6 


Colon cancer NAT 1 


9.0 


B ladder cancer NAT 3 


0.9 


Colon cancer 2 


14.5 


Bladder cancer NAT 4 


8.4 


Colon cancer NAT 2 


1.7 


Adenocarcinoma of the prostate 1 


0.0 


Colon cancer 3 


11.3 


Adenocarcinoma of the prostate 2 


0.0 


Colon cancer NAT 3 


3.2 


Adenocarcinoma of the prostate 3 


17.6 


Colon malignant cancer 4 


17.6 


Adenocarcinoma of the prostate 4 


0.0 


Colon normal adjacent tissue 4 < 


t.6 1 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 ; 


22.1 


Adenocarcinoma of the prostate 6 


1.7 


Lung NAT 1 


1-5 


Adenocarcinoma of the prostate 7 


1.0 


Lung cancer 2 


100.0 


Adenocarcinoma of the prostate 8 ( 


3.0 


Lung NAT 2 


i.7 , 


Adenocarcinoma of the prostate 9 ( 


).0 


Squamous cell carcinoma 3 1 


1.0 I 


5 rostate cancer NAT 1 0 ( 


).2 


Lung NAT 3 a 


1.2 1 


Cidney cancer 1 2 


(0.1 


metastatic melanoma 1 2 


1.9 i 


CidneyNAT 1 i 


18.0 


Melanoma 2 f 


1.2 I 


Cidney cancer 2 A 


17.3 
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Melanoma 3 


3.0 


Kidnev NAT 2 


1Q 1 

17. 1 


metastatic melanoma 4 


0.1 


Kidney cancer 3 


2.9 


metastatic melanoma 5 


0.0 


Kidney NAT 3 


1.7 


Bladder cancer 1 


26.4 


Kidney cancer 4 


0.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


3.3 


Bladder cancer 2 


LI 







CNS_neurodegeneration_vl.O Summary: Ag4145 This panel confirms the 
expression of the CG98 164-01 gene at low levels in the brain in an independent group of 
individuals. This gene appears to be slighdy upregulated in the temporal cortex of 
Alzheimer's disease patients. Therefore, therapeutic modulation of the expression or 



5 function of this gene may decrease neuronal death and be of use in the treatment of this 
disease. 

General_screening_panel_vl.4 Summary: Ag4145 Highest expression of the 
CG98 164-01 gene is seen in a melanoma cell line (CT=30). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as ; 

1 0 marker to detect the presence of melanoma. Furthermore, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of melanoma. 

In addition, this gene is expressed at much higher levels in fetal lung (CT=32.5) 
when compared to expression in the adult counterpart (CT=40). Thus, expression of this 
gene may be used to differentiate between the fetal and adult source of this tissue. 

1 5 General_screening_panel_vl.5 Summary: Ag4 1 45 Results from one 

experiment with the CG98164-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Panel 4.1D Summary: Ag4145 Expression of the CG98 164-01 gene is restricted 
to ionomycin and untreated samples from the B cell line Ramos (CTs=34.5). B cells 

20 represent a principle component of immunity and contribute to the immune response in a 
number of important functional roles, including antibody production. Production of 
antibodies against self-antigens is a major component in autoimmune disorders. Since B 
cells play an important role in autoimmunity, inflammatory processes and inflammatory 
cascades, therapeutic modulation of this gene product may reduce or eliminate the 

25 symptoms of patients suffering from asthma, allergies, chronic obstructive pulmonary 
disease, emphysema, Crohn's disease, ulcerative colitis, rheumatoid arthritis, psoriasis, 
osteoarthritis, systemic lupus erythematosus and other autoimmune disorders. Two 
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additional experiments with the same probe and primer show low/undetectable levels of 
expression (CTs>35). (Data not shown.) 

General oncology screening panel_v_2.4 Summary: Ag4145 Highest 
expression of the CG98164-01 gene is detected in lung cancer (CT=29). Significant 
expression of this gene is seen in number of cancer samples including colon, lung, 
adenocarcinoma of prostate, bladder and kidney cancers. . Thus, the expression of this 
gene could be used to distinguish colon, lung and prostate cancers from the normal tissues. 
Therapeutic modulation of this gene product may be useful in the treatment of these 
cancers. 

BF. CG99588-01: Novel Transmembrane Protein 

Expression of gene CG99588-01 was assessed using the primer-probe set Ag4148, 
described in Table BFA. Results of the RTQ-PCR runs are shown in Tables BFB. BFC 
BFD and BFE. 



Table BFA . Probe Name Ag4148 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 1 -ggtcactgtggtgaagagtga-3 ' 


21 


. 12 


-■—-J 


Probe 


TET-5 1 -acccaaactggtgccgttcttcaag-3 1 


-TAMRA 


25 


36 


414 | 


Reverse 


5 1 -cagccagagcacaaaatacac-3 • 


21 


70 


415 



Table BFB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
215309158 


Tissue Name 


Rel. Exp.(%) 
Ag4l48, Run 
215309158 


AD 1 Hippo 


24.5 


Control (Path) 3 Temporal Ctx 


9.0 


AD 2 Hippo 


42.9 


Control (Path) 4 Temporal Ctx 


49.0 


AD 3 Hippo 


8.1 


AD 1 Occipital Ctx 


27.2 


AD 4 Hippo 


16.7 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


95.3 


AD 3 Occipital Ctx 


8.6 


AD 6 Hippo 


72.2 


AD 4 Occipital Ctx 


29.1 


Control 2 Hippo 


36.6 


AD 5 Occipital Ctx 


38.7 


Control 4 Hippo 


30.8 


AD 6 Occipital Ctx 


26.1 


Control (Path) 3 Hippo 


9.9 


Control 1 Occipital Ctx 


2.9 


AD 1 Temporal Ctx 


51.8 


Control 2 Occipital Ctx 


64.6 ' 


AD 2 Temporal Ctx 


49.0 


Control 3 Occipital Ctx • 


40.6 


AD 3 Temporal Ctx 


11.4 


Control 4 Occipital Ctx 


16.8 
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/\u *f i emporai ctx 


52.5 


Control (Path) 1 Occipital Ctx 


60.3 


t\u j ini l emporai ctx 


100.0 


Control (Path) 2 Occipital Ctx 


25.5 


f\u j oiip 1 emporai ctx 


o/.o 


Control (Path) 3 Occipital Ctx 


3.3 


r\.L/ o mi 1 emporai ctx 


o5.1 


Control (Path) 4 Occipital Ctx 


27.0 


AD 6 Sup Temporal Ctx 


59.9 


Control 1 Parietal Ctx 


11.2 


Control 1 Temporal Ctx 


18.9 


Control 2 Parietal Ctx 


64.6 


Control 2 Temporal Ctx 


40.6 


Control 3 Parietal Ctx 


19.5 


Control 3 Temporal Ctx 


28.3 


Control (Path) 1 Parietal Ctx 


71.7 


Control 3 Temporal Ctx 


23.5 


Control (Path) 2 Parietal Ctx 


34.9 


Control (Path) 1 Temporal 
Ctx 


63.3 


Control (Path) 3 Parietal Ctx 


10.9 


Control (Path) 2 Temporal 
Ctx 


53.2 


Control (Path) 4 Parietal Ctx 


50.0 


Table BFC. General screening pan^l vl 4 


Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
222181956 


Tissue Name 


Rel. Exp.(%) 
Ag4l48, Run 
222181956 


Adipose 


19.9 


Renal ca.TK- 10 |7.6 


Melanoma* Hs688(A).T 


5-9 


Bladder 157.4 


Melanoma* Hs688(B).T 


3.4 


Gastric ca. (liver met.) NCI-N87 


32.3 


Melanoma* MI4 


6.4 


Gastric ca. KATO 111 


0.0 


Melanoma* LOXIMVI 


4.8 


Colon ca. SW-948 


3.3 


Melanoma* SK-MEL-5 


17.2 jColon ca. SW480 


8.3 


Squamous cell carcinoma 
SCC-4 


6.4 Jcolon ca* (SW480 met) SW620 


1.7 


Testis Pool 


12.1 jcolon ca. HT29 


3.4 


Prostate ca.* (bone met) 
PC-3 


13.3 Jcolonca.HCT-116 


3.6 


Prostate Pool 


2.6 jcolon ca. CaCo-2 


1.8 


Placenta 


37.4 jColon cancer tissue 


32.5 


Uterus Pool 


8.8 JCoIonca. SW1116 


3.5 


Ovarian ca. OVCAR-3 


17.8 |Colon ca. Colo-205 


1.5 


Ovarian ca. SK-OV-3 


5.9 jcolon ca. SW-48 


5.8 


Ovarian ca. OVCAR-4 


8.2 Jcolon Pool 


19.2 


Ovarian ca. OVCAR-5 


33.7 JSmall Intestine Pool 


22.2 


Ovarian ca. IGROV-1 


7.8 jstomach Pool 


16.5 


Ovarian ca. OVCAR-8 


22.4 JBone Marrow Pool 


10.5 


Ovary 


58.2 jFetal Heart 


6.9 


Breast ca. MCF-7 


4.5 |HeartPool jll.9 
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Breast ca. MDA-MB-231 


3.6 


iLvmoh Node Pool 


14 6 


Breast ca. BT 549 


1.2 


|Fetal Skeletal Muscle 


3.7 


Breast ca. I4/U 


100.0 


(Skeletal Muscle Pool 


3.8 


Breast ca. MDA-N 


8.1 


jSpleen Pool 


20.6 


Breast Poo! 


16.4 


(Thymus Pool 


15.6 


i 

Trachea 


11.5 


|CNS cancer (glio/astro) U87-MG 


7.3 


i 

jLung 

i 


5.4 


[CNS cancer (glio/astro) U- 1 1 8- 
jMG 


0.6 


Fetal Lung 


14.8 


CNS cancer (neuro;met) SK-N- 

|as 


26.6 


Lung ca.NCI-N417 


1.1 


|CNS cancer (astro) SF-539 


8 8 


Lungca. LX-1 


6.7 


CNS cancer (astro) SNB-75 


97 0 


Lung ca. NCI-H146 


0.2 


CNS cancer (<*lio) SNB-19 


4 7 


Lung ca. SHP-77 


2.8 


CNS cancer fcrlio^ SF-295 




Lung ca. A549 


1.0 


Brain (Amv»dala^ Pnol 


I J.O 


Lung ca. NCI-H526 


6.3 


Brain f cerebellum I 


JO.7 


Lung ca. NCI-H23 


2.5 


Brain (TetaH 

*— * 1 CA 111 1 LCI I § 


^0 1 i 


jLunsca. NCI-H460 


0.5 


Brain fHinnocamnnO Pool 


n ? ! 


Lungca. HOP-62 


1.0 


Cerebral Cortex Pool 


16 n i 


Lung ca. NCI-H522 


28.7 


Brain (Substantia ni^ra^ Pnol 


19 Q ! 


Liver 


3.4 


Brain (Thalamus) Pool 


21 0 


Fetal Liver 


12.1 


Brain (whole) 




Liver ca. HepG2 


0.6 


Spinal Cord Pool 


18 9 

t u . z. 


Kidney Pool 


28.1 


Adrenal Gland 


39 S 


Fetal Kidney 


9.3 


Pituitary gland Pool 


5.2 


Renal ca. 786-0 


3.1 


Salivary Gland 


8.4 


Renal ca. A498 


11.3 


Thyroid (female) 


10.3 


Renal ca. ACHN 


10.4 


Pancreatic ca. CAPAN2 


1.8 


Renal ca. UO-31 


11.8 


Pancreas Pool 


24.0 


Table BFD. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4148,Run 
173123939 


Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
173123939 


Secondary Thl act 


0.3 


HUVEC IL-1 beta 


1.4 


Secondary Th2 act 


6.4 


HUVEC I FN gamma 


18.6 


Secondary Trl act 


1.1 


HUVEC TNF alpha + IFN 
gamma 


3.1 


Secondary Thl rest 


5.7 


HUVEC TNF alpha + IL4 


1.9 


Secondary Th2 rest 


3.4 


HUVEC IL- 11 


2.2 
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Secondary Trl rest 


11.2 


Lung Microvascular EC none 


10.8 


rnmary Tnl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


16.6 


Primary Th2 act 


4.2 


Microvascular Dermal EC none 


7.3 


rnmary 1 ri act 


1 A 

1.4 


Microsvasular Dermal EC 
TNFalpha + 1L-I beta 


3.6 


Primary Th ! rest 


l.i 


Bronchial epithelium TNFalpha 
+ ILIbeta 


6.0 
16.7 


Primary Th2 rest 


4.0 


Small airway epithelium none 


Primary Trl rest 


4.2 


Small airway epithelium 
TNFalpha + 1L-1 beta 


12.9 


CD45RA CD4 
lymphocyte act 


1.2 


Coronery artery SMC rest 


5.7 


CD45RO CD4 
lymphocyte act 


2.7 


Coronery artery SMC TNFalpha 
+ IL-1 beta 


10.5 


CDS lymphocyte act 


, 

2.4 


Astrocytes rest 




Secondary CD8 
lymphocyte rest 


2.1 


Astrocytes TNFalpha + IL-1 beta 


1.1 


Secondary CDS 
lymphocyte act 


0.0 


KU-81 2 (Basophil) rest 


0.0 i 

i 


CD4 lymphocyte none 


1.4 


KU-8 12 (Basophil) 
PMA/ionomycin 


... j 

0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


8.1 


CCD1 106 (Keratinocytes) none 


8.7 


LAK ceils rest 


17.1 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-1 beta 




LAK cells IL-2 


4.5 


Liver cirrhosis 


7.3 


LAK cells IL-2+IL-12 


0.0 


NCI-H292 none 


20.3 


LAK cells 1L-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 




LAK cells IL-2+ IL-1 8 


0.0 


NCI-H292 IL-9 


28.3 


LAK cells 
PMA/ionomycin 


1.0 


NCI-H292 IL-1 3 


21.3 


NK Cells IL-2 rest 


3.7 


NCI-H292 I FN gamma 


19.5 


Two Way MLR 3 day 


9.5 


HPAEC none 


3.4 


Two Way MLR 5 day 


2.3 


HPAEC TNF alpha + IL-1 beta 


4.9 


i wu way iviL/i\ / uay 


ft o 


I X*" L 1_l 

Lung fibroblast none 


3.2 


PBMC rest 


3.6 


Lung fibroblast TNF alpha + IL- 
1 beta 


0.0 


PBMC PWM 


1.7 


Lung fibroblast IL-4 


1.7 


PBMC PHA-L 


3.0 


Lung fibroblast IL-9 


1.1 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-1 3 


1.0 
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Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest 


3.7 


B lymphocytes CD40L 
and IL-4 


2.8 


Dermal fibroblast CCD1070 
TNF alpha 


7.2 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL-1 
beta 


2.8 


EOL-1 dbcAMP 
PM A/ion omyc in 


2.0 


Dermal fibroblast IFN gamma 


1.4 


Dendritic cells none 


71.7 


Dermal fibroblast II -4 


0 0 


Dendritic cells LPS 


8.7 


Dprmjil FihrnKlactc r^ct 


1 Q 


Dendritic cells anti-CD40 


81.8 


NentrnnhiU TNFa+T 


fi n 


Monocytes rest 


18.8 


Neutrophils rest 


0.0 


Monocytes LPS 


8.4 


Colon 


8.8 


Macrophages rest 


56.6 


Lung 


47.3 


Macrophages LPS 


1.0 


Thymus 


22.8 j 


HUVEC none 


2.2 


Kidney 


100.0 


HUVEC starved 


12.2 


! ! 


Table BFE. General oncology screening panel v 2A 


Tissue Name 


Re!. Exp.(%) 
Ag4148, Run 
268623908 


jRel. Exp.(%)i 

Tissue Name lAg4 1 48. Run j 

|268623908 j 


Colon cancer 1 


12.6 


Bladder cancer NAT 2 jo.3 


Colon NAT 1 


7.6 


Bladder cancer NAT 3 


0.1 


Colon cancer 2 


19.1 


Bladder cancer NAT 4 


2.1 


Colon cancer NAT 2 


5.1 


Adenocarcinoma of the prostate 
1 


24.0 


Colon cancer 3 


23.0 


Adenocarcinoma of the prostate 
2 


1.3 


Colon cancer NAT 3 


11.9 


Adenocarcinoma of the prostate 
3 


10.6 


Colon malignant cancer 4 


30.1 


Adenocarcinoma of the prostate 
4 


21.8 


Colon normal adjacent 
tissue 4 


4.1 


Prostate cancer NAT 5 


1.1 


Lung cancer 1 


13.3 


Adenocarcinoma of the prostate 
6 


5.4 


Lung NAT 1 


5.0 


Adenocarcinoma of the prostate 
7 


7.9 


Lung cancer 2 


36.6 


Adenocarcinoma of the prostate 
8 


1.6 


Lung NAT 2 


3.0 


Adenocarcinoma of the prostate 


20.0 
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9 




Squamous cell carcinoma 
3 


22.8 


Prostate cancer NAT 10 


0.4 


Lune NAT 3 


I 7 


Kidney cancer 1 


38.4 


metastatic mplannm« 1 

lllvLUOlCllIU 1 i i^iai l\Jl 1 Id 1 


XX 0 


ly ! J_ „_ .XT A TP 1 

IvidneyNAT 1 


24.0 


iMelanoma 2 




Kidney cancer 2 


66.9 


Melanoma 3 


2.6 


Kidney NAT 2 




metastatic melanoma 4 


96.6 


Kidney cancer 3 


17.3 


metastatic melanoma 5 


100.0 


Kidney NAT 3 


19.2 


Bladder cancer 1 


0.8 


Kidney cancer 4 


8.8 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


38.4 


Bladder cancer 2 


4.6 







CNS_neurodegeneration_vl.O Summary: Ag4148 This panel confirms the 
expression of the CG99588-01 gene at low levels in the brain in an independent group of 
individuals. This gene is found to be slightly upregulated in the temporal cortex of 
Alzheimer's disease patients. Therefore, therapeutic modulation of the expression or 
function of this gene may decrease neuronal death and be of use in the treatment of this 



disease. 

General_screening_panel_vl.4 Summary: Ag4148 Highest expression of the 
CG99588-01 gene is detected in breast cancer cell line (CT=30.41). Significant expression 
of this gene is also seen in cluster of cancer cell lines including CNS, colon, gastric, renal, 
lung, breast, ovarian, prostate, squamous cell carcinoma, and melanoma cell lines. 
Therefore, therapeutic modulation of this gene product may be useful in the treatment of 
these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at low 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, heart, liver 
and the gastrointestinal tract. Therefore, therapeutic modulation of the activity of this gene 
may prove useful in the treatment of endocrine/metabolically related diseases, such as 
obesity and diabetes. 

In addition, this gene is expressed at moderate levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 
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Ag4148 Results from one experiment (run 220982871) with the this gene are not 
included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 4.1D Summary: Ag4148 Highest expression of the CG99588-01 gene is 
detected in kidney (CT=30.3). In addition, moderate to low levels of expression of this 
gene is also seen primary and secondary Till , Th2 and Trl cells, LAK cells, dendritic 
cells, monocytes, macrophages, endothelial cells, bronchial and small airway epithelial 
cells, coronery artery SMC NCI-H292, astrocytes and normal tissues represent by colon, 
lung, and thymus. Therefore, therapeutic modulation of the gene product may lead to the 
alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

General oncology screening paneljv_2.4 Summary: Ag4148 Highest 
expression of the CG99588-01 gene is detected in metastic melanoma (CT=29). In 
addition, significant expression of this gene is also seen in number of cancer samples 
including kidney, colon, adenocarcinoma of prostate, lung and bladder cancer. Therefore, 
therapeutic modulation of this gene product through the use of small molecule drug may 
be beneficial in the treatment of these cancers. 

BG, CG99618-01: Protein-Tyrosine Phosphatase 2C 

Expression of gene CG996 18-01 was assessed using the primer-probe set Ag4151, 
described in Table BGA. Results of the RTQ-PCR runs are shown in Tables BGB, BGC 
and BGD. 



Table BGA . Probe Name Ag4151 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -catcatgggcaattaaaagaga-3 1 


22 


258 


416 


Probe 


TET-5 * -aaaatcctctgaactgtgcagatcct-3 ' -TAMRA 


26 


304 


417 


Reverse 


5 • -catgaaaccacctttgagaagt-3 • 


22 


330 


418 



Table BGB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
215318747 


Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
215318747 


AD 1 Hippo 


3.7 


Control (Path) 3 Temporal Ctx 


2.0 
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AD 2 Hippo 


6.1 


v^uiiiroi \rainj *f 1 emporai t^tx 


*3A O 
3U.5 


AD 3 Hippo 


0.7 


Avi-/ i uicipiiai v^ix 


T) ft 


AD 4 Hippo 


3.5 


wucipiiai l^lx ^ivnssingj 


A A 
U.U 


AD 5 hippo 


100.0 


AD 1 ("Vrinitsil Pfv 


*2 O 
-5.0 


AD 6 Hippo 

1 — 1 ' — 


12.4 




Kft 

i:>.v 


iControl 2 Hippo 


7.3 


ADS Orriruto! f^fv 


0.2 


Control 4 Hippo 


k.o 


rVL/ U WCCipiiai V^lX 


1 "7 ^ 

1 /.2 


Control (Path) 3 Hippo 


0.0 


v^uiiiroi I wccipuai l*ix 


1 .1 


■AD 1 Temporal Ctx 


13.7 


Control 2 Occipital Ctx 


32.1 


jAD 2 Temporal Ctx 


14.1 


Control 3 Occipital Ctx 


7.3 


|AD 3 Temporal Ctx 


3.2 


Control 4 Occipital Ctx 


4.9 


,AD 4 Temporal Ctx 


17.6 


Control (Path) 1 Occipital Ctx 


49.3 


|AD 5 Inf Temporal Ctx 


90.8 


Control (Path) 2 Occipital Ctx 


11.3 


AD 5 SupTemporal Ctx 


16.8 


Control (Path) 3 Occipital Ctx (o.O 


.AD 6 Inf Temporal Ctx 


0.0 jControl (Path) 4 Occipital Ctx 


19.1 


j AD 6 Sup Temporal Ctx 


37.6 


Control 1 Parietal Ctx 


2.8 


Control 1 Temporal Ctx 


4.8 


Control 2 Parietal Ctx 


26.2 


'Control 2 Temporal Ctx 


8.5 


Control 3 Parietal Ctx 


12.4 


Control 3 Temporal Ctx 


7.9 


Control (Path) 1 Parietal Ctx 


35.8 


^Control 4 Temporal Ctx j 


11.2 


Control (Path) 2 Parietal Ctx 


13.0 


Control (Path) I Temporal 
jCtx 


28.1 


Control (Path) 3 Parietal Ctx 


0.0 


IControl (Path) 2 Temporal 
Ctx 


31.4 


Control (Path) 4 Parietal Ctx 


31.0 



Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
221034281 


Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
221034281 


Adipose 


0.5 


Renal ca.TK-10 


0.0 


Melanoma* Hs688(A).T 


0.9 


Bladder 


1.5 


Melanoma* Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI-N87 


0.8 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


0.0 


Colon ca. SW-948 


0.2 


Melanoma* SK-MEL-5 


100.0 


Colon ca. SW480 


0.0 


Squamous cell carcinoma 
SCC-4 


0.0 


Colon ca.* (SW480 met) SW620 


0.0 


Testis Pool 


4.2 


Colon ca. HT29 


0.0 


Prostate ca.* (bone met) 
PC-3 


0.4 


Colon ca.HCT-1 16 


0.3 
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Prostate Pool \42 |colon ca. CaCo-2 


0.1 


Placenta 1 1 .5 JColon cancer tissue 


0.4 


Uterus Pool jo.7 (Colon ca. S W 1 1 1 6 


0.1 


Ovarian ca. OVCAR-3 


11 JColon ca. CoIo-205 


0.0 


Ovarian ca. SK-OV-3 jo.5 |Colon ca. SW-48 


0.0 


jOvarian ca. OVCAR-4 


0.5 jCoIon Pool 


4.2 


Ovarian ca. OVCAR-5 


0-4 [Small Intestine Pool 


2.2 * 


Ovarian ca. IGROV-1 


0.0 


Stomach Pool 


0.9 


Ovarian ca. OVCAR-8 


1.5 


Bone Marrow Pool 


1.0 


Ovary 


3.0 


Fetal Heart 


4.5 


Breast ca. MCF-7 


0.1 


Heart Pool 


1.0 


Breast ca. MDA-MB-23I 


0.5 


Lymph Node Pool 


2.1 


Breast ca. BT 549 


3.5 


Fetal Skeletal Muscle 


2.3 


Breast ca. T47D 


0.2 


Skeletal Muscle Pool 


1.6 


Breast ca. MDA-N 


0.2 


Spleen Pool 


i.i 


Breast Pool 


1-6 (Thymus Pool 


4.0 i 


Trachea 


1.9 


CNS cancer (glio/astro) U87-MG 


I b78" " 1 


Lung 


5.6 


CNS cancer (glio/astro) U-l 1 8-MG 


1.4 ! 


Fetal Lung 


4.1 


CNS cancer (neuro;met) SK-N-AS |0.2 j 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 1 


Lungca. LX-i jO.O 


CNS cancer (astro) SNB-75 


1 

5.4 


Lungca.NCI-H146 


0.0 


CNS cancer (glio) SNB-19 


0.5 


Lung ca. SHP-77 


0.2 


CNS cancer (glio) SF-295 


2.3 


Lung ca. A549 


0.2 


Brain (Amygdala) Pool 


4.3 


Lungca. NCI-H526 


0.0 


Brain (cerebellum) 


0.9 


Lung ca. NCI-H23 


1.2 


Brain (fetal) 


2.6 


Lungca.NCI-H460 


0.5 


Brain (Hippocampus) Pool 


3.8 


Lung ca. HOP-62 


1,5 


Cerebral Cortex Pool 


8.5 


Lungca. NCI-H522 


0.0 


Brain (Substantia nigra) Pool 


3.9 


Liver 


0.0 


Brain (Thalamus) Pool 


13.6 


Fetal Liver \ 


3.2 


Brain (whole) ; 


2.4 


Liver ca. HepG2 { 


10 


Spinal Cord Pool ; 


1.5 


Kidney Pool 1 


1.7 


Adrenal Gland 


1.5 


Fetal Kidney ; 


>.3 ] 


Pituitary gland Pool 


.9 


Renal ca. 786-0 ; 


1.2 ! 


Salivary Gland ( 


).2 


Renal ca. A498 ( 


).0 


rhyroid (female) ( 


).0 


Renal ca. ACHN ] 


.1 1 


3 ancreatic ca. CAPAN2 ( 


(.5 


Renal ca. UO-3 1 1 


.0 jPancreas Pool 2 


!.l 
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Table BGD. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
173124788 


Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
173124788 


Secondary Th 1 act 


0.0 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


4.6 


HUVEC I FN gamma 


2.6 


Secondary Trl act j0.9 


HUVEC TNF alpha + IFN gamma 


0.0 


Secondary Thl rest |0.0 


HUVEC TNF alpha + 1L4 


0.0 


Secondary Th2 rest ! 0.0 


HUVEC IL-1I 


0.0 


Secondary Trl rest !3.2 


Lung Microvascular EC none 


3.3 


Primary Th I act jO.O 


Lung Microvascular EC TNFalpha 
+ IL-Ibeta 


0.0 


Primary Th2 act jo.O 


Microvascular Dermal EC none 


0.0 


Primary Trl act jo.O 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


0.0 


Primarv Th 1 rest |o 0 (Bronchial epithelium TNFalpha + 

1 ' (ILlbeta 


16.5 


Primary Th2 rest ;0.0 iSmal 1 airway epithelium none 


1.0 


Primarv Trl rest ! 0 0 (Small airway epithelium TNFalpha 

!+IL-lbeta 


0.0 


CD45RA CD4 lymphocyte 
act 


|o.o 

i 


Coronery artery SMC rest 


0.0 


CD45RO CD4 lymphocyte 
act 


0.0 


Coronery artery SMC TNFalpha + 
IL-lbeta 


0.0 


CD8 lymphocyte act , 


0.0 


Astrocytes rest 


6.1 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL- 1 beta 


2.9 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


2.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) PMA/ionomycin 


3.4 


2ryThl/Th2/Trl anti- 
CD95 CH11 


2.3 


CCD 1 1 06 (Keratinocytes) none 


1.8 


LAK cells rest 


0.0 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


32.1 


LAK cells IL-2 


4.0 


Liver cirrhosis 


O.O 


LAK cells IL-2+IL-12 


0.0 


NCI-H292 none 


1.8 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


J .2 


LAK cells IL-2+IL- 18 


0.0 


NCI-H292 IL-9 


1.8 


LAK cells 
PMA/ionomycin 


0.0 ] 


NCI-H292IL-13 ( 


).0 
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NK Cells IL-2rest 


0.0 


NCI-H292 IFN gamma 


9.3 


Two Way MLR 3 day 


0.5 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


5.8 


PBMC rest 


2.0 


Lung Fibroblast TNF alpha + IL-1 
beta 


4.1 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast 1L-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast 1L-13 


4.6 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma 


1.7 


R Ivm nil nrvt<*Q PWM 


5 5 


Dermal fibroblast CCD 1 070 rest 


12.4 


R ivmnhncvte^ CD40I and 
IL-4 


0.0 


Dermal fibroblast CCD 1 070 TNF 

L/ wl 111 Li. 1 11 L/l UU 1 CIO L. V_v 1 \s 1 \J 1 111 

alpha 


2.3 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL-1 
beta 


n ft 

\J.V 


EOL-1 dbcAMP 
PMA/ionomycin 


11.1 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


n A 

2.9 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


A A 

0.0 


Dendritic cells anti-CD40 iO.O 


Neutrophils TNFa+LPS 


A A 

0.0 


Monocytes rest 


2.6 


Neutrophils rest 


u.u 


Monocytes LPS 


0.6 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


3.1 


Macrophages LPS 


0.0 


Thymus 


14.6 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag4151 This panel confirms the 
expression of the CG9961 8-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 



5 experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screeningjpanel_vl.4 Summary: Ag41 5 1 Highest expression of the 
CG99618-01 gene is detected in melanoma SK-MEL-5 cell line (CT=29.6). Therefore, 
expression of this gene can be used to distinguish this sample from other samples in the 
10 panel. Low expression of this gene is also detected in a breast cancer and a CNS cancer 

* 

717 



WO 03/010327 



PCT/US02/14199 



cell lines. Therefore, therapeutic modulation of this gene product may be useful in the 
treatment of melanoma, breast and CNS cancers. 

In addition, this gene is expressed at low levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, cerebral 
cortex, and spinal cord. Therefore, this gene may play a role in central nervous system 
disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, 
schizophrenia and depression. 

Panel 4.1D Summary: Ag4151 Highest expression of the CG99618-01 gene is 
detected in kidney (CT=32.4). Therefore, expression of this gene can be used to 
distinguish kidney from other samples in this panel. In addition, therapeutic modulation of 
this gene product may be useful in the treatment of inflammatory and autoimmune 
diseases that affect kidney such as lupus and glomerulonephritis. 

Low levels of expression of this gene is also seen in TNFalpha + IL-lbeta treated 
keratinocytes. Interestingly, this expression in treated cells is higher (CT=34) as compared 
to the untreated keratinocytes (CT=38). Therefore, expression of this gene can be used to 
distinguish the treated from untreated keratinocytes. In addition, therapeutic modulation of 
this gene product may be useful in the treatment of psoriasis and wound healing. 

BH. CG99832-01: Novel Gene Containing NUDIX Hydrolase Domain 

Expression of gene CG99832-01 was assessed using the primer-probe set Ag4157, 
described in Table BHA. Results of the RTQ-PCR runs are shown in Tables BHB, BHC, 
BHD and BHE. 



Table BHA . Probe Name Ag4157 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 » -agcttgctcgtttgtacatcat-3 • 


22 


543 


419 


Probe 


TET-5 9 -tccaggaattccaaaagacacaaaat-3 1 -TAMRA 


26 


565 


420 


Reverse 


5 1 -cactcaatgttccgaatttctc-3 1 


22 ; 


609 


421 



Table BHB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
215331739 


Tissue Name 


Rel. Exp.(%) 
Ag4157,Run 
215331739 


AD 1 Hippo 


11.7 


Control (Path) 3 Temporal Ctx 


5.7 


AD 2 Hippo 


24.8 


Control (Path) 4 Temporal Ctx 


37.1 
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AD 3 Hinno 


7 5 


AD 1 Orrinital Ctx 


IS 1 


AD 4 Hinno 


5.7 


AD 0 Or>dnita1 Ctx fN4i<»^irn7^ 


0.0 


AD 5 hiooo 


78.5 


AD 3 OcciDital Ctx 


4.0 


AD 6 Hiddo 


52.9 


AD 4 Occioital Ctx 


15 6 


Control 2 Hippo 


28.3 


AD 5 Occioital Ctx 


31.6 


Control 4 Hippo 


8.2 


AD 6 Occioital Ctx 

/ 1 v/ V/vv 1 ft*/ 1 LCI 1 Li V 


41.2 


Control (Path) 3 Hippo 


10.6 


Control 1 Occipital Ctx 


3.8 


AD 1 Temporal Ctx 


14.7 


Control 2 Occinital Ctx 


52.1 


AD 2 Temporal Ctx 


31.2 


Control 3 Occioital Ctx 


12.4 


AD 3 Temooral Ctx 

* % m Will IV \J I t* 1 X-^ 


4.7 


Control 4 Occinital Ctx 


6 7 


AD 4 Temporal Ctx 


20.4 


Control (Path^ 1 Occinital Ctx 


79 6 


AD 5 Inf Temooral Ctx 


100.0 


Control f Path^ 1 Occinital Ctx 


9 7 


AD 5 SunTemDoral Ctx 


40.6 


Control fPath^ 3 Occinital Ctx 




AD 6 lnf Temooral Ctx 

JlL/ V/ 1111 1 VlllLA/IUl V— ' u\ 


63.3 


Control f PanM 4 Ocrinital Ptx 




AD 6 Sup Temporal Ctx 


66.0 


Control 1 Parietal Ctx 


6.4 


Control 1 Temporal Ctx 


5.6 


Control 2 Parietal Ctx 


38.7 


Control 2 Temporal Ctx 


40.6 


Control 3 Parietal Ctx 


17.2 


Control 3 Temporal Ctx 


15.5 


Control (Path) 1 Parietal Ctx 


79.6 


Control 4 Temporal Ctx 


5.0 


Control (Path) 2 Parietal Ctx 


28.9 


Control (Path) 1 
Temporal Ctx 


70.7 


Control (Path) 3 Parietal Ctx 


3.6 


Control (Path) 2 
Temporal Ctx 


34.6 


Control (Path) 4 Parietal Ctx 


40.9 



Table BHC . General screening_panel vL4 



Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
221117670 


Tissue Name 


Rel. Exp.(%) 
Ag4157,Run 
221117670 


Adipose 


18.2 


Renal ca.TK- 10 


28.1 


Melanoma* Hs688(A).T 


14.1 


Bladder 


23.8 


Melanoma* Hs688(B).T 


12.1 


Gastric ca. (liver met.) NCI-N87 


23.5 


Melanoma* M14 


32.3 


Gastric ca. KATO 111 


24.0 


Melanoma* LOXIMVI 


21.5 


Colon ca. SW-948 


6.1 


Melanoma* SK-MEL-5 


41.8 


Colon ca. SW480 


21.8 


Squamous cell 
carcinoma SCC-4 


11.7 


Colon ca.* (SW480 met) SW620 


14.4 


Testis Pool 


8.3 


Colon ca. HT29 


18.4 • 


Prostate ca.* (bone met) 
PC-3 


13.5 


Colon ca.HCT-1 16 


51.8 


Prostate Pool 


9.4 


Colon ca. CaCo-2 


80.1 



719 



WO 03/010327 



PCT/US02/14199 



r lacenia 




Colon cancer tissue 


14.7 


Uterus Pool 


8.3 


Colon ca. SWl llo 


2.7 


Ovarian ca. OVCAR-3 


31.2 


Colon ca. Colo-205 


3.7 


Ovarian ca. SK-OV-3 


25.3 


Colon ca. bW-48 


4.5 


Ovarian ca. OVCAR-4 


14.3 


colon rool 


17.3 


Ovarian ca. OVCAR-5 


32.5 


Small Intestine Pool 


17.3 


Ovarian ca. IGROV-I :I5.8 


Stomach Pool 


8.9 


Ovarian ca. OVCAR-8 jll.6 


Bone Marrow Pool 


10.2 


Ovary jll.4 


Fetal Heart 


12.5 


Breast ca. MCF-7 


53.6 


Heart Pool 


7.6 


Breast ca. MDA-MB- 
231 


34.6 


Lymph Node Pool 


17.7 


Breast ca. BT 549 34.6 


Fetal Skeletal Muscle 


9.8 


Breast ca. T47D S l0 ° 0 


Skeletal Muscle Pool 


8.8 


Breast ca. MDA-N 16.7 


Spleen Pool 


16.4 


Breast Pool jl8.6 


Thymus Pool 


18.4 


Trachea il 1.0 


CNS cancer (glio/astro) U87-MG 


15.3 


Lung |7.4 


CNS cancer (glio/astro) U-l 18- 
MG 


20.6 


Fetal Lung !45.7 


CNS cancer (neuro;met) SK-N-AS 


27.2 


Lungca.NCl-N417 |9.6 


CNS cancer (astro) SF-539 


15.6 


Lung ca. LX-1 


24.1 


CNS cancer (astro) SNB-75 


31.4 


Lung ca. NCI-H146 


17.1 


CNS cancer (glio) SNB-19 


17.2 


Lung ca. SHP-77 


48.3 


CNS cancer (glio) SF-295 


21.5 


Lung ca. A549 


14.1 


Brain (Amygdala) Pool 


10.5 


Lung ca. NCI-H526 


13.7 


Brain (cerebellum) 


18.3 


Lung ca. NCI-H23 


32.1 


Brain (fetal) 


48.3 


Lung ca. NC1-H460 


10.8 


Brain (Hippocampus) Pool 


13.7 


Lung ca. HOP-62 


18.3 


Cerebral Cortex Pool 


17.9 


Lung ca. NCI-H522 


36.3 


Brain (Substantia nigra) Pool 


12.6 


Liver 


1.7 


Brain (Thalamus) Pool 


20.7 


Fetal Liver 


43.2 


Brain (whole) 


19.2 


Liver ca. HepG2 


17.8 


^ninal CnrA Pr\ol 


o.z 


Kidney Pool 


27.0 


Adrenal Gland 


16.2 


Fetal Kidney 


55.5 


Pituitary gland Pool 


4.5 


Renal ca. 786-0 


27.9 


Salivary Gland 


5.4 


Renal ca. A498 


4.9 


Thyroid (female) 


2.8 


Renal ca. ACHN 


8.5 


Pancreatic ca. CAPAN2 


30.4 
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Renal ca. UO-31 



12.2 



Pancreas Pool 



|l6 8 



Table BHD. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4157,Run 
173123943 


Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
173123943 


Secondary Thl act 


55.1 


HUVECIL-lbeta 


20.4 


Secondary Th2 act 


54.7 


HUVEC I FN gamma 


28.7 


Secondary Trl act 142.9 


HUVEC TNF alpha + I FN gamma 


15.6 


Secondary Th 1 rest 23.7 


HUVEC TNF alpha + IL4 


12.9 


Secondary Th2 rest J33.9 


ill I\/C/-* II 11 

HUVbC IL-1 1 


24.0 


Secondary Trl rest 


23.7 


Lung Microvascular EC none 


24.7 


Primary Th 1 act 


29.7 


Lung Microvascular EC TNFalpha 
+ IL-1 beta 


19.5 


Primary Th2 act 


44.8 


Microvascular Dermal EC none 


11.1 


Primary Trl act 


36.1 


Microsvasular Dermal EC 
TNFalpha + I L-l beta 


11.7 


Primary Thl rest |38.4 


Bronchial epithelium TNFalpha + 

11 1 Us>+*-> 

I L I beta 


19.9 


Primary Th2 rest i44.4 


Small airway epithelium none 


4.5 


Primary Trl rest |48.0 

i 


Small airway epithelium TNFalpha 
+ IL-1 beta 


12.9 


CD45RA CD4 
lymphocyte act 


21.2 


Coronery artery SMC rest 


9.3 


CD45RO CD4 
lymphocyte act 


46.3 


Coronery artery SMC TNFalpha + 
IL-1 beta 


11.2 


CD8 lymphocyte act 


28.7 


Astrocytes rest 


11.1 


Secondary CD8 
lymphocyte rest 


27.7 


A^trnnvfpQ TNIFalnfiJi 4- II -1 hf»ta 

Aoll ULjr IC5 1 LNralLJJld ' lb 1 ucia 


to Q 


Secondary CD8 
lymphocyte act 


27.7 


KU-812 (Basophil) rest 


39.0 


CD4 lymphocyte none 


16.8 


KU-812 (Basophil) 
PMA/ionomycin 


52.9 


2ryThl/Th2/Trl anti- 
CD95CH11 


52.1 


CCD 1 1 06 (Keratinocytes) none 


13.0 


LAK cells rest 


34.2 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-1 beta 


14.3 


LAK cells IL-2 


37.4 


Liver cirrhosis 


7.2 


LAK cells IL-2+IL-12 


22.4 


NCI-H292 none 


11.2 


LAK cells 1L-2+IFN 
gamma 


36.1 


NCI-H292 IL-4 


23.8 


LAK cells IL-2+IL-18 


41.8 


NCI-H292 IL-9 


21.3 
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ceus 
PMA/ionomycin 


39.8 


NCI-H292 IL-13 


21.9 


XIV /"Vile TT O re*c*i- 

injv i^eus iLr-z rest 


54.7 


NCI-H292 IFN gamma 


16.7 


i wo way iviLrK J oay 


4 J. 2 


HPAEC none 


19.3 


i wo w ay ivi lk j day 


A A O 

44.5 


HPAEC TNF alpha + IL-1 beta 


35.4 


i wo way iviLK / oay 




Lung fibroblast none 


7.6 


PBMC rest 

1 


20.2 


Lung fibroblast TNF alpha + IL-1 
beta 


7.7 


PRMP pu/y 
r diviv_, r wivi 




Lung fibroblast IL-4 


7.3 


PBMC PHA-L 


33.7 


Lung fibroblast IL-9 


9.5 


Ramos (B cell) none 


75.8 


Lung fibroblast IL-13 


8.5 


Ramos (B cell) 
ionomycin 




Lung fibroblast IFN gamma 


7.6 


B lymphocytes PWM 


35.4 


Dermal fibroblast CCD 1070 rest 


17.6 


B lymphocytes CD40L 
and IL-4 


48.6 


Dermal fibroblast CCD 1070 TNF 
alpha 


52.5 


EOL-1 dbcAMP 


54.3 


Dermal fibroblast CCD1070 IL-1 
beta 


13.5 


EOL-1 dbcAMP 
r ivi a/ ionomycin 


86 -5 jDermal fibroblast I FN gamma 


15.8 


Dendritic cells none 


58.6 jDermal fibroblast IL-4 


18.4 


r» A t-i t Irk si^llr* I DC 

. L/enarmc ceils L,rb 


40.6 jDermal Fibroblasts rest J6.4 


Dendritic cells anti- 
CD40 


50.3 


Neutrophils TNFa+LPS 


28.7 


Monocytes rest 


52.1 


Neutrophils rest 


87.1 


Monocytes LPS 


100.0 


Colon 


6.9 


Macrophages rest 


38.4 


Lung 


11.6 


Macrophages LPS 25.2 


Thymus 


66.4 


HUVECnone Jl 5.8 


Kidney 


19.9 


HUVEC starved : 


21.0 






Table BHE. General nncolngy smpning pr^f | v 


2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4157,Run 
268624005 


Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
268624005 


Colon cancer 1 


12.8 ] 


Bladder cancer NAT 2 


D.4 


Colon NAT 1 


4.8 


Bladder cancer NAT 3 


1.8 


Colon cancer 2 


26.1 ~jj 


Bladder cancer NAT 4 ]< 


5.6 


Colon cancer NAT 2 


5.6 


Adenocarcinoma of the prostate 1 


54.9 


Colon cancer 3 


24.8 


Adenocarcinoma of the prostate 2 '. 


5.5 


Colon cancer NAT 3 


13.2 j 


Adenocarcinoma of the prostate 3 < 


).0 
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Colon malignant cancer 
4 


34.4 


Adenocarcinoma of the prostate 4 


34.4 


Colon normal adjacent 
tissue 4 


5.0 


Prostate cancer NAT 5 


3.6 


Lung cancer 1 


1 1.7 


Adenocarcinoma of the prostate 6 


3.8 


T m,w%n \IAT 1 

Lung INA I I 




Adenocarcinoma of the prostate 7 


5.4 


Lung cancer 2 


66.9 


Adenocarcinoma of the prostate 8 


1.9 


Lung NA I 2 


4.0 


Adenocarcinoma of the prostate 9 


17.3 


Squamous cell 
carcinoma 3 


18.7 


Prostate cancer NAT 1 0 


3.3 


Lung NAT 3 


1.1 


Kidney cancer 1 


15.2 


metastatic melanoma 1 


23.0 


KidneyNAT 1 


8.5 


Melanoma 2 


2.1 


Kidney cancer 2 


67.4 


Melanoma 3 


5.3 


Kidney NAT 2 


11.2 


metastatic melanoma 4 


92.0 


Kidney cancer 3 


12.5 


metastatic melanoma 5 


100.0 


Kidney NAT 3 


5.0 


Bladder cancer I 


2.1 


Kidney cancer 4 


16.8 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


4.2 


Bladder cancer 2 


7.1 







CNS_neurodegeneration_vl.O Summary: Ag4157 This panel confirms the 
expression of the CG99832-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 



5 experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General screening panel vl.4 Summary: Ag4157 Highest expression of the 
CG99832-01 gene is detected in breast cancer T47D cell line (CT=29). Moderate levels of 
expression of this gene is seen in cluster of cancer lines including pancreatic, CNS, colon, 
10 gastric , renal, lung, breast , ovarian, prostate, squamous cell carcinoma, and melanoma 
cell lines. Therefore, therapeutic modulation of this gene may be useful in the treatment of 
these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at 
moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
15 muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 
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Interestingly, this gene is expressed at much higher levels in fetal (CT=30) when 
compared to adult liver (CT=34.8). This observation suggests that expression of this gene 
can be used to distinguish fetal from adult liver. In addition, the relative overexpression of 
this gene in fetal liver suggests that the protein product may enhance liver or development 
in the fetus and thus may also act in a regenerative capacity in the adult. Therefore, 
therapeutic modulation of the protein encoded by this gene could be useful in treatment of 
liver related diseases. 

In addition, this gene is expressed at moderate levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4157 Highest expression of the CG99832-01 gene is 
detected in LPS treated monocytes (CT=29). This gene is expressed at high to moderate 
levels in a wide range of cell types of significance in the immune response in health and 
disease. These cells include members of the T-cell, B-cell. endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_vl.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4157 Highest 
expression of the CG99832-01 gene is detected in metastatic melanoma (CT=29). Higher 
expression of this gene is seen in cancer samples including colon cancer, kidney cancer, 
lung cancer, prostate adenocarcinoma and melanoma. Therefore, therapeutic modulation 
of this gene may be useful in the treatment of these cancers. 
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BI. CG99842-01: Tensin-Like 

Expression of gene CG99842-01 was assessed using the primer-probe set Ag4158, 
described in Table BIA. Results of the RTQ-PCR runs are shown in Tables BIB, BIC and 
5 BID. 

Table BIA . Probe Name Ag4158 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 ' -aagggataccaagtgagaaagc-3 ' 


22 


221 


422 


Probe 


TET-5 ' -ccttcagttaaacaaaggggtacatca-3 ' -TAMRA 


27 


245 


423 


Reverse 

r II 


5 • -tgttccaattgtcacctgattt-3 ' 


22 


293 


424 



Table BIB . CNSjncurodegenerationvl.O 



Tissue Name 


Re!. Exp.(%) 
Ag4158, Run 

£ I J J J IjL I 


Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 

n i mien i 


A n 1 Hinnn 


1 A 1 

IV./ 


v_,oniroi ^ratjij j lemporai v^tx 


1 1 7 
II./ 


AH? Hinnn 


?£ 1 


v^oniroi ^rain ) h l eniporai v>tx 


JO. J 


!a n i Hinnn 

!r\L/ _) nip|JU 


1 1 A 


AU 1 VJCCipiiai v-uv 


11 ft 


1 

AD 4 HiDDo 


15.3 


AH 0 Oppinital f^tv fN/Ti^^inc^ 


0 0 


jAD 5 Hippo 


84.7 


AD 3 Occipital Ctx 


9.2 


AD 6 Hippo 


92.7 


AD 4 Occipital Ctx 


13.4 


Control 2 Hippo 


26.2 


AD 5 Occipital Ctx 


20.4 


Control 4 Hippo 


25.5 


AD 6 Occipital Ctx 


27.9 


Control (Path) 3 Hippo 


11.8 


Control 1 Occipital Ctx 


11.2 


AD 1 Temporal Ctx 


14.8 


Control 2 Occipital Ctx 


39.2 


AD 2 Temporal Ctx 


35.1 


Control 3 Occipital Ctx 


31.2 


AD 3 Temporal Ctx 


11.1 


Control 4 Occipital Ctx 


7.6 i 


AD 4 Temporal Ctx 


16.0 


Control (Path) 1 Occipital Ctx 


100.0 


AD 5 Inf Temporal Ctx 


87.1 


Control (Path) 2 Occipital Ctx 


15.2 


AD 5 Sup Temporal Ctx 


44.1 


Control (Path) 3 Occipital Ctx 


6.9 


AD 6 Inf Temporal Ctx 


91.4 


Control (Path) 4 Occipital Ctx 


30.8 


AD 6 Sup Temporal Ctx 


89.5 


Control 1 Parietal Ctx 


8.2 


Control 1 Temporal Ctx 


10.8 


Control 2 Parietal Ctx 


48.3 


Control 2 Temporal Ctx 


13.8 


Control 3 Parietal Ctx 


4.9 


Control 3 Temporal Ctx 


17.7 


Control (Path) 1 Parietal Ctx 


37.1 


Control 3 Temporal Ctx 


7.6 


Control (Path) 2 Parietal Ctx 


30.8 


Control (Path) 1 


44.8 


Control (Path) 3 Parietal Ctx 


13.1 
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Temporal Ctx 








Control (Path) 2 
Temporal Ctx 


45.1 


Control (Path) 4 Parietal Ctx 


54.0 


Table BIC. General screeningjpanf*! vl d 


Tissue Name 


Rel Fxn 
Ag4l58, Run 
2211)7871 


Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
221117871 


Adipose 


8.5 


Renal ca. TK-10 


32.1 


Melanoma* Hs688(A).T 


16.7 


Bladder 


26.4 


Melanoma* Hs688(B).T 


16.3 


Gastric ca. (liver met.) NCI-N87 


56.6 


Melanoma* M14 


11.9 


Gastric ca. KATO III 


54.7 


Melanoma* LOXIMVI 


30.1 


Colon ca. SW-948 


11.4 


Melanoma* SK-MEL-5 


29.9 


Colon ca. SW480 


36.3 


Squamous cell 
carcinoma SCC-4 




Colon ca.* (SW4S0 met) SW620 


38.7 


Testis Pool 


5^0 


Colon ca. HT29 


18.7 


Prostate ca.* (bone met) 


31.4 


Colon ca. HCT-1 16 


ion n 


Prostate Pool 


7.5 


Colon ca. CaCo-2 


31.0 


Placenta 


13.5 


j - ■ ■" ■ 

.Colon cancer tissue 


[uTj ~~* 


Uterus Pool 


'6.7 


Colon ca. SW1116 


4.2 


Ovarian ca. OVCAR-3 


26.8 


Colon ca. CoIo-205 


4.1 


Ovarian ca. SK-OV-3 


36.6 


Colon ca. SW-48 


4.0 


Ovarian ca. OVCAR-4 


6.4 


Colon Pool 


15.7 


Ovarian ca. OVCAR-5 


36.6 


Small Intestine Pool 


18.4 


Ovarian ca. IGROV-1 


9.5 

• 


Stomach Pool 


11.0 


Ovarian ca. OVCAR-8 


5.0 


Bone Marrow Pool 


9.2 


Ovary 


9.7 


Fetal Heart 


5.2 


Breast ca. MCF-7 


48.6 


Heart Pool 


6.7 


Breast ca. MDA-MB- 
231 


34.6 


Lymph Node Pool 


17.7 


Breast ca. BT 549 


36.6 


Fetal Skeletal Muscle 


5.1 


Breast ca. T47D 


52.9 


Skeletal Muscle Pool 


11.2 


Breast ca. MDA-N 


10.4 


Spleen Pool 


15.4 


Breast Pool 


13.5 


Thymus Pool 


14.3 


Trachea 


13.8 


CNS cancer (glio/astro) U87-MG 


34.4 


Lung 


10.5 


CNS cancer (glio/astro) U-l 1 8- 
MG 


62.4 


Fetal Lung 


24.5 


CNS cancer (neuro;met) SK-N-AS : 


28.7 
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idling ca. iNv^i-iN^f 1 / 


1 Q 


P\|C ______ / 0 \ O T? CO A 

clNb cancer (astro) br-539 


in 0 

10.2 




o 


ClNb cancer (astro) bNr>-75 


28.1 




1 1 1 
11.1 


v^ino cancer (giio) oNJ5-iy 


a a 
O.O 


I una 


1ft 4 


uiNo cancer (gno) or-2yj 


A 1 O 

41.2 


T iino cfk A ^40 




Brain (Amygdala) Pool 


7.0 


Luna ca NCI-HS^A 


1 ft 


oram (cereoenum) 


8.2 


'Luno ca l\!CI-l-m 




p r „: n / r_ *_ 1 \ 

oram (reiaij 


18.6 


Luna ca NPI-H4fiO 


40 0 


Brain (Hippocampus) Pool 


8.7 


I una ca 


10 S 

1 U.J 


i_ereurai v_one\ rooi 


7.2 




1 7 ft 


Brain (Substantia nigra) Pool 


A O 

4.8 


I ivpr 


0.0 


Brain (Thalamus) Pool 


1 A £. 

14.6 


Ff^tJlI I ivpr 
i cial L/l vci 


12.5 


Brain (whole) 


6.4 


L/IVCI V/a. riCjJVJZ. 


18.3 


Spinal Cord Pool 


9.5 


rviuiicv ruui 


33.7 


Adrenal Gland 


12.5 


Fetal Kidney 


26.1 


Pituitary gland Pool 


3.9 


Renal ca. 786-0 


20.4 jSalivary Gland 


4.1 


Renal ca. A498 


3.5 


Thyroid (female) 


2.2 


!Renal ca. ACHN 


28.5 


Pancreatic ca. CAPAN2 


50.0 


jRenal ca. UO-31 


19.8 


Pancreas Pool 


22.4 


Table BID. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
173123945 


Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
173123945 


Secondary Thl act 


46.0 


HUVECIL-lbeta 


17.3 


Secondary Th2 act 


63.3 


HUVEC I FN gamma 


14.9 


Secondary Trl act 


35.4 


HUVEC TNF alpha + IFN gamma 


14.3 


Secondary Thl rest 


7.0 


HUVEC TNF alpha + IL4 


15.8 


Secondary Th2 rest 


13.2 


HUVEC IL- 11 


12.2 


Secondary Trl rest 


11.7 


Lung Microvascular EC none 


22.5 


Primary Thl act 


55.9 


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


27.4 


Primary Th2 act 


81.8 


Microvascular Dermal EC none 


15.7 


Primary Trl act 


51.1 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


10.0 


Primary Thl rest 


9.5 


Bronchial epithelium TNFalpha + 
ILlbeta 


12.5 


Primary Th2 rest 


5.7 


Small airway epithelium none 


5.2 


Primary Trl rest 


20.2 


Small airway epithelium 
TNFalpha + IL-1 beta 


18.0 
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CD45RA CD4 
lymphocyte act 


J 1.2 


Coronery artery SMC rest 


3.3 


CD45RO CD4 
lymphocyte act 




Coronery artery SMC TNFalpha + 
IL-1 beta 


6.9 


CD8 lymphocyte act 


43.8 


Astrocytes rest 


9.7 


Secondary CD8 
lymphocyte rest 


40.3 


Astrocytes TNFalpha + IL- 1 beta 


7.1 


Secondary CDS 
lymphocyte act 


13.7 


KU-812 (Basophil) rest 


16.8 


k^ljh lympnocyxe none 


13.0 


KU-8 12 (Basophil) 
PMA/ionomycin 




14.8 


2ryThimi2m-l anti- 
CD95 CH11 


20.6 


CCD 1 1 06 (Keratinocytes) none 


21.8 


LAK cells rest 


12.6 


CCD 11 06 (Keratinocytes) 
TNFalpha + 1L-1 beta 


JZ.U 


LAK cells IL-2 


21.8 


Liver cirrhosis 


1.4 


LAK cells IL-2+IL-I2 


20.3 


NCI-H292 none 


21.5 


LAK cells IL-2+IFN 
gamma 


l . 1 

26.2 !NCI-H292 IL-4 

1 


JJ.H 


F A \f _ 11 ii *\ \ ri to 

LAK cells IL-2+ IL-1 8 

r — 1 -— 


26.6 INCI-H292 IL-9 


43.2. 


LAK cells 
jPMA/ionomycin 


42.0 


NC1-H292IL-13 


31.2 


NK Cells IL-2 rest 


33.7 


NCI-H292 I FN gamma 


30.8 


1 wo Way MLR 3 day 


32.1 


HPAEC none 


11.1 


Two Way MLR 5 day 


24.8 


HPAEC TNF alpha + IL-1 beta 


16.8 


Two Way MLR 7 day 


18.8 


Lung fibroblast none 


12.3 


PBMC rest 


8.1 


Lung fibroblast TNF alpha + IL-1 
beta 


A 6. 


PBMC PWM 


30.4 


Lung fibroblast IL-4 


8.2 


PBMC PHA-L 


27.5 


Lung fibroblast IL-9 


13.4 


Ramos (B cell) none 


59.5 


Lung fibroblast IL-1 3 


11.8 


Ramos (B cell) 
ionomycin 


45.4 


Lung fibroblast IFN gamma 


15.5 


B lymphocytes PWM 


25.3 


Dermal fibroblast CCD 1 070 rest 


25 0 


B lymphocytes CD40L , 
and IL-4 


24.0 


Dermal fibroblast CCD 1070 TNF 
alpha 


34.9 


EOL-1 dbcAMP : 


29.5 


Dermal fibroblast CCD 1 070 IL- 1 

beta 


9.7 


EOL-1 dbcAMP 
PMA/ionomycin 


18.6 ] 


Dermal fibroblast IFN gamma j 


3.3 


Dendritic cells none 


12.1 1 


Dermal fibroblast IL-4 j 


17.8 


Dendritic cells LPS < 


h9 1 


Dermal Fibroblasts rest j: 


3.9 
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Dendritic cells anti-CD40 


14.0 


Neutrophils TNFa+LPS 


6.7 


Monocytes rest 


15.0 


Neutrophils rest 


8.4 


Monocytes LPS 


27.5 


Colon 


6.4 


Macrophages rest 


16.7 


Lung 


6.7 


Macrophages LPS 


4.4 


Thymus 


34.6 


HUVEC none 


9.2 


Kidney 


100.0 


HUVEC starved 


13.9 







CNS_neurodegeneration_vl.O Summary: Ag4158 This panel confirms the 
expression of the CG99842-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer f s diseased postmortem brains and those of non-demented controls in this 



5 experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 

treatment of central nervous system disorders. 

GeneraI_screening_paneI_vL4 Summary: Ag4158 Highest expression of the 

CG99842-01 gene is detected in colon cancer HCT-l 16 cell, line (CT=O0.7). Moderate 

expression of this gene is associated with cluster of cancer cell lines including pancreatic, 
1 0 CNS, colon, gastric, renal, lung, breast, ovarian, prostate, squamous cell carcinoma and 

melanoma cancer cell lines. Therefore, therapeutic modulation of this gene product may 

be useful in the treatment of these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at low 

to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
1 5 muscle, heart, fetal liver and the gastrointestinal tract. Therefore, therapeutic modulation 

of the activity of this gene may prove useful in the treatment of endocrine/metabolically 

related diseases, such as obesity and diabetes. 

Interestingly, this gene is expressed at much higher levels in fetal (CT=33.7) when 

compared to adult liver (CT=40). This observation suggests that expression of this gene 
20 can be used to distinguish fetal from adult liver. In addition, the relative overexpression of 

this gene in fetal liver suggests that the protein product may enhance liver growth or 

development in the fetus and thus may also act in a regenerative capacity in the adult. 

Therefore, therapeutic modulation of the protein encoded by this gene could be useful in 

treatment of liver related diseases. 
25 In addition, this gene is expressed at low to moderate levels in all regions of the 

central nervous system examined, including amygdala, hippocampus, substantia nigra, 
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thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a 
role in central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4158 Highest expression of the CG99842-01 gene is 
detected in kidney (CT=31). This gene is expressed at high to moderate levels in a wide 
range of cell types of significance in the immune response in health and disease. These 
cells include members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and 
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types 
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney. 
This ubiquitous pattern of expression suggests that this gene product may be involved in 
homeostatic processes for these and other cell types and tissues. This pattern is in 
agreement with the expression profile in General_screening_panel_vl.4 and also suggests 
a role for the gene product in cell survival and proliferation. Therefore, modulation of the 
gene product with a functional therapeutic may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 



BJ. CG99944-01: ABC Transporter 

Expression of gene CG99944-01 was assessed using the primer-probe set Ag41 84, 
described in Table BJA. Results of the RTQ-PCR runs are shown in Table BJB. 

Table B JA . Probe Name Ag4184 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
No 


Forward 


5 ' -agctaaaaggggaagacatcac-3 ' 


22 


965 


425 


Probe 


TET-5 ' -aaaacctcgaaagatcactgaacatg-3 1 -TAMRA 


26 


993 


426 


Reverse 


5 1 -cttctggcacatgtcctacac-3 ' 


21 


1019 


427 



Table BJB. General oncology screening panel _y_2.4 



Tissue Name . 


Rel. Exp.(%) 
Ag4184, Run 
268695207 


Tissue Name 


Rel. Exp.(%) ' 
Ag4184, Run 
268695207 


Colon cancer 1 


0.0 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


0.0 


Bladder cancer NAT 3 


0.0 
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10 



Colon cancer 2 Jo.O 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 2 |o.O 


Adenocarcinoma of the prostate 1 


0.0 


Colon cancer 3 jo.O 


Adenocarcinoma of the prostate 2 


0.0 


Colon cancer NAT 3 jo.O 


Adenocarcinoma of the prostate 3 


0.0 


Colon malignant cancer 

;4 


0.0 


Adenocarcinoma of the prostate 4 


0.0 


'Colon normal adjacent 
; tissue 4 


0.0 


Prostate cancer NATS 


0.0 


Lung cancer 1 


0.0 


Adenocarcinoma of the prostate 6 


0.0 


;Lung NAT 1 |0.0 [Adenocarcinoma of the prostate 7 


0.0 


Lung cancer 2 jo.O J Adenocarcinoma of the prostate 8 


0.0 


Lung NAT 2 jo.O (Adenocarcinoma of the prostate 9 


0.0 


'Squamous cell 
Jcarcinoma 3 


0.0 


Prostate cancer NAT 10 


0.0 


■Lung NAT 3 


0.0 


Kidney cancer 1 * 


ft ft 


Imetastatic melanoma 1 


0.0 


KidneyNAT 1 


ft ft 


Melanoma 2 


100.0 


Kidney cancer 2 


0.0 


Melanoma 3 


0.0 ;KidneyNAT2 


0.0 


metastatic melanoma 4 


— 1~ 

0.0 ;Kidney cancer 3 


0.0 


imetastatic melanoma 5 


0.0 jKidneyNAT3 


bTo 


.Bladder cancer 1 


0.0 


Kidney cancer 4 


0.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 |o.O 


Bladder cancer 2 


0.0 


i 



with the CG99944-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

Generalscreeningjjanel vl.4 Summary: Ag4184 Expression of the CG99944- 
01 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

Panel 4.1D Summary: Ag4I84 Expression of the CG99944-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag41 84 Highest 
expression of the CG99944-01 gene is detected exclusively in a melanoma sample 
(CT=34). Therefore, expression of this gene can be used to distinguish this sample from 
other samples used in this panel and therapeutic modulation of this gene product may be 
beneficial in the treatment of melanoma. 



731 



WO 03/010327 



PCTVUS02/14199 



BK. CG99963-01: Cyclophilin 18 

Expression of gene CG99963-01 was assessed using the primer-probe set Ag4160, 
described in Table BKA. Results of the RTQ-PCR runs are shown in Tables BKB, BKC, 
5 BKD and BKE. 



Table BKA . Probe Name Ag4160 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 1 -ggcaagaccagcaagaagat-3 ' 


20 


545 


428 


Probe 


TET-5 ' -caccattgctgactgtggacaactct-3 ' -TAMRA 


26 


565 


429 


Reverse 


5 1 -aaaggaatggtctggtggtt-3 ' 


20 


617 


430 



Table BKB . CNS_neurodegeneration_vLO 



Tissue Name 


Rel. Exp.(%) 
Ag4160, Run 


Tissue Name 


! Rel. Exp.(%) 
|Ag4160, Run 
215342806 


AD 1 Hinno 


1 1 A 

I J. 4 


iControl (Path) 3 Temporal Ctx j6.5 


AD 0 Wirmn 


A 

JU.4 


jControl (Path) 4 Temporal Ctx 


|35.6 


AH ^ Hinnn 


Q A 
O.U 


;AD 1 Occipital Ctx 


120.0 


AD 4 Hippo 


8.5 


jAD 2 Occipital Ctx (Missing) 


lo.o 


AD 5 Hippo 


84.7 


AD 3 Occipital Ctx 


|6.0 


AD 6 Hippo 


37.9 


AD 4 Occipital Ctx 


|25.5 


Control 2 Hippo 


64.2 


AD 5 Occipital Ctx 


84.7 


Control 4 Hippo 


7.9 


AD 6 Occipital Ctx 


29.3 


Control (Path) 3 Hippo 


8.0 


Control 1 Occipital Ctx 


4.4 


AD 1 Temporal Ctx 


13.5 


Control 2 Occipital Ctx 


98.6 


AD 2 Temporal Ctx 


40.1 


Control 3 Occipital Ctx 


14.9 


AD 3 Temporal Ctx 


5.0 


Control 4 Occipital Ctx 


8.1 


AD 4 Temporal Ctx 


23.5 


Control (Path) 1 Occipital Ctx 


100.0 


AD 5 Inf Temporal Ctx 


71.2 


Control (Path) 2 Occipital Ctx 


11.5 


AD 5 Sup Temporal Ctx 


26.1 


Control (Path) 3 Occipital Ctx 


3.7 


AD 6 Inf Temporal Ctx 


41.5 


Control (Path) 4 Occipital Ctx 


11.3 


AD 6 Sup Temporal Ctx 


39.2 


Control I Parietal Ctx 


7.0 


Control 1 Temporal Ctx 


5.6 


Control 2 Parietal Ctx 


25.0 


Control 2 Temporal Ctx 


65.1 


Control 3 Parietal Ctx 


23.3 


Control 3 Temporal Ctx 


18.8 


Control (Path) ] Parietal Ctx 


94.0 


Control 3 Temporal Ctx 


5.6 


Control (Path) 2 Parietal Ctx 


26.2 


Control (Path) 1 i 


57.4 


Control (Path) 3 Parietal Ctx 


4.7 
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Temporal Ctx 








Control (Path) 2 
Temporal Ctx 


45.4 


Control (Path) 4 Parietal Ctx 


49.3 



Table BKC . General screening panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4160, Run 
221297229 


Tissue Name 


Rel. Exp.(%) 
Ag4160, Run 
221297229 


| Adipose 


5.0 


Renal ca. TK-10 


25.3 


Melanoma* Hs688(A)T 


20.9 


Bladder 


13.5 


Melanoma* Hs688(B).T 


16.6 


Gastric ca. (liver met.) NCI-N87 


20.6 


Melanoma* Ml 4 


31.6 


Gastric ca. KATO III 


75.8 


Melanoma* LOX1MVI 


25.5 


Colon ca. SW-948 


14.7 


Melanoma* SK-MEL-5 


52.9 


Colon ca. SW480 


77.4 


Squamous cell carcinoma 
SCC-4 


27.0 


Colon ca * ( SW480 inert SW620 


33 9 

JJ.7 


jTestis Pool 


2.7 


Colon ca. HT29 


30.8 


iProstate ca.* (bone met) 

i — 


25.0 


Colon ca. HCT-116 


66.4 


iPrnctatf 1 ' Pr>r»l 
riuoiflic ruui 


^ 7 


l^OiOIl Ca. V_aV^O-Z 


77 A 


r laccilla 


1 7 


L-Oion cancer tissue 


1 A A 


! Ifprn^ Pnnl 




v^oion ca. jwi no 


a n 

O.v 


Ovarian ra OVPAR-^ 

vvfll lu.ll la. \J V V_^.r\I\. _7 


16 7 


t^oion Ca. V^OIO-ZU-> 


7 R 


Ovarian ra ^K-OV-1 

\y v ai iai i va> jiv w v J . 


1 R 0 

l O.V/ 


V^UIUIJ Cd. kJ W ~H 0 


1 j.Z 


Ovarian ca OVCAR-4 


14 1 


PnJon Pnnl 


R 1 


Ovarian ca OVCAR-S 

Well ICLJI vd> W V V<AI\ J 


S2 S 


Q rn^ 1 1 Intf*ctin#» Pnnl 
oiiia.il u i ico 1 1 1 ic i Uvi 


1 0 


Ovarian ca TGROV-1 


97 4 


OLUIUaCIl ruui 


1 .v) 


Ovarian ca. OVCAR-8 


25.5 


Bone Marrow Pool 


3.7 


Ovary 


5.8 


Fetal Heart 


5.5 


Breast ca. MCF-7 


29.5 


Heart Pool 


4.2 


Breast ca. MDA-MB-23 1 


60.3 


Lymph Node Pool 


5.1 


Breast ca. BT 549 


45.4 


Fetal Skeletal Muscle 


4.2 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


1.9 


Breast ca. MDA-N 


14.5 


Spleen Pool 


2.6 


Breast Pool 


3.9 


Thymus Pool 


6.7 


Trachea 


4.8 


CNS cancer (glio/astro) U87-MG 


40.9 


Lung 


1.9 


CNS cancer (glio/astro) U- 11 8- 
MG 


54.7 


Fetal Lung 


11.2 


CNS cancer (neuro;met) SK-N-AS 


30.4 


Lungca. NCI-N417 


10.4 


CNS cancer (astro) SF-539 


28.7 
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I. una Pii f Y-1 


Jl.U 


CNS cancer (astro) SNB-75 


51.4 


idling ^a. ixv^i-n i ho 


Q A 

y.v 


CNS cancer (glio) SNB-19 


16.2 


Lune ca 




CNS cancer (glio) SF-295 


28.9 


Lune ca AS40 


ZO.o 


Brain (Amygdala) Pool 


7.2 


Luns ca 


in 7 


Brain (cerebellum) 


10.3 


Lun? ca NCLH7"} 




Brain (fetal) 


9.2 


Luno ca NPf-H4£0 


j9.3 


Brain (Hippocampus) Pool 


6.7 


Lun<* ca HOP-69 


1 J.S 


Cerebral Cortex Pool |9.4 


Luna ca NPI-H^")^ 




Brain (Substantia nigra) Pool 


7.9 


Liver 


i .0 


Brain (Thalamus) Pool 


6.8 




1 1 o 

1 1.8 


Brain (whole) 


6.3 


j^ivci td. ncpuz 


9.6 


Spinal Cord Pool 


3.3 


iviuiiw ruvji 




Adrenal Gland 


9,1 


Fetal Kidney 


11.3 


Pituitary gland Pool 


1.9 


Renal ca. 786-0 


20.0 


Salivary Gland 


3.7 


Renal ca. A498 


8.7 


Thyroid (female) 


j.8 


Renal ca. ACHN 


13.6 


Pancreatic ca. CAPAN2 


1 A O 

j0.8 


Renal ca. UO-3 1 


21.6 


Pancreas Pool 


10.4 


Table BKD. Panel 4.1 D 




Tissue Name 


Rel. Exp.(%) 
Ag4l60, Run 
173118878 


Tissue Name ' 


Rel E\d 
Ag4l60, Run 
173118878 


Secondary Thl act 


57.8 


HUVECIL-lbeta 


70.7 


Secondary Th2 act 


48.3 


HUVECIFN gamma 


51.4 


Secondary Trl act 


53.6 


HUVEC TNF alpha + IFN gamma; 


31. .4 


Secondary Thl rest 


12.3 


HUVECTNF alpha + IL4 


36.6 


Secondary Th2 rest 


13.0 


HUVEC IL-1 1 


30.4 


Secondary Trl rest 


14.9 


Lung Microvascular EC none 


58.2 


Primary Thl act 


58.2 1 


Lung Microvascular EC TNFalpha 
*■ IL-1 beta 


48.3 


Primary Th2 act < 


54.2 ] 


Microvascular Dermal EC none j 


43.2 


Primary Trl act ( 


58.3 ! 


vlicrosvasular Dermal EC 
rNFalpha + IL-1 beta 


39.8 


Primary Thl rest ; 


: 


bronchial epithelium TNFalpha + 
LI beta 


J7.4 


Primary Th2 rest ] 


4.5 J 


Small airway epithelium none j: 


>2.8 


Primary Trl rest 2 


' 9 * 5 \ 


Jmall airway epithelium j 
rNFalpha* IL-1 beta f 


12.3 


CD45RA CD4 lymphocyte J47.6 C 


^oronery artery SMC rest j' 


13.0 
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act 








CD45RO CD4 lymphocyte 
act 


77.4 


Coronery artery SMC TNFalpha + 
IL-lbeta 


27.4 


CD8 lymphocyte act 


69.7 


Astrocytes rest 


18.4 


Secondary CD8 
lymphocyte rest 


53.6 


Astrocytes TNFalpha + IL-lbeta 


17.8 


Secondary CDS 
lymphocyte act 


24.0 


KU-812 (Basophil) rest 


43.8 


CD4 lymphocyte none 


6.7 


KU-8 12 (Basophil) 
PMA/ionomycin 


49.3 


2ryThl/Th2/Trl anti- 
CD95 CH11 


24.8 


CCD1 106 (Keratinocytes) none 


51.1 


LAK cells rest 


28.7 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


49.0 


LAK cells IL-2 


31.2 


Liver cirrhosis 


7.6 


LAK cells IL-2+IL-12 


26.1 


NCI-H292 none 


71.2 


LAK cells IL-2+IFN 
gamma 


28.7 


NCI-H292 IL-4 


90.8 


LAK cells IL-2+IL-18 


26.8 


NCI-H292 IL-9 


100.0 


LAK cells 
PMA/ionomycin 


38.4 


NCI-H292 IL-13 


85.9 


i - - — 1 

INK Cells IL-2 rest 


28.7 


NCI-H292 I FN gamma 


71.7 


r 

|Two Way MLR 3 day 


26.1 


HPAEC none 


33.0 


Two Way MLR 5 day 


44.4 


HPAEC TNF alpha + IL-1 beta 


49.3 


Two Way MLR 7 day 


30.4 


Lung fibroblast none 


29.3 


PBMC rest 


9.6 


Lung fibroblast TNF alpha + IL-1 \ 

UCla 


20.4 


PBMC PWM 


53.2 


Lung fibroblast IL-4 


31.2 


PBMC PHA-L 


47.0 


Lung fibroblast IL-9 


52.5 


Ramos (B cell) none 


82.9 


Lung fibroblast IL-13 


28.7 


Ramos (B cell) ionomycin 


81.2 


Lung fibroblast I FN gamma \ 


43.5 


B lymphocytes PWM 


43.2 


Dermal fibroblast CCD 1070 rest 


47.0 


B lymphocytes CD40L and 
IL-4 


37.6 


Dermal fibroblast CCD 1070 TNF 
alpha 


62.4 


EOL-1 dbcAMP 


36.9 


Dermal fibroblast CCD1070 IL-1 
beta 


33.2 


EOL-1 dbcAMP 
PMA/ionomycin 


28.5 


Dermal fibroblast IFN gamma . 


30.6 


Dendritic cells none 


39.5 


Dermal fibroblast IL-4 


56.6 


Dendritic cells LPS 


27.7 


Dermal Fibroblasts rest 


42.3 


Dendritic cells anti-CD40 


26.1 


Neutrophils TNFa+LPS 


3.7 
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[Monocytes rest 


1 1 7 
11-/ 


Neutrophils rest 


5.4 


(Monocytes LPS 


14.9 


Colon 


7.1 


[Macrophages rest 


37.4 


Lung 


13.1 


[Macrophages LPS 


18.3 


Thymus 


20.7 


HUVEC none 


37.9 


Kidney 


40.1 


HUVEC starved 


51.1 


--• | 



Table BKE . General oncology screening panel _v 2.4 



Tissue Name 


Rei. Exp.(%) 
Ag4l60, Run 
268624163 


Tissue Name 


Rei. Exp.(%) 

Ao4160 Rim 

268624163 


Colon cancer 1 


32.5 


Bladder cancer NAT 2 


0.5 


Colon cancer NAT I 


11.6 


Bladder cancer NAT 3 


0.3 


Colon cancer 2 


52.9 {Bladder cancer NAT 4 


6.2 


Colon cancer NAT 2 


16.6 


Adenocarcinoma of the prostate 1 


22.7- 


Colon cancer 3 


85.9 


Adenocarcinoma of the prostate 2 


1.6 


Colon cancer NAT 3 


19.1 


Adenocarcinoma of the prostate 3 


9.0 


Colon malignant cancer 4 


100.0 


Adenocarcinoma of the prostate 4 


25.5 


Colon normal adjacent 
tissue 4 


10.0 


Prostate cancer NAT 5 


4.2 


Lung cancer 1 


39.2 (Adenocarcinoma of the prostate 6 


6.0 


Lung in A 1 1 


08 ; Adenocarcinoma of the prostate 7 


6.4 


Lung cancer 2 


*' ZA Adenocarcinoma of the prostate 8 


2.2 


Lung NAT 2 


1 y ^Adenocarcinoma of the prostate 9 


27.5 


Squamous cell carcinoma 
3 * 


18.3 

— 


Prostate cancer NAT 1 0 


0.9 


Lung NAT 3 




Kidney cancer 1 




metastatic melanoma 1 


36.6 


KidneyNAT 1 


5.1 


Melanoma 2 


5.3 j 


Kidney cancer 2 


59.0 


Melanoma 3 < 


5-7 (Kidney NAT 2 J 


15.2 


metastatic melanoma 4 ; 


>2.5 ] 


Sidney cancer 3 


>0.6 


metastatic melanoma 5 
Bladder cancer 1 ( 


72.7 ] 
).6 I 


Sidney NAT 3 t 


1.9 


Bladder cancer NAT 1 ( 


1.0 I 


Cidney cancer 4 ; 
Cidney NAT 4 ] 


>0.0 
1.0 


Bladder cancer 2 ] 

CNS n Pll rnrl Prrom 


0.2 







expression of the CG99963-01 gene at low levels in the brain in an independent group of 
individuals. This gene is found to be slightly down-regulated in the temporal cortex of 
Alzheimer's disease patients. Therefore, up-regnlation of this gene or its protein product, 
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or treatment with specific agonists for this protein may be useful in reversing the neuronal 
death and dementia/memory loss associated with this disease. 

The CG99963-01 gene codes for cyclophilin 18 (Cyclophilin A; CyP-A) homolog. 
Cyp-A, a soluble cytoplasmic immunophilin, is known for its involvement in T cell 
5 differentiation and proliferation. Although CyP-A has a pivotal role in the immune 
response, it is most highly concentrated in brain. It is known to play a role in neuronal 
differentiation and proliferation of human embryonic brain cells (Nahreini et al., 2001, 
Cell Mol Neurobiol 21(l):65-79, PMID: 1 1440199). Therefore, therapeutic modulation of 
Cyp-A like protein encoded by this gene may be useful in treatment of neurological 
10 disorders. 

General_screening^panel_vl.4 Summary: Ag4160 Highest expression of the 
CG99963-01 gene is detected in a breast cancer T47D cell line (CT=21). High expression 
of this gene is also seen in cluster of cancer cell lines including melanoma, squamous cell 
carcinoma, pancreatic, CNS, colon, gastric, renal, breast, ovarian, and prostate cancer cell 

15 lines. Therefore, therapeutic modulation of this gene through the use of small molecule 
drug may be beneficial in the treatment of these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 

20 activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 

25 central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4160 Highest expression of the CG99963-01 gene is 
detected in IL-9 treated NCI-H292 cells (CT=23). This gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 

30 health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 

macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
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lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_vl .4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4160 Highest 
expression of the CG99963-01 gene is detected in malignant colon cancer (CT=23). In 
addition, high expression of this gene is also detected in number of cancer samples 
including squamous cell care 
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OTHER EMBODIMENTS 

Although particular embodiments have been disclosed herein in detail, this has been done 
by way of example for purposes of illustration only, and is not intended to be limiting with 
respect to the scope of the appended claims, which follow. In particular, it is 
contemplated by the inventors that various substitutions, alterations, and modifications 
may be made to the invention without departing from the spirit and scope of the invention 
as defined by the claims. The choice of nucleic acid starting material, clone of interest, or 
library type is believed to be a matter of routine for a person of ordinary skill in the art 
with knowledge of the embodiments described herein. Other aspects, advantages, and 
modifications considered to be within the scope of the following claims. The claims 
presented are representative of the inventions disclosed herein. Other, unclaimed 
inventions are also contemplated. Applicants reserve the right to pursue such inventions 
in later claims. 
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What is claimed is: 
What is claimed is: 

1 . An isolated polypeptide comprising the mature form of an amino acid sequenced 
selected from the group consisting of SEQ ID NO:2n, wherein n is an integer 
between 1 and 101 

2. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO:2n, wherein n is an integer between 1 and 101. 

3. An isolated polypeptide comprising an amino acid sequence which is at least 95% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:2n r wherein n is an integer between 1 and 101. 

4. An isolated polypeptide, wherein the polypeptide comprises an amino acid 
sequence comprising one or more conservative substitutions in the amino acid 
sequence selected from the group consisting of SEQ ID NO:2n, wherein n is an 
integer between 1 and 101 

5. The polypeptide of claim 1 wherein said polypeptide is naturally occurring. 

6. A composition comprising the polypeptide of claim 1 and a carrier. 

7. A kit comprising, in one or more containers, the composition of claim 6. 

8. The use of a therapeutic in the manufacture of a medicament for treating a 
syndrome associated with a human disease, the disease selected from a pathology 
associated with the polypeptide of claim 1, wherein the therapeutic comprises the 
polypeptide of claim 1. 
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9. A method for determining the presence or amount of the polypeptide of claim 1 in 
a sample, the method comprising: 

(a) providing said sample; 

(b) introducing said sample to an antibody that binds immunospecifically to the 
polypeptide; and 

(c) determining the presence or amount of antibody bound to said polypeptide, 
thereby determining the presence or amount of polypeptide in said sample. 

10. A method for determining the presence of or predisposition to a disease associated 
with altered levels of expression of the polypeptide of claim 1 in a first mammalian 
subject, the method comprising: 

a) measuring the level of expression of the polypeptide in a sample from the 
first mammalian subject; and 

b) comparing the expression of said polypeptide in the sample of step (a) to 
the expression of the polypeptide present in a control sample from a second 
mammalian subject known not to have, or not to be predisposed to, said 
disease, 

wherein an alteration in the level of expression of the polypeptide in the first subject as 
compared to the control sample indicates the presence of or predisposition to said disease. 

11. A method of identifying an agent that binds to the polypeptide of claim 1 , the 
method comprising: 

(a) introducing said polypeptide to said agent; and 

(b) determining whether said agent binds to said polypeptide. 

12. The method of claim 1 1 wherein the agent is a cellular receptor or a downstream 
. effector. 

13. A method for identifying a potential therapeutic agent for use in treatment of a 
pathology; wherein the pathology is related to aberrant expression or aberrant 
physiological interactions of the polypeptide of claim 1 3 the method comprising: 
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(a) providing a cell expressing the polypeptide of claim 1 and having a 
property or function ascribable to the polypeptide; 

(b) contacting the cell with a composition comprising a candidate substance; 
and 

(c) determining whether the substance alters the property or function ascribable 
to the polypeptide; 

whereby, if an alteration observed in the presence of the substance is not observed when 
the cell is contacted with a composition in the absence of the substance, the substance is 
identified as a potential therapeutic agent. 



14. 



A method for screening for a modulator of activity of or of latency or 
predisposition to a pathology associated with the polypeptide of claim 1, said 
method comprising: 

(a) administering a test compound to a test animal at increased risk for a 
pathology associated with the polypeptide of claim 1 , wherein said test 
animal recombinantly expresses the polypeptide of claim 1: 

(b) measuring the activity of said polypeptide in said test animal after 
administering the compound of step (a); and 

(c) comparing the activity of said polypeptide in said test animal with the 
activity of said polypeptide in a control animal not administered said 
polypeptide, wherein a change in the activity of said polypeptide in said test 
animal relative to said control animal indicates the test compound is a 
modulator activity of or latency or predisposition to, a pathology associated 
with the polypeptide of claim 1 . 



15. The method of claim 14, wherein said test animal is a recombinant test animal that 
expresses a test protein transgene or expresses said transgene under the control of a 
promoter at an increased level relative to a wild-type test animal, and wherein said 
promoter is not the native gene promoter of said transgene. 

16. A method for modulating the activity of the polypeptide of claim 1, the method 
comprising contacting a cell sample expressing the polypeptide of claim 1 with a 
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compound that binds to said polypeptide in an amount sufficient to modulate the 
activity of the polypeptide. 

1 7. A method of treating or preventing a pathology associated with the polypeptide of 
claim 1, the method comprising administering the polypeptide of claim 1 to a 
subject in which such treatment or prevention is desired in an amount sufficient to 
treat or prevent the pathology in the subject. 

18. The method of claim 17, wherein the subject is a human. 

1 9. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal a polypeptide in an amount that is sufficient to 
alleviate the pathological state, wherein the polypeptide is a polypeptide having an 
amino acid sequence at least 95% identical to a polypeptide comprising the amino 
acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n is 
an integer between 1 and 101 or a biologically active fragment thereof. 

20. An isolated nucleic acid molecule comprising a nucleic acid sequence selected 
from the group consisting of SEQ ID NO:2n-l, wherein n is an integer between 1 
and 101. 

2 1 . The nucleic acid molecule of claim 20, wherein the nucleic acid molecule is 
naturally occurring. 

22. A nucleic acid molecule, wherein the nucleic acid molecule differs by a single 
nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
ID NO: 2n-l, wherein n is an integer between 1 and 101 . 

23. An isolated nucleic acid molecule encoding the mature form of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID 
NO:2n, wherein n is an integer between 1 and 101 . 
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24. An isolated nucleic acid molecule comprising a nucleic acid selected from the 
group consisting of 2n-l, wherein n is an integer between 1 and 101. 

25. The nucleic acid molecule of claim 20, wherein said nucleic acid molecule 
hybridizes under stringent conditions to the nucleotide sequence selected from the 
group consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 101. 
or a complement of said nucleotide sequence. 

26. A vector comprising the nucleic acid molecule of claim 20. 

27. The vector of claim 26, further comprising a promoter operably linked to said 
nucleic acid molecule. 

28. A cell comprising the vector of claim 26. 

29. An antibody that immunospecifically binds to the polypeptide of claim 1 . 

30. The antibody of claim 29, wherein the antibody is a monoclonal antibody. 

31. The antibody of claim 29, wherein the antibody is a humanized antibody. 

32. A method for determining the presence or amount of the nucleic acid molecule of 
claim 20 in a sample, the method comprising: 

(a) providing said sample; 

(b) introducing said sample to a probe that binds to said nucleic acid molecule; 

and 

(c) determining the presence or amount of said probe bound to said nucleic 
acid molecule, 

thereby determining the presence or amount of the nucleic acid molecule in said sample. 

33. The method of claim 32 wherein presence or amount of the nucleic acid molecule 
is used as a marker for cell or tissue type. 
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34. The method of claim 33 wherein the cell or tissue type is cancerous. 

35. A method for determining the presence of or predisposition to a disease associated 
with altered levels of expression of the nucleic acid molecule of claim 20 in a first 
mammalian subject, the method comprising: 

a) measuring the level of expression of the nucleic acid in a sample from the 
first mammalian subject; and 

b) comparing the level of expression of said nucleic acid in the sample of step 
(a) to the level of expression of the nucleic acid present in a control sample 
from a second mammalian subject known not to have or not be predisposed 
to, the disease; 

wherein an alteration in the level of expression of the nucleic acid in the first subject as 
compared to the control sample indicates the presence of or predisposition to the disease. 

36. A method of producing the polypeptide of claim 1 , the method comprising 
culturing a cell under conditions that lead to expression of the polypeptide, wherein 
said cell comprises a vector comprising an isolated nucleic acid molecule 
comprising a nucleic acid sequence selected from the group consisting of SEQ ID 
NO:2n-l, wherein n is an integer between 1 and 101. 

37. The method of claim 36 wherein the cell is a bacterial cell. 

38. The method of claim 36 wherein the cell is an insect cell. 

39. The method of claim 36 wherein the cell is a yeast cell. 

40. The method of claim 36 wherein the cell is a mammalian cell. 

41. A method of producing the polypeptide of claim 2, the method comprising 
culturing a cell under conditions that lead to expression of the polypeptide, wherein 
said cell comprises a vector comprising an isolated nucleic acid molecule 
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comprising a nucleic acid sequence selected from the group consisting of SEQ ID 
NO:2n-l, wherein n is an integer between 1 and 101. 

42. The method of claim 41 wherein the cell is a bacterial cell. 

43. The method of claim 41 wherein the cell is an insect cell. 

44. The method of claim 41 wherein the cell is a yeast cell. 

45. The method of claim 41 wherein the cell is a mammalian cell. 
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